Don Miner, PhD

CTO

Specialties

Hadoop
Large-scale Data Science on Big Data and Cloud
Machine Learning Operations
Anomaly Detection

Education

Ph.D. Computer Science, UMBC
B.S. Computer Science, UMBC

Location

Maryland

Recent Projects

Large Clothing Company
GCP Migration and Starting Data Science Practice
Advised leadership and architected the migration from a Teradata data warehouse and other data silos to a centralized analytical warehouse in Google Cloud’s BigQuery. Designed several high-impact net new data science applications around hyper-personalization.

Railroad IT Company
Next Generation Estimated Time of Arrival System
Designed and architected a machine learning and Big Data solution on Hadoop using Spark and HBase to improve the estimated time of arrival of trains.

Technical Expertise

  • Python data stack: scikit-learn, pandas, numpy, Jupyter, others
  • Hadoop: Spark, MapReduce, Hive, HBase, Accumulo
  • Databases: PostgreSQL, Greenplum
  • Machine Learning
  • Deep Learning: CNNs and RNNs
  • Amazon Web Services: EC2, ECS, RedShift, EMR, Lambda, S3
  • Google Cloud Platform: BigTable, BigQuery, Dataproc