Don Miner, PhD
CTO

Specialties
Hadoop
Large-scale Data Science on Big Data and Cloud
Machine Learning Operations
Anomaly Detection
Education
Ph.D. Computer Science, UMBC
B.S. Computer Science, UMBC
Location
Maryland
Recent Projects
Large Clothing Company
GCP Migration and Starting Data Science Practice
Advised leadership and architected the migration from a Teradata data warehouse and other data silos to a centralized analytical warehouse in Google Cloud’s BigQuery. Designed several high-impact, net-new data science applications around hyper-personalization.
Railroad IT Company
Next Generation Estimated Time of Arrival System
Designed and architected a machine learning and Big Data solution on Hadoop, using Spark and HBase, to improve the accuracy of estimated train arrival times.
Technical Expertise
- Python data stack: scikit-learn, pandas, NumPy, Jupyter, and others
- Hadoop: Spark, MapReduce, Hive, HBase, Accumulo
- Databases: PostgreSQL, Greenplum
- Machine Learning
- Deep Learning: CNNs and RNNs
- Amazon Web Services: EC2, ECS, Redshift, EMR, Lambda, S3
- Google Cloud Platform: Bigtable, BigQuery, Dataproc