Telemetry Data Scientist
26 days ago
Houston
Job Description Telemetry Data Scientist / Real-Time ML Engineer Division: DATUM, Impac Exploration Services Location: Remote or Oklahoma City Type: Full-Time or Contract Build Models That Run at 10,000 Feet Per Second We're not looking for someone to build another dashboard. We're looking for someone who sees streaming sensor data and thinks "I could predict that failure 30 seconds out." DATUM is where industrial operations meet cutting-edge ML. We're streaming terabytes of WITS telemetry from drilling operations—pressure, torque, vibration, flow rates—all live from assets worth millions per day. This isn't historical analysis. This is real-time ML where your models make decisions while the drill bit is turning. Here's what makes this interesting: The physics are unforgiving. The data is messy. The stakes are real. And when your anomaly detection prevents a catastrophic failure at 10,000 feet? That's the kind of win that gets noticed—by operators, by industry, and yeah, by recruiters from places you've dreamed about working. What You'll Actually Build • Streaming ML pipelines that process millions of sensor readings per minute, • Anomaly detection that understands the difference between "weird but fine" and "stop everything now", • Predictive models that see patterns in vibration harmonics humans miss, • Real-time feature engineering on streams that never stop Core stack: • Streaming: Kafka/Pulsar, Flink/Spark Streaming, or River (bonus if you've pushed these to their limits), • ML Ops: MLflow, Weights & Biases, or Kubeflow—because models in notebooks don't save rigs, • Time Series: Prophet, LSTM/GRU implementations, or custom architectures for sequential data, • Languages: Python (NumPy, Pandas, Scikit-learn) + SQL + whatever else gets the job done, • Deployment: Docker, Kubernetes, edge computing frameworks, • Physics-informed neural networks or domain-aware ML, • Experience with industrial protocols (OPC-UA, MQTT, Modbus), • Edge computing / model optimization for embedded systems, • You've built something real-time that stayed up when it mattered, • Sensor fusion excites you more than computer vision, • You can explain Kalman filters and gradient boosting to a drilling engineer, • You've debugged model drift at 3 AM because production doesn't sleep, • You've touched industrial systems, IoT, or anything with real sensors, • Built telemetry systems for fun (racing sims, drone projects, even elaborate Raspberry Pi setups), • Contributed to time-series or streaming ML libraries Small team. Big problems. No bureaucracy. We move fast, ship faster, and celebrate when our work is so good that other companies try to poach you. You'll have the compute you need, the freedom to experiment, and colleagues who push you to be better. Fair warning: This isn't a "set it and forget it" role. When a $100M operation depends on your model, you'll feel it. But when your algorithm catches something nobody else saw? That feeling is addictive. Why This Matters While everyone else fights over chatbot improvements, we're building ML for the physical world. Energy is complex, consequential, and desperately needs fresh thinking. Come help us prove that the best telemetry systems don't just live at race tracks—they're 10,000 feet underground, running 24/7, keeping people safe and operations efficient. Ready to build ML that matters? Let's talk. We are not sponsoring visas or participating in CPT programs at this time.