Fusemachines is looking for a Senior Machine Learning Engineer to architect, build, and deploy high-performance machine learning systems that power our technology stack. You will work across the entire ML lifecycle—from processing massive volumes of data to developing and deploying low-latency models. Our mission is democratizing AI for the masses by providing high-quality AI education in underserved communities and helping organizations achieve their full potential with AI.
What You'll Do
- Process and extract features from massive, highly sparse datasets (terabytes/petabytes of bidstream and user event data) using SQL, Python, and distributed computing frameworks like Spark and Ray.
- Architect offline and online feature pipelines, managing real-time feature computation and low-latency feature stores to ensure zero online/offline skew.
- Perform rigorous missingness analysis, leakage checks, and handle high-cardinality categorical variables safely.
- Train, tune, and scale supervised learning models, utilizing advanced gradient boosting (XGBoost, LightGBm, CatBoost) and Factorization Machines.
- Design and implement Deep Learning architectures for structured/recommendation data using PyTorch or TensorFlow.
- Apply rigorous tabular modeling practices: meticulous leakage prevention, class imbalance strategies, and robust cross-validation on time-split data.
- Write clean, object-oriented, and modular production code, transitioning models from Python research environments to high-performance serving environments (packaging with ONNX, TensorRT, etc).
- Design and maintain robust MLOps pipelines: automated model retraining, versioning, shadow deployments, and CI/CD for machine learning.
- Monitor production models for data drift, concept drift, and performance degradation in real-time, implementing automated alerting and fallback mechanisms.
- Design rigorous A/B and multivariate testing frameworks.
What We're Looking For
- A strong hybrid skill set: deep expertise in applied machine learning combined with production-grade software engineering skills.
Technical Stack
- SQL, Python, Spark, Ray
- XGBoost, LightGBM, CatBoost
- PyTorch, TensorFlow, ONNX, TensorRT
Work Mode
This is a remote position.





