Pathway is looking for a Machine Learning DevOps Engineer to focus on operationalizing machine learning models and ensuring scalability, reliability, and automation across the ML lifecycle. This role involves managing cloud and compute cluster processes and scaling infrastructure to support a growing team and production needs.
What You'll Do
- Optimize infrastructure for ML training and inference, including GPUs and distributed compute.
- Automate and maintain ML/LLM pipelines covering data ingestion, training, validation, and deployment.
- Manage model versioning, reproducibility, and traceability.
- Work with terabyte-large datasets.
- Implement ML-centric CI/CD practices.
- Monitor model performance and data drift in production.
- Collaborate with machine learning engineers, software engineers, and platform teams.
What We're Looking For
- Experience in cloud and compute cluster management.
- Experience scaling infrastructures.
- Experience with Linux administration.
Pathway is an equal opportunity employer.






