Remote, US; DMV; McLean, VA; Boston, MA; San Antonio, TX; Colorado Springs, CO; Tampa, FL; Honolulu, HI Remote (City) Employment $150,000.00 - $200,000.00

Raft is hiring a Principal MLOps Engineer

About the Role

Oversee the design, implementation, and maintenance of machine learning operations infrastructure, ensuring seamless integration and optimal performance of machine learning models.

Responsibilities

  • Design and implement scalable machine learning pipelines.
  • Collaborate with data scientists and engineers to integrate machine learning models into production systems.
  • Develop and maintain CI/CD pipelines for machine learning models.
  • Ensure the reliability and scalability of machine learning infrastructure.
  • Monitor and optimize the performance of machine learning models in production.
  • Implement best practices for version control and reproducibility of machine learning experiments.
  • Conduct regular code reviews and provide mentorship to junior team members.
  • Stay updated with the latest advancements in machine learning and MLOps technologies.
  • Work closely with cross-functional teams to understand business requirements and translate them into technical solutions.
  • Develop and maintain documentation for machine learning operations processes and best practices.
  • Troubleshoot and resolve issues related to machine learning model deployment and performance.
  • Collaborate with DevOps teams to ensure smooth integration of machine learning models into the overall software development lifecycle.
  • Implement security measures to protect machine learning models and data.
  • Conduct performance testing and optimization of machine learning models.
  • Develop and maintain monitoring and alerting systems for machine learning models.
  • Participate in on-call rotations to ensure 24/7 support for machine learning operations.
  • Provide technical guidance and support to data scientists and engineers.
  • Develop and maintain tools for automated testing and validation of machine learning models.
  • Implement strategies for efficient data management and storage in machine learning pipelines.
  • Collaborate with stakeholders to define and implement machine learning model deployment strategies.
  • Conduct regular audits of machine learning operations processes to ensure compliance with industry standards and best practices.
  • Develop and maintain dashboards for monitoring the performance and health of machine learning models.
  • Participate in the design and implementation of machine learning model training and evaluation frameworks.
  • Collaborate with product managers to understand user needs and translate them into machine learning solutions.
  • Develop and maintain scripts for automated data preprocessing and feature engineering.
  • Conduct regular performance reviews and provide feedback to team members.
  • Implement strategies for efficient resource management in machine learning operations.
  • Develop and maintain tools for automated deployment and scaling of machine learning models.

Nice to Have

  • Experience with MLOps tools such as MLflow, Kubeflow, or Seldon.
  • Familiarity with machine learning model explainability and interpretability techniques.
  • Experience with machine learning model fairness and bias mitigation techniques.
  • Familiarity with machine learning model lifecycle management tools.
  • Experience with machine learning model deployment in edge environments.
  • Familiarity with machine learning model deployment in multi-cloud environments.
  • Experience with machine learning model deployment in hybrid cloud environments.
  • Familiarity with machine learning model deployment in on-premises environments.
  • Experience with machine learning model deployment in serverless environments.
  • Familiarity with machine learning model deployment in containerized environments.
  • Experience with machine learning model deployment in virtualized environments.
  • Familiarity with machine learning model deployment in cloud-native environments.
  • Experience with machine learning model deployment in microservices architectures.
  • Familiarity with machine learning model deployment in monolithic architectures.
  • Experience with machine learning model deployment in service-oriented architectures.
  • Familiarity with machine learning model deployment in event-driven architectures.
  • Experience with machine learning model deployment in real-time processing environments.
  • Familiarity with machine learning model deployment in batch processing environments.
  • Experience with machine learning model deployment in streaming processing environments.
  • Familiarity with machine learning model deployment in big data environments.
  • Experience with machine learning model deployment in small data environments.
  • Familiarity with machine learning model deployment in edge computing environments.
  • Experience with machine learning model deployment in fog computing environments.
  • Familiarity with machine learning model deployment in cloud computing environments.

Compensation

Competitive salary and equity

Work Arrangement

Remote

Team

Collaborate with a diverse team of data scientists, engineers, and product managers.

What You'll Do

  • Design and implement scalable machine learning pipelines.
  • Collaborate with data scientists and engineers to integrate machine learning models into production systems.
  • Develop and maintain CI/CD pipelines for machine learning models.
  • Ensure the reliability and scalability of machine learning infrastructure.
  • Monitor and optimize the performance of machine learning models in production.
  • Implement best practices for version control and reproducibility of machine learning experiments.
  • Conduct regular code reviews and provide mentorship to junior team members.
  • Stay updated with the latest advancements in machine learning and MLOps technologies.
  • Work closely with cross-functional teams to understand business requirements and translate them into technical solutions.
  • Develop and maintain documentation for machine learning operations processes and best practices.

What You'll Need

  • Proven experience as a MLOps Engineer or similar role.
  • Strong proficiency in Python and experience with machine learning frameworks such as TensorFlow and PyTorch.
  • Experience with cloud platforms such as AWS, GCP, or Azure.
  • Familiarity with containerization technologies such as Docker and Kubernetes.
  • Experience with CI/CD pipelines and tools such as Jenkins, GitLab CI, or CircleCI.
  • Strong understanding of machine learning model deployment and scaling strategies.
  • Experience with version control systems such as Git.
  • Strong problem-solving skills and ability to troubleshoot complex issues.
  • Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK Stack.
  • Strong communication and collaboration skills.
  • Experience with data preprocessing and feature engineering techniques.
  • Familiarity with machine learning model evaluation and validation techniques.
  • Experience with automated testing and validation of machine learning models.
  • Strong understanding of machine learning model training and evaluation frameworks.
  • Experience with data management and storage solutions.
  • Familiarity with security best practices for machine learning models and data.
  • Experience with performance testing and optimization of machine learning models.
  • Strong understanding of machine learning model deployment strategies.
  • Experience with on-call rotations and 24/7 support for machine learning operations.
  • Familiarity with industry standards and best practices for machine learning operations.
  • Experience with developing and maintaining documentation for machine learning operations processes.
  • Strong understanding of machine learning model version control and reproducibility.
  • Experience with developing and maintaining dashboards for monitoring machine learning model performance.
  • Familiarity with machine learning model training and evaluation frameworks.
  • Experience with developing and maintaining tools for automated deployment and scaling of machine learning models.
  • Strong understanding of machine learning model deployment and scaling strategies.

Nice to Have

  • Experience with MLOps tools such as MLflow, Kubeflow, or Seldon.
  • Familiarity with machine learning model explainability and interpretability techniques.
  • Experience with machine learning model fairness and bias mitigation techniques.
  • Familiarity with machine learning model lifecycle management tools.
  • Experience with machine learning model deployment in edge environments.
  • Familiarity with machine learning model deployment in multi-cloud environments.
  • Experience with machine learning model deployment in hybrid cloud environments.
  • Familiarity with machine learning model deployment in on-premises environments.
  • Experience with machine learning model deployment in serverless environments.
  • Familiarity with machine learning model deployment in containerized environments.
  • Experience with machine learning model deployment in virtualized environments.
  • Familiarity with machine learning model deployment in cloud-native environments.
  • Experience with machine learning model deployment in microservices architectures.
  • Familiarity with machine learning model deployment in monolithic architectures.
  • Experience with machine learning model deployment in service-oriented architectures.
  • Familiarity with machine learning model deployment in event-driven architectures.
  • Experience with machine learning model deployment in real-time processing environments.
  • Familiarity with machine learning model deployment in batch processing environments.
  • Experience with machine learning model deployment in streaming processing environments.
  • Familiarity with machine learning model deployment in big data environments.
  • Experience with machine learning model deployment in small data environments.
  • Familiarity with machine learning model deployment in edge computing environments.
  • Experience with machine learning model deployment in fog computing environments.
  • Familiarity with machine learning model deployment in cloud computing environments.

Our Benefits

  • Competitive salary and equity
  • Remote work arrangement
  • Collaborative team environment
  • Opportunities for professional growth and development
  • Flexible work hours
  • Health and wellness benefits
  • Generous time off and holidays
  • 401k matching
  • Employee assistance programs
  • Tuition reimbursement
  • Professional development opportunities
  • Employee recognition programs
  • Team-building activities
  • Performance bonuses
  • Stock options
  • Employee referral bonuses
  • Relocation assistance
  • Parental leave
  • Adoption assistance
  • Fertility benefits
  • Pet insurance
  • Commuter benefits
  • Gym membership reimbursement
  • Wellness programs
  • Mental health resources
  • Financial planning resources
  • Career coaching
  • Leadership development programs
  • Mentorship opportunities

Our Culture

  • Inclusive and diverse work environment
  • Collaborative and supportive team culture
  • Emphasis on work-life balance
  • Opportunities for continuous learning and development
  • Focus on innovation and creativity
  • Commitment to ethical and responsible AI practices
  • Encouragement of open communication and feedback
  • Support for professional growth and advancement
  • Recognition and celebration of employee achievements
  • Promotion of a healthy and positive work environment
  • Encouragement of teamwork and collaboration
  • Support for employee well-being and mental health
  • Commitment to sustainability and social responsibility
  • Promotion of a culture of continuous improvement
  • Encouragement of a growth mindset
  • Support for work-life integration
  • Promotion of a culture of inclusivity and belonging
  • Encouragement of a culture of innovation and experimentation
  • Support for employee-led initiatives and projects
  • Promotion of a culture of transparency and accountability
  • Encouragement of a culture of curiosity and learning
  • Support for employee development and growth
  • Promotion of a culture of respect and dignity
  • Encouragement of a culture of empathy and compassion
  • Support for employee engagement and satisfaction
  • Promotion of a culture of collaboration and teamwork
  • Encouragement of a culture of diversity and inclusion
  • Support for employee well-being and happiness
  • Promotion of a culture of innovation and creativity

How to Apply

  • Submit your resume and cover letter through our online application portal.
  • Include a portfolio of your previous work and any relevant projects.
  • Highlight your experience with machine learning and MLOps technologies.
  • Describe your approach to problem-solving and troubleshooting.
  • Explain your experience with cloud platforms and containerization technologies.
  • Detail your experience with CI/CD pipelines and version control systems.
  • Provide examples of your experience with monitoring and logging tools.
  • Describe your experience with data preprocessing and feature engineering techniques.
  • Explain your experience with machine learning model evaluation and validation techniques.
  • Provide examples of your experience with automated testing and validation of machine learning models.

Not provided

Landing international contracts?

Invoice globally with an EU company

GloPay creates an Estonian partnership for you automatically. Your clients get proper invoices, you keep 95% of payments. Setup takes 5 minutes, works in 100+ currencies.

EU-registered company for compliance
Multi-currency invoicing & payments
Expense tracking & tax reports
Money in your bank in 1 business day
Start invoicing free
5% per invoice • No subscriptions
About company
Raft

Raft delivers scalable AI, data integration, and resilience, enhancing mission clarity and operational success. The company builds operator-first, software-first data and AI products that adapt to the mission and conform to the operator, enabling faster, trusted decisions anywhere, anytime.

Modern warfare is software-driven, and Raft is the foundation it’s built on. Their products unify and federate data from every sensor, system, and domain into one live, operational picture, while enabling trusted human-AI partnership and scaling from the tactical edge to enterprise environments.

Founded by Shubhi Mishra, Raft focuses on solving the hardest problems for the Department of Defense and public agencies, ensuring every decision is informed, trusted, and mission-ready across complex, data-saturated environments.

All jobs at Raft Visit website
Job Details
Category other
Posted 4 hours ago