Paris; London; Montreal; New York; San Francisco; Toronto Remote (Global) Employment

Cohere is hiring a Member of Technical Staff, Training Infra Engineer

Responsibilities

  • Design and implement scalable and efficient training infrastructure.
  • Develop and maintain tools for monitoring and optimizing training processes.
  • Collaborate with data scientists and engineers to integrate training infrastructure with machine learning models.
  • Ensure the security and reliability of training infrastructure.
  • Troubleshoot and resolve issues related to training infrastructure.
  • Document training infrastructure processes and best practices.
  • Conduct performance testing and optimization of training infrastructure.
  • Stay updated with the latest advancements in training infrastructure technologies.
  • Provide technical support and guidance to team members.
  • Participate in code reviews and contribute to the improvement of training infrastructure codebase.
  • Implement automated testing and deployment pipelines for training infrastructure.
  • Work closely with cross-functional teams to understand training infrastructure requirements.
  • Develop and maintain training infrastructure documentation.
  • Implement security measures to protect training infrastructure.
  • Conduct regular audits of training infrastructure to ensure compliance with standards.
  • Develop and maintain training infrastructure dashboards for real-time monitoring.
  • Collaborate with stakeholders to define training infrastructure roadmap.
  • Implement and maintain training infrastructure logging and alerting systems.
  • Conduct training sessions and workshops for team members on training infrastructure.
  • Develop and maintain training infrastructure automation scripts.
  • Implement and maintain training infrastructure backup and recovery processes.
  • Conduct regular performance reviews of training infrastructure.
  • Collaborate with vendors and third-party service providers for training infrastructure.
  • Implement and maintain training infrastructure access controls.

Nice to Have

  • Experience with machine learning model training and deployment.
  • Knowledge of machine learning model serving and inference.
  • Experience with machine learning model optimization and tuning.
  • Knowledge of machine learning model evaluation and validation.
  • Experience with machine learning model versioning and management.
  • Knowledge of machine learning model interpretability and explainability.
  • Experience with machine learning model deployment and scaling.
  • Knowledge of machine learning model security and privacy.
  • Experience with machine learning model monitoring and logging.
  • Knowledge of machine learning model debugging and troubleshooting.
  • Experience with machine learning model performance testing and optimization.
  • Knowledge of machine learning model lifecycle management.
  • Experience with machine learning model deployment and management tools.
  • Knowledge of machine learning model deployment and management best practices.
  • Experience with machine learning model deployment and management frameworks.
  • Knowledge of machine learning model deployment and management standards.
  • Experience with machine learning model deployment and management tools.
  • Knowledge of machine learning model deployment and management best practices.
  • Experience with machine learning model deployment and management frameworks.
  • Knowledge of machine learning model deployment and management standards.

Compensation

Competitive salary and equity

Work Arrangement

On-site

Team

Collaborate with a team of engineers and data scientists

What You'll Do

  • Design and implement scalable and efficient training infrastructure.
  • Develop and maintain tools for monitoring and optimizing training processes.
  • Collaborate with data scientists and engineers to integrate training infrastructure with machine learning models.
  • Ensure the security and reliability of training infrastructure.
  • Troubleshoot and resolve issues related to training infrastructure.
  • Document training infrastructure processes and best practices.
  • Conduct performance testing and optimization of training infrastructure.
  • Stay updated with the latest advancements in training infrastructure technologies.
  • Provide technical support and guidance to team members.
  • Participate in code reviews and contribute to the improvement of training infrastructure codebase.

What You'll Need

  • Proven experience in designing and implementing training infrastructure.
  • Strong knowledge of machine learning and deep learning frameworks.
  • Experience with cloud platforms such as AWS, GCP, or Azure.
  • Proficiency in programming languages such as Python, Java, or C++.
  • Experience with containerization technologies such as Docker and Kubernetes.
  • Strong problem-solving and troubleshooting skills.
  • Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation.
  • Knowledge of CI/CD pipelines and automated testing.
  • Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK Stack.
  • Strong communication and collaboration skills.

Nice to Have

  • Experience with machine learning model training and deployment.
  • Knowledge of machine learning model serving and inference.
  • Experience with machine learning model optimization and tuning.
  • Knowledge of machine learning model evaluation and validation.
  • Experience with machine learning model versioning and management.
  • Knowledge of machine learning model interpretability and explainability.
  • Experience with machine learning model deployment and scaling.
  • Knowledge of machine learning model security and privacy.
  • Experience with machine learning model monitoring and logging.
  • Knowledge of machine learning model debugging and troubleshooting.

Our Benefits

  • Competitive salary and equity
  • Health, dental, and vision insurance
  • 401(k) retirement plan
  • Unlimited vacation time
  • Flexible work hours
  • Remote work options
  • Professional development opportunities
  • Employee assistance programs
  • Tuition reimbursement
  • Employee referral bonuses
  • Performance bonuses
  • Stock options
  • Employee discounts
  • Wellness programs
  • Employee resource groups
  • Diversity and inclusion initiatives
  • Community involvement opportunities
  • Employee recognition programs

Our Culture

  • Collaborative and inclusive work environment
  • Focus on innovation and continuous learning
  • Emphasis on work-life balance
  • Opportunities for career growth and development
  • Commitment to diversity, equity, and inclusion
  • Support for employee well-being and mental health
  • Encouragement of open communication and feedback
  • Recognition and reward for employee achievements
  • Opportunities for employee involvement in decision-making
  • Support for employee-led initiatives and projects

Our Values

  • Integrity and honesty
  • Respect and inclusivity
  • Innovation and creativity
  • Collaboration and teamwork
  • Customer focus and satisfaction
  • Continuous improvement and learning
  • Accountability and responsibility
  • Transparency and communication
  • Diversity and inclusion
  • Sustainability and social responsibility

Our Mission

  • To develop and deliver cutting-edge machine learning solutions
  • To empower our customers with advanced AI technologies
  • To foster a culture of innovation and continuous learning
  • To promote diversity, equity, and inclusion in the workplace
  • To contribute to the advancement of AI and machine learning
  • To provide exceptional customer service and support
  • To create a positive and inclusive work environment
  • To support the well-being and development of our employees
  • To drive business growth and success through AI and machine learning
  • To make a positive impact on society through our AI technologies

Our Vision

  • To be a global leader in AI and machine learning
  • To transform industries through innovative AI solutions
  • To create a world where AI and machine learning are accessible to all
  • To foster a culture of innovation and continuous learning
  • To promote diversity, equity, and inclusion in the workplace
  • To contribute to the advancement of AI and machine learning
  • To provide exceptional customer service and support
  • To create a positive and inclusive work environment
  • To support the well-being and development of our employees
  • To drive business growth and success through AI and machine learning

How to Apply

  • Submit your resume and cover letter through our careers portal
  • Include relevant experience and skills in your application
  • Highlight your achievements and accomplishments
  • Provide examples of your work and projects
  • Include any relevant certifications or training
  • Follow up with the hiring manager after submitting your application
  • Prepare for interviews by researching the company and role
  • Dress professionally and arrive on time for interviews
  • Ask thoughtful questions during the interview process
  • Follow up with the hiring manager after the interview

Equal Opportunity Employer

  • We are an equal opportunity employer and welcome applicants from all backgrounds
  • We do not discriminate based on race, color, religion, sex, national origin, age, disability, or any other protected characteristic
  • We are committed to creating a diverse and inclusive workplace
  • We encourage applicants from underrepresented groups to apply
  • We provide reasonable accommodations for applicants with disabilities
  • We comply with all applicable laws and regulations related to equal employment opportunity
  • We promote a culture of respect and inclusivity in the workplace
  • We value diversity and believe it strengthens our organization
  • We are committed to fostering a positive and inclusive work environment
  • We encourage open communication and feedback from all employees

Visa sponsorship available for eligible candidates

Starting a business in Thailand?

Company registration done right

Foreign ownership rules, licenses, tax registration — Thai business setup has many moving parts. SVBL guides you through every step with full legal compliance.

Company registration & structure
Foreign ownership solutions
License & tax registration
BOI promotion eligibility
Start your business
100% foreign ownership possible
About company
Cohere
Cohere trains and deploys frontier AI models for developers and enterprises building systems for content generation, semantic search, RAG, and agents.
All jobs at Cohere Visit website
Job Details
Department Modeling
Category other
Posted 2 hours ago