About the Role
The role involves building and managing robust infrastructure to power data-intensive workflows and machine learning applications, with a focus on automation, system resilience, and cloud operations.
Responsibilities
- Design and implement reliable cloud infrastructure for data processing and machine learning workloads
- Develop automated deployment pipelines to streamline system updates and configuration management
- Monitor system performance and proactively address scalability and availability challenges
- Collaborate with data scientists and software engineers to support research and production systems
- Ensure infrastructure complies with security best practices and operational standards
- Troubleshoot complex issues across distributed systems and cloud environments
- Optimize resource utilization and cost efficiency in cloud platforms
- Maintain documentation for infrastructure architecture and operational procedures
- Lead initiatives to improve system reliability and reduce operational toil
- Evaluate and integrate new technologies to enhance infrastructure capabilities
- Support incident response and contribute to on-call rotation as needed
- Implement and manage containerized environments using modern orchestration tools
- Enforce consistent infrastructure configurations using infrastructure-as-code practices
- Work closely with cross-functional teams to align infrastructure with project goals
- Drive adoption of monitoring, alerting, and observability tools across services
Compensation
Competitive salary and equity package commensurate with experience
Work Arrangement
Hybrid work model with flexibility for remote and office collaboration
Team
Collaborative engineering team focused on building scalable systems for scientific and data-driven discovery
Why This Role Matters
This position plays a critical role in enabling cutting-edge research by providing the foundational systems that power data analysis and machine learning at scale.
Our Approach to Engineering
We prioritize automation, reproducibility, and operational excellence to support rapid experimentation and deployment in a research-driven environment.
Visa sponsorship available for qualified candidates