About the Role
Oversee the development and operations strategy, driving automation, continuous integration, and cloud infrastructure improvements across engineering teams.
Responsibilities
- Manage a team of DevOps engineers focused on system stability and deployment pipelines
- Design and implement scalable cloud infrastructure solutions
- Drive automation initiatives across build, test, and deployment workflows
- Collaborate with software engineers to streamline development processes
- Monitor system performance and lead incident response improvements
- Establish best practices for configuration management and infrastructure as code
- Evaluate and integrate new tools to enhance operational efficiency
- Ensure compliance with security and regulatory standards
- Lead root cause analysis for critical production incidents
- Optimize CI/CD pipelines for speed and reliability
- Coordinate with product teams to align infrastructure with roadmap goals
- Mentor team members in technical and operational excellence
- Oversee disaster recovery planning and execution
- Implement observability and monitoring frameworks
- Manage capacity planning and resource allocation
- Promote a culture of blameless postmortems and continuous improvement
- Guide cloud cost optimization strategies
- Support migration from legacy systems to modern architectures
- Foster collaboration between development and operations teams
- Lead the adoption of containerization and orchestration technologies
- Ensure high availability and fault tolerance in production systems
- Evaluate third-party service providers and APIs
- Maintain documentation standards for infrastructure and processes
- Drive on-call readiness and operational rigor
- Partner with security teams to enforce access controls and auditing
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid work model with flexible scheduling
Team
Cross-functional engineering and operations teams
About the Team
This role operates within a dedicated DevOps unit that supports multiple product teams through shared tooling, infrastructure, and best practices. The team emphasizes automation, resilience, and developer enablement.
Technology Stack
Primary cloud platform is Google Cloud Platform, with Kubernetes for orchestration, Terraform for infrastructure provisioning, and GitLab CI for pipeline management. Monitoring is handled through Prometheus and Grafana.
Available for qualified candidates