Role Overview
This is a senior-level position for an experienced DevOps Engineer who will take full ownership of cloud infrastructure and database operations in a distributed, remote setting. You'll be responsible for designing and maintaining a resilient, scalable cloud-native platform, with deep involvement in MongoDB Atlas, infrastructure as code, and system reliability.
Key Responsibilities
- Design and manage cloud infrastructure across GCP, AWS, or Azure using Terraform, ensuring consistency and scalability.
- Lead the production deployment, scaling, and ongoing management of MongoDB Atlas clusters.
- Ensure high availability through robust replication, failover, and disaster recovery planning.
- Optimize database performance through indexing, query analysis, and configuration tuning.
- Enforce security policies, access controls, and compliance standards across database and cloud environments.
- Architect and operate containerized and serverless workloads using technologies like Kubernetes, Cloud Run, or ECS.
- Build and maintain automated CI/CD pipelines to support rapid, reliable deployments.
- Develop reusable Terraform modules to standardize infrastructure across multiple environments.
- Implement event-driven systems using messaging platforms such as Pub/Sub, SQS/SNS, or EventBridge.
- Establish comprehensive monitoring, alerting, and observability practices across distributed services.
- Lead incident response efforts and conduct root cause analyses to prevent future issues.
- Document system designs, create operational runbooks, and define best practices for engineering teams.
- Collaborate with developers to guide architectural decisions and improve system performance.
- Mentor team members and advocate for DevOps principles across the organization.
Required Qualifications
- Minimum of six years of hands-on experience in DevOps or infrastructure engineering in production environments.
- Proven expertise with at least one major cloud provider (GCP, AWS, or Azure) using Terraform for infrastructure provisioning.
- Advanced proficiency in Terraform, including modular design, remote state management, and multi-environment deployments.
- Extensive experience operating MongoDB Atlas in production, including cluster management, replication, backup, recovery, and performance optimization.
- Strong background in container technologies such as Docker and orchestration platforms like Kubernetes.
- Experience building and maintaining CI/CD pipelines for automated software delivery.
- Familiarity with event-driven architectures and related services.
- Deep understanding of monitoring, logging, and observability in distributed systems.
- Ability to lead technical direction and make high-impact infrastructure decisions independently.
- Excellent communication skills with experience working in remote, asynchronous teams.
Preferred Qualifications
- Experience operating across multiple cloud providers.
- Knowledge of GitOps methodologies and tooling.
- Familiarity with advanced observability tools such as Datadog, APM, or distributed tracing.
- Background supporting large-scale SaaS applications.
- Interest in platform engineering and improving developer experience.
Technical Environment
Technologies in use include GCP, AWS, Azure, Terraform, MongoDB Atlas, Docker, Kubernetes, Cloud Run, ECS, Pub/Sub, SQS, SNS, EventBridge, CI/CD systems, and Datadog for monitoring.
Work Environment
This is a fully remote role in an async-first culture. Work hours are aligned to IST: 12:00 PM – 9:30 PM during summer and 1:00 PM – 10:30 PM during winter months. There is no weekend work, supporting genuine work-life balance.
Benefits
- Immediate access to full medical insurance and company-provided laptop.
- Dedicated mentorship and professional development support.
- Inclusive culture that values integrity, excellence, and long-term growth.
- Work environment that prioritizes well-being, belonging, and meaningful contributions.


