Responsibilities
- Manage and maintain cloud-based infrastructure platforms
- Design, implement, and optimize automated CI/CD workflows
- Operate containerized environments using Kubernetes and related tools
- Oversee system monitoring, logging, and response to operational incidents
- Enforce security standards and ensure compliance across infrastructure
- Work with teams to refine engineering processes and operational practices
Requirements
- Minimum of three years in DevOps, Site Reliability Engineering, or infrastructure roles
- Demonstrated skill with at least one major cloud platform (AWS, GCP, or Azure)
- Proven track record building and maintaining CI/CD systems
- Direct experience deploying and managing applications with Docker and Kubernetes
- Expertise in defining infrastructure through code using Terraform, Pulumi, or CloudFormation
- Hands-on use of observability tools including Prometheus, Grafana, Datadog, or New Relic
- Proficiency in scripting with Python, Bash, or comparable languages
- Solid grasp of cloud security principles and system reliability engineering
Nice to Have
- Background working with microservices and distributed system architectures
- Knowledge of serverless platforms such as AWS Lambda or Cloud Functions
- Experience supporting production Kubernetes clusters
- Holding relevant certifications like AWS Certified DevOps Engineer, CKA, or equivalent
- Prior experience in SaaS, fintech, healthcare, or enterprise software environments