About the Role
Role details below.
Responsibilities
- Develop, enhance, and maintain CI/CD self-hosted infrastructure using modern DevOps tooling (GitHub Actions Runners stack, ArgoCD, FluxCD etc.)
- Automate infrastructure provisioning, configuration management, monitoring, and operational workflows using IaC and scripting languages
- Own the deployment, maintenance, and lifecycle management of systems supporting engineering (Kubernetes clusters, container registries, artifact systems, and internal developer platforms) leveraging deep expertise in Kubernetes, container runtimes, and the broader cloud-native ecosystem (Helm, Kustomize, etc.)
- Troubleshoot complex infrastructure and application issues, driving root-cause analysis and developing long-term remediation solutions
- Design, build, and maintain cloud infrastructure across major cloud providers (AWS, GCP, Azure), and develop/support deployments of applications, services, and monitoring with a strong focus on scalability, reliability, and cost optimization
- Develop internal tooling and automation using Terraform, Python, Go, or similar languages to streamline operational tasks and improve developer productivity
- Implement and manage security best practices across cloud environments, including identity management, secrets handling, audit logging, and network controls
- Leverage AI/ML tools to automate repetitive DevOps tasks, operational workflows
Requirements
- 4-8+ years of experience in DevOps, Cloud Engineering, Systems Administration, or similar infrastructure-focused roles
- Familiar with GIT and comfortable with at least one systems programming language (C/C++, golang) and one scripting language (python, bash)
- Proficiency with Infrastructure as Code (Terraform, Pulumi, or CloudFormation) and configuration management (Ansible, Chef, or SaltStack)
- Working knowledge of at least one of observability stacks (Prometheus, Grafana, ELK/OpenSearch, Datadog, etc.) and operational troubleshooting
- Strong hands on experience with Kubernetes management in production environments
- Experience designing, developing, and/or troubleshooting distributed systems
- Comfort with shells on *nix family systems
- B.S. degree or equivalent experience in Engineering, Computer Science or a related field
Nice to Have
- Hands-on experience building CI/CD pipelines from a user perspective (GitHub Workflows, GitLab pipelines, etc)
- Certifications from cloud service providers (AWS DevOps, GCP Devops Eng, etc)
- Natural curiosity or drive to learn about new or adjacent technologies
- Experience applying AI/ML tools (GitHub Copilot, cloud AI services, LLM-based automations, anomaly detection