Allianz is seeking a Lead-Application Operations / DevOps Engineer to join our team. In this role, you will be responsible for designing, implementing, and managing our cloud infrastructure and CI/CD pipelines to deliver scalable, high-availability solutions.
What You'll Do
- Design, implement, and manage cloud infrastructure on AWS and Amazon EKS, ensuring high availability and disaster recovery.
- Develop and maintain highly available production EKS clusters.
- Manage Docker containers and Kubernetes clusters for containerization and orchestration.
- Design and optimize CI/CD pipelines using Jenkins, ArgoCD, GitHub Actions, and other tools.
- Automate workflows with Infrastructure as Code (IaC) tools like Terraform and Ansible.
- Implement and manage monitoring solutions using Prometheus, Grafana, and Dynatrace.
- Troubleshoot issues and continuously optimize cloud infrastructure.
- Apply DevOps security best practices and enhance security protocols.
- Document technical target infrastructure and processes comprehensively.
- Collaborate with cross-functional teams to deliver scalable cloud solutions.
- Migrate tools and applications to new infrastructure and update infrastructure regularly for evolving project requirements.
- Maintain procedural tools, such as Artifactory and Jenkins, and enhance existing CI/CD pipeline libraries.
- Migrate monitoring solutions (e.g., Prometheus/Grafana to Dynatrace).
- Follow the OE calendar for this role.
What We're Looking For
- University Graduate degree (mandatory).
- Strong hands-on experience with AWS services and Kubernetes.
- Advanced scripting skills in Python, Bash, Groovy, and Ansible.
- Proficiency in IaC tools like Terraform and Ansible.
- Expertise in Docker and Kubernetes, including CI/CD processes and deployment.
- Experience with version control systems, particularly GitHub.
- Proficiency in monitoring tools (Prometheus, Grafana) and logging systems (Dynatrace).
- Understanding of networking concepts and protocols (DNS, HTTP/HTTPS, TCP/IP).
- Knowledge of PostgreSQL and database management.
- Experience in designing and implementing disaster recovery and fault-tolerant systems.
- Familiarity with AzTech Cloud Services and advanced cloud-based solutions.
- Strong documentation and communication skills for technical processes.
- Experience in evolving and maintaining production-grade EKS clusters.
Nice to Have
- Basic understanding or exposure to AI tools.
Technical Stack
- Cloud & Infrastructure: AWS, Amazon EKS
- Containers & Orchestration: Docker, Kubernetes
- CI/CD & Automation: Jenkins, ArgoCD, GitHub Actions, Terraform, Ansible
- Monitoring & Observability: Prometheus, Grafana, Dynatrace
- Languages & Scripting: Python, Bash, Groovy
- Other Tools: GitHub, PostgreSQL, AzTech Cloud Services
Benefits & Compensation
- Hybrid work model which recognizes the value of striking a balance between in-person collaboration and remote working.
- Up to 25 days per year working from abroad.
- Compensation and benefits package includes a company bonus scheme, pension, employee shares program, and multiple employee discounts (details vary by location).
- Lifelong learning for our employees worldwide and an environment where innovation, delivery, and empowerment are fostered.
- Flexible working, health and wellbeing offers (including healthcare and parental leave benefits) to support balancing family and career.
Work Mode
This role follows a hybrid work model, combining in-person collaboration with remote working.
Allianz is an equal opportunity employer.






