Cognizant is looking for an AWS Site Reliability Engineer within its Cloud Infrastructure & Security services practice. In this role, you will be responsible for designing, deploying, and managing AWS environments with a focus on automation, scalability, and security.
What You'll Do
- Design, code, test, and deliver software to automate manual operational work.
- Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure.
- Collaborate with development teams to help build software for reliability and scale.
- Identify application patterns and analytics to support better service level objectives.
- Design self-healing and resiliency patterns.
- Design automated software and product upgrades, change management, and release management solutions.
- Collaborate with senior technical leads and mentor junior engineers.
- Design, deploy, and manage AWS environments with a focus on automation, scalability, and security.
- Build and maintain Infrastructure as Code using tools such as Terraform.
- Monitor and optimize system performance, availability, and security, applying observability best practices.
What We're Looking For
- Strong hands-on exposure in AWS, Terraform, Python/Bash, CI/CD.
- Experience with Infrastructure as Code and CI/CD tools like Bitbucket, Jenkins, or Spinnaker.
- Strong knowledge of containerization and orchestration, including Docker and Kubernetes.
- Strong scripting skills in Python or Bash for automation.
- Proven experience deploying, managing, and a deep understanding of AWS cloud infrastructure in secure environments.
- Working knowledge of infrastructure components like routers, load balancers, cloud products, container systems, compute, storage, networks, VPC, subnets, and security groups.
- Excellent troubleshooting, problem-solving, and debugging skills.
- A Bachelor’s degree or equivalent experience in a software engineering discipline.
Nice to Have
- Basic knowledge of AI technologies and prompt engineering to leverage generative AI for enhancing productivity and automating tasks.
Technical Stack
- AWS, Terraform, Python, Bash
- CI/CD, Bitbucket, Jenkins, Spinnaker
- Docker, Kubernetes





