Udemy is hiring a Site Reliability Engineer to ensure the reliability and scalability of our global learning platform. You will manage infrastructure across the stack, develop essential tooling, and work with teams to uphold SRE principles as we transform lives through learning.
What You'll Do
- Lead projects to develop and improve our infrastructure and tooling, collaborating with the SRE team and other teams across the engineering department.
- Act as a mentor to other engineers on the SRE team.
- Champion SRE best practices.
- Participate in an on-call rota.
What We're Looking For
- Experience managing Kubernetes clusters and cloud environments.
- Experience using infrastructure as code tools to deploy infrastructure.
- Experience writing tools and applications using programming languages such as Python, Golang, and Kotlin.
- Experience being on call.
- Experience working with a wide variety of engineering teams to guide them on best practices.
- Good communication skills and the ability to give and receive feedback responsibly.
- Extensive knowledge of cloud technologies; experience with AWS is a particular advantage.
- Experience managing containerized workloads using Kubernetes in a production environment.
- Experience with infrastructure as code tools such as Terraform and Helm.
Technical Stack
- Kubernetes, AWS, Python, Golang, Kotlin, Terraform, Helm, CI/CD
Benefits & Compensation
- Full access to Udemy courses.
- Monthly UDay to invest in yourself.
- Budget to spend on self-improvement.
- AI tools and space to experiment.
Work Mode
This is a hybrid position open to candidates in San Francisco, Denver, Austin, Australia, India, Ireland, Mexico, and Türkiye.
At Udemy, we value diversity and inclusion and consider qualified applicants without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, or disability.




