Udemy is hiring a Cloud Engineer to join our Datastore Infrastructure (DSI) team. In this role, you will be responsible for the databases, streaming, and caching infrastructure that supports our global learning platform, ensuring high uptime, security, performance, and scalability. You will lead technical improvement projects, develop infrastructure patterns, and provide mentorship within our team, which is distributed across EU and US regions.
What You'll Do
- Lead improvement projects for datastores and platform teams to align with company long-term objectives.
- Maintain infrastructure uptime, monitor performance, and ensure infrastructure scales as the company grows.
- Develop immutable infrastructure patterns and automate infrastructure provisioning via code (Terraform, Python, Golang, Ansible).
- Ensure adherence to PCI, ISO27001, and SOC 2 security requirements, modifying CI/CD processes when necessary.
- Advocate for and implement positive changes in tools and processes through healthy discussions.
- Participate in an on-call rotation with a systematic approach to incident management.
- Participate in day-to-day activities, support requests, and project-related tasks for the team.
- Contribute to documentation, maintain ticketing queues, provide project support, and offer after-hours assistance.
- Provide coaching and mentorship to new hires, fostering their technical growth and integration.
What We're Looking For
- 3-5 years of professional experience in a Cloud Engineering, SRE, or DBRE team with infrastructure responsibilities managing large production workloads.
- Proficiency with managing MySQL at scale (Horizontal Scaling, sharding, InnoDB optimizations, Query Optimization, HA/DR, Monitoring, Backups Strategy, Security, Automations).
- Strong understanding in supporting datastores running behind Kubernetes Workloads in Production.
- Proficiency with tools like Terraform, Ansible, Git and working with Infrastructure as Code and automated provisioning.
- Strong experience in Kafka/MSK cluster management, topic configuration, performance tuning, and ensuring high availability and fault tolerance.
- Strong Experience with Distributed Caching (Redis, Valkey, Memcache) or similar products.
- Experience in Python or Golang.
- Knowledge of configuration management tools, monitoring systems (Datadog or similar) for database infrastructure, and scaling strategies.
- Strong troubleshooting skills to diagnose complex database issues.
- Hands-on experience with AWS cloud infrastructure and a grasp of security best practices.
- Adaptability and comfort working in a fast-paced, hands-on environment.
Technical Stack
- Databases: MySQL, PGSQL, Aurora, DynamoDB
- Streaming & Caching: Kafka, Redis, Valkey, Memcache
- Infrastructure: Kubernetes, AWS, Terraform, Ansible, Git
- Languages: Python, Golang
- Monitoring: Datadog
Team & Environment
You will be part of Udemy's Platform team, specifically the Datastore Infrastructure (DSI) team, which is split between EU and US regions.
Benefits & Compensation
- Full access to Udemy courses
- Monthly UDay to invest in yourself
- Budget for personal improvement resources
Work Mode
This role offers a hybrid work mode and is open to candidates in San Francisco, Denver, Austin, Australia, India, Ireland, Mexico, and Türkiye.
At Udemy, we value diversity and inclusion and consider qualified applicants without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, or disability.


