Role Overview
As a Senior Staff Site Reliability Engineer, you will play a key role in shaping and maintaining the foundation of our cloud infrastructure. You'll work within the Compute, Observability, and Reliability (COR) team to ensure our platforms are scalable, resilient, and optimized for performance and cost. Your focus will be on Kubernetes environments across EKS and AKS, infrastructure automation, and enabling engineering teams with reliable, self-service capabilities.
Key Responsibilities
- Lead platform-level initiatives to enhance reliability, scalability, and operational efficiency
- Maintain and evolve Kubernetes clusters on Azure (AKS) and AWS (EKS)
- Design and implement infrastructure-as-code using Terraform or OpenTofu
- Integrate and manage tools such as Karpenter, KEDA, Istio, ScaleOps, and Ansible
- Diagnose and resolve complex system and cluster-level issues
- Support internal engineering teams through direct collaboration and on-call rotation
- Contribute to a culture of automation by identifying and eliminating recurring operational work
- Explore and evaluate emerging technologies to improve platform capabilities
- Participate in team on-call coverage with full peer support—no one handles incidents alone
What We’re Looking For
- Strong experience operating production Kubernetes clusters in cloud environments
- Proficiency with Microsoft Azure and cloud-native services
- Hands-on expertise with Linux-based operating systems (e.g., Ubuntu, Amazon Linux, Bottlerocket)
- Fluency in scripting languages such as bash or Python
- Experience with version control systems, particularly GitHub
- Proven ability to troubleshoot complex distributed systems
- Commitment to automation, observability, and continuous improvement
- Excellent communication skills and a collaborative mindset
- Alignment with core values: taking ownership, working as one team, and acting as a trusted advisor
- Willingness to grow professionally and support others’ development
Nice to Have
- Experience with Golang
Work Environment
This role is based in the Czech Republic with flexibility for remote work within the country. The team gathers monthly in Prague for collaboration and connection. While remote-friendly, regular in-person participation is encouraged for local team members.
Benefits
- Flexible time off: 5 weeks of vacation and 5 annual sick days
- 4% employer-contributed supplemental pension
- Private health insurance (Program Health Plus) for employee and spouse
- Life insurance equal to 2x annual salary
- 5,000 CZK monthly allowance for meals, transit, and personal expenses
- 16 weeks of supplemental paid maternity leave; 8 weeks fully paid paternity leave
- Participation in the company’s RSU program
- Employee referral bonuses
- Company and team events that celebrate both work and play
- Inclusion-focused employee resource groups including networks for women, LGBTQIA+, Black, Latinx, AAPI, military veterans, people with disabilities, and gender diversity
Our Values
We believe in shared success, mutual trust, and accountability. We expect every team member to take initiative, support their peers, and contribute to an inclusive, high-performing culture.
Equal Opportunity Employer
We welcome applications from all individuals, regardless of race, religion, gender identity, sexual orientation, national origin, veteran status, or disability. We actively seek diverse perspectives and encourage candidates from historically underrepresented communities to apply, even if they don’t meet every requirement listed.


