Requirements
- Experience in an SRE role
- Strong knowledge of cloud technologies and SLA, SLO, and SLI management
- Excellent communication and leadership skills
- Ability to analyze and improve operational processes and performance metrics
- Experience in software design, automation, and root cause analysis
- On-call support experience and customer-focused mindset
- Collaborative approach to working with commercial and technical teams
- Launching and operating production Kubernetes clusters
- Designing and operating infrastructure on AWS and other providers
- Operating MongoDB (or other document database) clusters
- Operating Redis (or other key-value store) clusters
- Administering Linux servers
- Operating Prometheus and Grafana
- Operating log collection and analysis systems
- Participating in the on-call rotation (04:00 - 16:00 UTC)
Nice to Have
- Kubernetes (administrator)
- Go and/or Python (advanced)
- AWS/EKS (advanced)
- Linux (advanced)
- Terraform and IaC in general (proficient)
- Helm (proficient)
- Monitoring – Prometheus, Grafana, Thanos (familiar)
- Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.)
- Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP)
- Proactive, energetic, innovative, and change-oriented
- A desire to lead/mentor a team
Benefits
- Everyone has unlimited paid holidays.
- We have total flexibility in hours, as we believe creativity flows better when our people are given freedom to decide when they are most productive.
- Employee share scheme
- Generous maternity and paternity leave
- Volunteering days
- Employee wellbeing platform


