Role Overview
We’re seeking a Senior Site Reliability Engineer to join our Production Engineering team, based remotely within the Republic of Ireland. In this role, you’ll help maintain and evolve a global platform that powers millions of user interactions every month. Your focus will be on building resilient, automated systems that support both current operations and future growth.
Key Responsibilities
- Partner with engineering teams to deploy and support new services and features at scale
- Implement monitoring and observability tools to proactively detect and resolve platform issues
- Scale Kubernetes and cloud infrastructure on AWS while meeting strict reliability targets
- Maintain the health and performance of core databases, including MySQL and Cassandra
- Diagnose and resolve production incidents using tools such as Splunk, Grafana, and Prometheus
- Automate infrastructure, deployment, and operational workflows using Python, Terraform, Puppet, and Jenkins
- Build custom solutions when existing tools fall short under high-scale demands
- Contribute improvements back to open-source projects used in production
- Design and implement new systems, testing strategies, and operational procedures
- Support a positive, inclusive team culture aligned with company values
- Participate in a shared on-call rotation with team peers
Required Qualifications
- Deep familiarity with Linux systems, including troubleshooting complex OS-level behaviors
- Strong programming skills in at least one modern language such as Python, Go, or Ruby
- Working knowledge of core internet protocols including TCP/IP, HTTP, and DNS
- Hands-on experience with public cloud platforms like AWS or GCP
- Proven experience with infrastructure-as-code tools such as Terraform, Puppet, or Ansible
- Experience operating containerized environments using Docker, Podman, and Kubernetes
- Self-driven mindset with a focus on continuous system improvement
- Ability to lead technical initiatives and collaborate across teams
- Ownership of systems across their full lifecycle—from design to decommissioning
Technology Environment
Our stack includes Linux (Ubuntu), Python, Puppet, Git, Jenkins, Terraform, Kubernetes, AWS, GCP, MySQL, Cassandra, Splunk, Grafana, Prometheus, Docker, and Podman. You’ll work across layers of infrastructure, networking, and application performance to ensure seamless user experiences.
Work Environment & Benefits
- Full project ownership from day one
- Flexible working hours and meeting-free Wednesdays to support deep work
- 25 days of paid vacation annually, increasing with tenure
- One floating holiday per year
- €150 monthly stipend for remote work setup
- €95 monthly reimbursement for dependent care
- Private health coverage with dental and vision
- Quarterly team offsites and regular hackathons
- Bi-weekly learning groups and support for career development
- €95 monthly wellness benefit
- Opportunities to attend digital events and technical conferences
- Competitive salary, pension plan, and optional stock purchase program
Company Culture
We value authenticity, collaboration, and creative problem-solving. Our environment supports personal and technical growth, with an emphasis on inclusivity and shared success. Engineers are encouraged to experiment, learn, and contribute meaningfully to the platform and team culture.
Equal Opportunity Statement
We are committed to a diverse and inclusive workplace. Qualified applicants will be considered without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, disability, or any other legally protected status.


