Axiom is looking for a Site Reliability Engineer to help uphold our promise of superior reliability and performance. You will be pivotal in collaborating with backend engineers and product teams to create, operate, and continuously improve scalable and reliable systems.
What You'll Do
- Engineer and maintain a robust, secure, and scalable infrastructure for Axiom Cloud.
- Collaborate with engineering teams to define and refine service level objectives.
- Contribute to disaster recovery planning, capacity engineering, performance analysis, and system tuning.
- Foster best practices for code deployments and aid in the education of the broader development team.
- Roll out tooling and solutions that improve system reliability and reduce manual toil.
- Address and remediate service incidents and contribute to postmortems and root cause analyses.
- Foster a culture of monitoring, alerting, and observability across the organization.
What We're Looking For
- Over two years of experience in a reliability-focused engineering environment.
- Passion for system reliability, latency, performance, and efficiency.
- Familiarity with AWS tools and technologies.
- Hands-on experience with Docker, Kubernetes, and Amazon EKS.
- Understanding of infrastructure-as-code tools such as Terraform or Pulumi.
- Strong networking knowledge and adeptness with Linux systems.
- Familiarity with CI platforms like GitHub Actions, GitLab, or CircleCI.
- Ability to efficiently use LLMs.
- Experience with monitoring, alerting, and observability tools.
Nice to Have
- Proven track record of maintaining production systems at scale.
- A software engineering background with expertise in Golang.
Technical Stack
- AWS, Docker, Kubernetes, Amazon EKS, Terraform, Pulumi, Linux
- GitHub Actions, GitLab, CircleCI, LLMs, Golang
Team & Environment
You will be collaborating closely with backend engineers and product teams within our remote-first, globally distributed organization.
Benefits & Compensation
- Flexibility to work from wherever suits you best.
- Budget to build your home office set-up.
- Monthly budget to support mental and physical wellness.
- A focus day each week with no meetings, Slack, or Zoom.
- Uncapped vacation to unplug and rejuvenate.
- Generous and flexible family leave for everyone.
Work Mode
This is a global, remote-first role. We are considering individuals based in UTC-5 (EST) to UTC+2, with a preference for those in UTC-3.
Axiom is an equal opportunity employer.


