LeoTech is looking for a Site Reliability Engineer to join our team. This is a full-time position based in Austin, TX, with flexible remote work options.
What You'll Do
- Design and implement systems for monitoring, observability, and incident response.
- Build and maintain tools to automate reliability practices and improve system performance.
- Collaborate with engineering teams to define service level objectives and error budgets.
- Lead post-mortem investigations and drive the implementation of preventative measures.
- Participate in on-call rotations to ensure high availability of critical services.
What We're Looking For
- 3+ years of experience in a Site Reliability Engineer or similar role.
- Proficiency with cloud infrastructure (AWS, GCP, or Azure) and infrastructure-as-code tools.
- Strong scripting skills in languages such as Python, Go, or Bash.
- Experience with monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog).
- Solid understanding of containerization and orchestration technologies (Docker, Kubernetes).
- Excellent problem-solving skills and a systematic approach to incident management.
Nice to Have
- Experience in a fast-paced startup or product-driven environment.
- Knowledge of database performance tuning and optimization.
- Familiarity with CI/CD pipelines and GitOps methodologies.
- Contributions to open-source projects or a public technical blog.
Work Mode
This is a full-time position based in Austin, TX, with remote work options available.
LeoTech is an equal opportunity employer.

