Diagrid is hiring a Senior Site Reliability Engineer to drive engineering efforts for a state-of-the-art, Kubernetes-based cloud platform. In this crucial role, you will ensure business continuity and uphold SLAs/SLOs for core products in a multi-cloud environment by providing reliability, security, automation, and lifecycle management.
What You'll Do
- Build and operate cutting-edge cloud infrastructure to support Diagrid's core products.
- Define standards and deliver tools, processes, and frameworks to make products secure, reliable, efficient, and highly available.
- Build and maintain CI/CD pipelines that enable delivering software quickly and securely across clouds.
- Continuously optimize services and cloud infrastructure for performance and availability.
- Design and document operational knowledge and best practices.
- Be part of the on-call rotation and lead via example.
What We're Looking For
- 8+ years of experience provisioning and managing cloud resources on Google Cloud, AWS, or Azure.
- Experience building processes and using industry standard tools for managing applications on Kubernetes.
- Experience setting up and operating stateful software on Kubernetes.
- Comprehensive knowledge of Kubernetes best practices for cluster management, security, troubleshooting, and ongoing operations.
- Ability to debug issues in Kubernetes clusters and complex distributed applications.
- Experience developing and supporting CI/CD production processes.
- Experience with Git-based version control systems.
- Experience with scripting and programming.
Nice to Have
- Multi-cloud experience.
- Experience with Terraform.
- Experience with Kafka, Redis, MySQL, and MongoDB.
- Scripting/programming experience with bash, Python, and Go.
- Bonus: experience operating Postgres or Kafka at scale in production.
- Bonus: experience with multi-tenant services.
Technical Stack
- Kubernetes, Terraform, Kafka, Redis, MySQL, MongoDB, Google Cloud, AWS, Azure, Git, bash, Python, Go, Postgres
Team & Environment
You will be part of a team driving engineering efforts for the platform.
Benefits & Compensation
- Competitive compensation.
- Company equity.
- Remote first & flexible work environment.
- Flexible paid time off.
- Comprehensive healthcare for you and your dependents.
- Choice of hardware.
- $1000 for home office setup.
- Monthly WFH stipend.
- Team events & gatherings.
Work Mode
This is a remote position.
Diagrid, Inc. is an Equal Opportunity Employer. We do not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.


