Diagrid is seeking a Senior Site Reliability Engineer to join our team. You will drive engineering efforts to provide reliability, security, automation, and lifecycle management for our state-of-the-art Kubernetes-based cloud platform. This role is crucial for ensuring business continuity for our users and upholding SLAs and SLOs for Diagrid's core products.
What You'll Do
- Build and operate cutting-edge cloud infrastructure to support Diagrid's core products.
- Define standards, deliver tools, processes, and frameworks to make products secure, reliable, efficient, and highly available.
- Build and maintain CI/CD pipelines that enable delivering software quickly and securely across clouds.
- Continuously optimize services and cloud infrastructure for performance and availability.
- Design and document operational knowledge and best practices.
- Be part of the on-call rotation and lead via example.
What We're Looking For
- 8+ years of experience provisioning and managing cloud resources on Google Cloud, AWS or Azure.
- Experience building processes and using industry standard tools for managing applications on Kubernetes.
- Experience setting up and operating stateful software on Kubernetes.
- Comprehensive knowledge of Kubernetes best practices for cluster management, security, troubleshooting, and ongoing operations with failover, backup & restore.
- Ability to debug issues in Kubernetes clusters and complex distributed applications.
- Experience developing and supporting CI/CD production processes.
- Experience with Git-based version control systems.
- Experience with scripting and programming.
Nice to Have
- Multi-cloud experience.
- Preferably Terraform.
- Preferably Kafka, Redis, MySQL, and MongoDB.
- Preferably bash, Python, and Go.
- Bonus: experience operating Postgres or Kafka at scale in production.
- Bonus: experience with multi-tenant services.
Technical Stack
- Kubernetes, Terraform, Kafka, Redis, MySQL, MongoDB, Postgres, bash, Python, Go, Google Cloud, AWS, Azure
Benefits & Compensation
- Competitive compensation.
- Company equity.
- Remote first & flexible work environment.
- Flexible paid time off.
- Comprehensive healthcare for you and your dependents.
- Choice of hardware.
- $1000 for home office setup.
- Monthly WFH stipend.
- Team events & gatherings.
- Chance to collaborate with industry-leading figures.
Work Mode
This role is global and remote-first.
Diagrid, Inc. is an Equal Opportunity Employer. We do not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.



