United Kingdom or Spain or Portugal or Canada Remote (Global)

Diagrid is hiring a Site Reliability Engineer

Design and maintain a highly available, secure, and automated cloud platform built on Kubernetes, supporting managed state stores and message brokers. Ensure robust lifecycle management and operational excellence across multi-cloud environments through automation, observability, and resilient design.

Responsibilities

  • Develop and manage advanced cloud infrastructure to power core platform products
  • Establish security, reliability, and efficiency standards by delivering tools, frameworks, and operational processes
  • Maintain and enhance CI/CD pipelines for secure, rapid software delivery across multiple cloud providers
  • Lead on-call support and drive incident resolution with a focus on continuous improvement

Requirements

  • Minimum of 8 years managing cloud infrastructure on AWS, GCP, or Azure, with experience across multiple providers preferred
  • Proven experience using Kubernetes and tools like Terraform to manage application deployments at scale
  • Hands-on experience running stateful systems such as Kafka, Redis, MySQL, and MongoDB on Kubernetes
  • Deep understanding of Kubernetes operations including cluster management, security, failover, backup, and troubleshooting
  • Strong scripting and programming skills in languages such as bash, Python, or Go

Nice to Have

  • Production experience managing Kafka or Postgres at scale
  • Experience operating multi-tenant platform services

Tech Stack

Kubernetes, Terraform, Kafka, Redis, MySQL, MongoDB, Postgres, GCP, AWS, Azure, Dapr, KEDA, CI/CD, Bash, Python, Go

Benefits

  • Competitive compensation package
  • Equity in the company
  • Fully remote and flexible work environment
  • Generous flexible paid time off
  • Comprehensive healthcare coverage for employees and dependents
  • $1,000 stipend for home office setup

Compensation

Competitive compensation. Equity: Company equity. Monthly WFH stipend, $1000 for home office setup

Work Arrangement

global — Remote first & flexible work environment

Team

Engineering team focused on cloud platform and reliability, led by the creators of Dapr and KEDA

  • Commitment to open-source software, open standards, and API-driven innovation
  • Focus on improving developer productivity through infrastructure abstraction
  • Active collaboration with recognized industry experts
  • Dedicated to diversity, inclusion, and equitable practices
  • Equal Opportunity Employer

Additional Information

  • Founded by the creators of the Dapr and KEDA open-source projects
  • Dapr is a graduated project in the Cloud Native Computing Foundation (CNCF), alongside Kubernetes, Prometheus, and Istio
  • Backed by leading venture capital firms and advised by prominent technology leaders
  • All hiring decisions are based on qualifications, merit, and business requirements
Required Skills
KubernetesTerraformKafkaRedisGoPythonbashMySQLMongoDBPostgresGoogle CloudAWSAzure KubernetesTerraformKafkaRedisMySQLMongoDBPostgresGCPAWSAzureDaprKEDACI/CDBashPython
About company
Diagrid
Diagrid provides developers with APIs and tools that help them focus on their code and not on infrastructure. The company is founded by the creators of the Dapr and KEDA open-source projects.
All jobs at Diagrid Visit website
Job Details
Department Information Technology
Category infrastructure
Posted 2 months ago