Remote (Global) | Full-time

Cyberhaven is hiring a Senior Director, SRE & Cloud Infrastructure

About the Role

Cyberhaven is seeking a Senior Director of SRE & Cloud Infrastructure to lead the teams responsible for the reliability, scalability, and cost-efficiency of our data security platform. You will own the infrastructure and operational foundations that power our engineering organization and customer-facing products at massive scale. This is a chance to help revolutionize data security while bringing an AI-driven approach to reliability.

What You'll Do

  • Lead, grow, and mentor high-performing globally distributed SRE and Infrastructure teams, including managers and senior ICs.
  • Own the reliability, availability, scalability, and performance of our production and developer platforms.
  • Define and execute the SRE and infrastructure strategy, including cloud architecture, Kubernetes platforms, CI/CD, and automation.
  • Drive horizontal scaling and enable teams to operate independently by decoupling and modularizing both architecture and processes.
  • Drive infrastructure cost (COGS) optimization, capacity planning, and cloud financial management in close partnership with Finance and Engineering leadership.
  • Establish and evolve SLOs, SLIs, error budgets, and operational best practices across the organization.
  • Oversee incident management, postmortems, and continuous improvement, ensuring a strong culture of learning and ownership.
  • Collaborate closely with security to ensure our infrastructure is secure, compliant, and resilient by design.
  • Contribute to and uphold strong documentation, operational standards, and knowledge sharing across teams.

What We're Looking For

  • Led SRE and Infrastructure organizations at high-growth SaaS, platform, or security companies.
  • Strong technical leader with deep experience in cloud-native systems and a strong SRE mindset.
  • Strong background in Kubernetes, cloud platforms (GCP and/or AWS), and infrastructure as code (Terraform or equivalent).
  • Designed or operated large-scale distributed systems, real-time data pipelines, or high-throughput platforms.
  • Experience owning COGS, cloud spend, and efficiency metrics, with the ability to clearly communicate tradeoffs to executives.
  • Comfortable operating at multiple levels: strategic planning, architectural reviews, and deep technical problem solving.
  • Use data and metrics to drive reliability, performance, cost optimization, and team productivity.
  • Proven track record of scaling teams and systems while maintaining high reliability and velocity.
  • Empathetic leader who fosters inclusion, ownership, accountability, and psychological safety.
  • Thrive in fast-moving environments and are comfortable navigating ambiguity and change.

Technical Stack

  • Kubernetes
  • GCP
  • AWS
  • Terraform

Team & Environment

You will lead globally distributed SRE and Infrastructure teams, including managers and senior ICs. You will report directly to the SVP of Engineering.

Work Mode

This role is remote and open to candidates globally.

Cyberhaven is committed to creating a diverse environment and is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Required Skills

  • Kubernetes
  • GCP
  • AWS
  • Terraform
  • SRE
  • Cloud Infrastructure
  • Site Reliability Engineering
  • Infrastructure as Code
  • Distributed Systems
  • Automation
  • Monitoring
  • Incident Management
  • Capacity Planning
  • Performance Optimization

About the Company

Cyberhaven reimagines data protection with AI-enabled data lineage that analyzes billions of workflows to understand data, detect risk, and stop threats. The company is backed by $250M from leading investors.

Job Details

  • Category: Infrastructure
  • Posted: a month ago