As a Cloud Infrastructure Engineer, you'll be responsible for building and maintaining the foundation of a cloud-native platform running on Google Cloud Platform. Your work will directly influence system reliability, scalability, and automated operations across a distributed environment.
Key Responsibilities
- Architect and manage core infrastructure components including networking, identity, and security on GCP.
- Operate and optimize Kubernetes-based systems using GKE, Knative, and Istio to support a multi-tenant platform.
- Implement infrastructure-as-code with Terraform and Terragrunt to ensure consistent, auditable deployments.
- Develop and maintain CI/CD pipelines using GitHub Actions and TeamCity, integrated with GitOps tooling like Flux and Helm.
- Oversee the data layer built on Postgres and Neon, ensuring performance, tenant isolation, and safe schema evolution.
- Lead monitoring, alerting, and logging initiatives using Prometheus, Grafana, and Loki to maintain system health.
- Address complex operational challenges such as autoscaling efficiency, cold start reduction, and queue management.
- Drive improvements in system performance through tuning of container resources, caching, distributed locks, and garbage collection settings.
- Enforce security best practices across infrastructure, including secrets management, network segmentation, and vulnerability controls.
- Own strategic initiatives like multi-region deployments, disaster recovery, and ephemeral environments.
- Mentor engineers in cloud-native practices and promote SRE principles across teams.
- Collaborate with product and engineering to define long-term platform vision and operational strategy.
Qualifications
You bring at least five years of experience designing and operating large-scale cloud infrastructure with a focus on DevOps and site reliability engineering. You have deep technical knowledge of Kubernetes, GCP, and infrastructure automation tools.
- Proven expertise in GCP (or equivalent) and hands-on experience with Kubernetes administration.
- Strong proficiency with Terraform for managing complex, secure environments.
- Familiarity with Linux internals, networking (CNI, service mesh), and distributed systems.
- Experience with CI/CD systems, GitOps workflows, monitoring stacks, and logging solutions.
- Ability to communicate technical concepts clearly across teams.
- Experience with GKE is highly valued.
Technology Environment
Google Cloud Platform, Kubernetes, GKE, Knative, Istio, Terraform, Terragrunt, GitHub Actions, TeamCity, Flux, Helm, Postgres, Neon, Prometheus, Grafana, Loki, Linux, CNI, Service Mesh, GitOps.
Work Environment
This role supports a global team with flexible work arrangements. The company fosters an open, inclusive culture that welcomes individuals of all backgrounds, identities, and experiences.
As an equal opportunity employer, we believe innovation thrives when everyone can contribute fully, regardless of age, religion, disability, gender identity, or orientation.


