What You'll Do
- Design and maintain the core infrastructure that underpins a high-availability platform used by data teams worldwide.
- Shape the architecture behind scalable service routing, cloud networking, and account lifecycle management across multiple environments.
- Build automation to safely migrate customer workloads at scale from legacy systems to a modern, multi-cell platform.
- Develop backend services in Go and Python, with opportunities to contribute in Rust, ensuring systems are resilient, observable, and performant.
- Work deeply with Kubernetes, Argo Workflows, and Terraform to streamline deployment workflows and improve system reliability.
- Tackle advanced networking challenges involving VPCs, DNS, load balancing, PrivateLink, and service mesh configurations across both single- and multi-tenant deployments.
- Collaborate across engineering, security, and support teams to guide technical direction, resolve cross-cutting issues, and mentor peers.
- Own systems end to end, diagnosing complex failures across layers and participating in an on-call rotation to ensure platform stability.
Requirements
- Proven experience building and operating large-scale distributed systems in a remote, asynchronous engineering environment
- Strong proficiency in backend development using Go or Python, with a track record of delivering production-grade infrastructure
- Extensive hands-on experience with cloud platforms (AWS, GCP, or Azure), container orchestration (Kubernetes), and Infrastructure as Code (Terraform)
- Deep understanding of cloud networking concepts such as VPCs, DNS, load balancing, proxies, and service mesh technologies
- Experience designing internal platforms or automation tools that improve developer velocity and system reliability
- Ability to lead technical problem-solving with a systematic, customer-focused mindset
- Clear communication skills, including the ability to explain technical trade-offs to diverse audiences and collaborate across team boundaries
Benefits
You'll work in a fully remote, globally distributed environment that values autonomy, deep technical work, and continuous learning. The role offers opportunities to grow expertise in multi-cloud networking, platform automation, and large-scale system design. You'll contribute to foundational infrastructure that directly impacts the reliability and scalability of a data platform used by engineers around the world.


