As a Senior Systems Software Engineer, you will play a key role in building and evolving cloud-native infrastructure that powers large-scale AI computing. Your work will focus on designing and implementing robust software systems that run on Kubernetes, enabling automated, scalable, and highly available services across public cloud environments.
What You'll Do
- Design, code, and maintain backend systems that support massive cloud infrastructure deployments
- Build and distribute APIs that enable Infrastructure as Code workflows and streamline deployment pipelines
- Collaborate with architects, site reliability engineers, and product teams to define and deliver scalable solutions
- Take ownership of software projects from concept through deployment and ongoing production support
- Write comprehensive unit and integration tests to ensure system reliability and correctness
- Participate in design discussions, contributing technical insight to shape product direction
- Share lessons learned openly in a culture that values transparency and continuous improvement
What We're Looking For
You should have a strong foundation in systems programming and cloud-native technologies, with a proven ability to deliver resilient distributed services at scale. A background in modern software development practices and container orchestration is essential.
- Bachelor’s degree in Computer Science, Information Systems, Computer Engineering, or equivalent practical experience
- Minimum of 8 years of professional software engineering experience
- 3–5 years focused on large-scale distributed systems using modern development frameworks
- Deep experience with Golang, particularly in developing Kubernetes operators, controllers, and related tooling
- Hands-on track record deploying and managing services on Kubernetes, including CRDs and auto-scaling components
- Familiarity with managed Kubernetes offerings on AWS, GCP, Azure, and Oracle Cloud Infrastructure
- Experience supporting production systems, including incident response, root cause analysis, and reliability improvements
- Strong communication skills with the ability to explain technical decisions across infrastructure and application layers
Preferred Background
- Direct experience with Cluster API, Terraform, and cloud provider APIs
- Proficiency in Kustomize or similar Kubernetes configuration management tools
- Working knowledge of CNI, CSI, and CRI interfaces within the CNCF ecosystem
- Active contributions to open-source projects or community-driven development efforts
- Understanding of modern deployment patterns in hybrid and cloud-native environments
Technology Environment
Our stack is built around Golang, Kubernetes, Custom Resource Definitions, and auto-scaling infrastructure, with automation powered by Terraform and Kustomize. We operate across multiple cloud providers and are deeply integrated with the CNCF ecosystem.
Why This Matters
You’ll help shape the foundation of AI-powered computing at scale, working on systems that accelerate innovation in high-performance computing and artificial intelligence. This role offers the chance to work in a technically rigorous, inclusive environment that values curiosity, collaboration, and engineering excellence.


