Crusoe is seeking a Staff Software Engineer to architect and scale the next generation of the Crusoe Cloud Control Plane. You will be responsible for high-availability systems that manage a global fleet of AI-optimized compute, network, and storage resources. Your work will define the systemic architecture of our IaaS platform, ensuring it remains fault-tolerant, performant, and integrated with underlying hardware as we scale.
What You'll Do
- Design and lead the implementation of scalable, reliable microservices that power the Crusoe Cloud control plane and manage complex virtualized resource lifecycles.
- Build the backend primitives that underpin our IaaS platform, ensuring high throughput and low-latency API responses for large-scale cluster provisioning.
- Collaborate with Product, Networking, Storage, and Hardware teams to evaluate emerging frameworks and tools, creating differentiated cloud solutions for AI/ML customers.
- Drive company-wide architectural decisions that improve the maintainability, observability, and disaster-recovery capabilities of our distributed systems.
- Mentor senior and mid-level engineers, lead rigorous design reviews, and evolve hiring practices to build a world-class engineering organization.
- Author and review comprehensive design docs for multi-region control plane services that must handle thousands of concurrent resource state transitions.
- Identify and eliminate bottlenecks in our resource orchestration layer.
- Partner with SRE and Cloud Support to translate operational feedback into architectural improvements that harden our production environment.
- Spend time pairing with team members to solve complex bugs or architectural hurdles.
What We're Looking For
- 8+ years of software development experience.
- Mastery of modern compiled languages—Go is highly preferred, but Rust or C++ are also valued.
- Proven track record of designing, deploying, and scaling fault-tolerant distributed systems and managed cloud services at high scale.
- Deep technical proficiency with the modern infrastructure stack, including Kubernetes, Docker, Terraform, Postgres, pub/sub messaging, and sophisticated CI/CD pipelines.
- A strong understanding of how cloud resources (Compute, Network, Storage) are abstracted and managed in an IaaS environment.
- Demonstrated experience in guiding engineering teams, improving onboarding processes, and driving the professional growth of others.
- Exceptional ability to articulate complex technical trade-offs to both engineering peers and non-technical stakeholders.
Technical Stack
- Languages: Go, Rust, C++
- Infrastructure: Kubernetes, Docker, Terraform, Postgres
- Systems: pub/sub messaging, CI/CD pipelines
Team & Environment
You will be part of the Compute team.
Benefits & Compensation
- Compensation range: $208,600 - $254,400 + equity: Restricted Stock Units are included in all offers.
- Industry competitive pay
- Restricted Stock Units
- Health insurance package options (HDHP and PPO, vision, and dental)
- Employer contributions to HSA accounts
- Paid Parental Leave & Life Insurance
- 401(k) with a 100% match up to 4% of salary
- Generous paid time off and holiday schedule
- $200/month Commuter FSA benefit
- Cell phone and tuition reimbursement
- Subscription to the Calm app and MetLife Legal
Work Mode
This is an onsite role located in San Francisco or Sunnyvale.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.






