Umbra is seeking a Cloud Infrastructure Engineering Manager to lead cross-functional teams in building and maintaining the reliable, scalable systems that power our core business operations. You will be instrumental in guiding our infrastructure strategy and team development.
What You'll Do
- Lead 3 specialized teams (SRE, CI/CD, and Platform Engineering) with approximately 5 direct reports.
- Partner closely with the Director of Site Reliability Engineering to develop strategic initiatives and provide regular reporting.
- Partner with Software Engineering teams to understand application infrastructure needs.
- Partner with Security teams to implement robust security practices across all cloud infrastructure and deployment processes.
- Set and track clear objectives for individual and team development.
- Facilitate performance reviews and conduct regular one-on-one meetings.
- Collaborate with HR and senior management to define career paths and growth opportunities.
- Oversee the hiring process for the cloud infrastructure teams.
- Facilitate efficient team execution and project delivery by coordinating tasks and removing obstacles.
- Identify process bottlenecks and inefficiencies, and work towards continuous improvement.
What We're Looking For
- A minimum of 6 years of relevant technical experience.
- A minimum of 2 years of experience in personnel management.
- Strong understanding of site reliability and platform engineering processes for mission-critical, high-availability systems.
- Exceptional communication and interpersonal skills.
- Strong problem-solving skills and a proactive attitude.
- Strong understanding of agile methodologies and project management.
- The ability to coach and guide technical teams effectively.
Nice to Have
- Experience managing cross-functional technical teams, ideally encompassing SRE, CI/CD, and Platform Engineering disciplines.
- Experience with budget and resource management, including personnel allocation and workload balancing.
- Strong experience with AWS cloud services, with a focus on scalability and reliability.
- Familiarity with monitoring and observability platforms (Prometheus, DataDog, Grafana, etc.) and implementing SLI/SLO frameworks.
- Background in Kubernetes orchestration and microservice architecture.
- Comfortable with Infrastructure as Code tools (Terraform/OpenTofu, CloudFormation, etc.).
- Experience with platform engineering and self-service tooling development.
- Familiarity with CI/CD pipeline design and operation.
Technical Stack
- AWS
- Prometheus
- DataDog
- Grafana
- Kubernetes
- Terraform
- OpenTofu
- CloudFormation
Team & Environment
You will lead 3 specialized teams (SRE, CI/CD, and Platform Engineering) with approximately 5 direct reports. You will partner closely with the Director of Site Reliability Engineering.
Benefits & Compensation
- Compensation: $160,000 - $190,000 DOE + equity: Stock Options
- Flexible Time Off, Sick, Family & Medical Leave
- Medical, Dental, Vision, Life, LTD, STD (employer funded)
- Vol Life, Critical Illness, Accidental, Hospital Indemnity, Pet Insurance (employee funded)
- 401k with 3% non-elective company contribution
- Free Parking (on-site only)
- Free lunch in office daily (on-site only)
Work Mode
This role operates on a hybrid work model from our offices in Santa Barbara, CA or Arlington, VA.
Umbra is an Equal Opportunity Employer. We do not discriminate in hiring on the basis of sex, gender identity, sexual orientation, race, color, religious creed, national origin, physical or mental disability, protected veteran status, or any other characteristic protected by federal, state, or local law.



