Responsibilities
- Architect, automate, and manage resilient cloud platforms that support critical government operations.
- Ensure systems are observable, self-healing, and designed to minimize manual oversight.
- Develop and refine secure, automated deployment pipelines and standardized infrastructure configurations compliant with regulatory standards.
- Enhance deployment reliability and performance across isolated network environments.
- Establish and enforce reliability standards including uptime, response times, incident management, and service-level targets.
- Engage in incident response and rotating on-call duties with a focus on reducing manual work through automation.
- Work closely with software and systems engineers to meet mission-critical performance and reliability goals.
Benefits
- Unlimited paid time off with a minimum 15-day usage requirement
- Observance of U.S. federal government holidays
- Quarterly health and wellness stipends
- Full health insurance coverage for employees; family plans available
- 4% 401(k) contribution match
- Annual 4-day company-wide retreat
Team
Small team environment with close collaboration across frontend, backend, and platform engineering functions.
Other
- Active U.S. Security Clearance (minimum Secret; TS/SCI desired)
- Must be a U.S. Citizen
- Subject to ITAR regulations: candidates must be U.S. citizens, nationals, lawful permanent residents, refugees, or asylees, or able to obtain required State Department authorizations
- Required to participate in a 24/7 on-call rotation