About the Role
Role details below.
Responsibilities
- Managing and evolving AWS infrastructure using Pulumi, with a focus on reliability, cost efficiency, and maintainability
- Operating and improving Kubernetes clusters: workload scheduling, resource management, networking, and observability
- Owning and optimizing GitHub Actions workflows to keep builds fast, feedback tight, and deployments safe
- Hardening infrastructure posture, supporting audit readiness, and implementing controls that meet healthcare compliance requirements
- Supporting the reliable operation of data engineering workflows
- Deploying and maintaining prompt tracing, evaluation, and observability tools as AI capabilities are integrated into the product
- Managing secure, zero-trust connectivity via Tailscale across distributed infrastructure
- Designing, documenting, and regularly testing disaster recovery and incident response processes
Benefits
- Market-competitive compensation based on experience and ability
- Significant equity option grants
- 100% company-paid premiums for health, dental, and vision coverage
- Company co