Responsibilities
- Have developed a deep understanding of Platform and Platform customers
- Have built and operated cloud infrastructure (AWS) based on customer requirements
- Have solved any development and deployment challenges around making Platform infrastructure highly reliable, easy to maintain, and cost effective
- Have participated in the development, execution, and support of the new feature rollout with solution architects, forward-deployed engineers, and customer success teams
- Have developed and contributed to existing and new monitoring and alerting systems for Platform infrastructure
- Have hardened infrastructure security including network, storage, user access, etc.
- Have responded and solved key customer reported issues in a timely manner
- Participated in an on-call rotation
- Spearheaded the adoption of AI-driven development workflows, integrating LLM-based tools into the SDLC to accelerate feature delivery and reduce mean time to resolution (MTTR)
- Architected and deployed custom MCP (Model Context Protocol) tooling and autonomous agents, enabling seamless data interoperability between internal systems and AI models to automate complex platform engineering tasks
Requirements
- Engineering Excellence: Proficiency in Python or Rust, with deep technical troubleshooting skills and extensive experience building backend APIs and microservices
- Cloud Infrastructure: Expert-level knowledge of AWS core services and the ability to architect scalable solutions from high-level requirements
- Kubernetes & Orchestration: Hands-on experience managing the "care and feeding" of Kubernetes clusters (upgrades, Service Mesh/Istio, Helm) and scaling compute services using ArgoCD and Argo Rollouts
- Infrastructure as Code & CI/CD: Proficiency in Terraform/Scalr and experience building robust CI/CD pipelines using GitHub Actions or Jenkins
- Networking & Connectivity: Solid understanding of networking fundamentals (subnets, CIDR, VPCs) to ensure secure, global application accessibility
- Data Frameworks: Familiarity with big-data tools and workflow orchestrators such as Snowflake, Airflow, and Spark
- AI & Productivity: Ability to leverage AI tools (Gemini, ChatGPT, Cursor) to streamline engineering workflows and enhance personal productivity
Nice to Have
- Experience with AWS preferred. AWS Cloud Infrastructure certification is a plus
- Experience with data privacy concerns such as HIPAA or GDPR
- Working knowledge of data modeling and storage across relational (PostgreSQL), NoSQL (DynamoDB, Redis), and MPP databases (Snowflake, Redshift)
- Experience working with cross-functional teams and with other customer-facing teams
- Passion! We hope you are passionate about our mission and technology
- Ownership! We do not need you to know everything, but we hope you have deep knowledge in at least one area and can start contributing quickly. And we would love to learn from you in your area(s) of expertise as well
- Expertise! We do not need you to know everything, but we hope you have deep knowledge in at least one area and can start contributing quickly. And we would love to learn from you in your area(s) of expertise as well
Work Arrangement
Hybrid — San Francisco, New York City, Chicago
Additional Information
- Equal opportunity employment: Komodo Health provides equal employment opportunities and prohibits discrimination and harassment based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws
- Candidates must acknowledge that they have read and understand Komodo Health’s Privacy Notice for Employees and Contractors
- Hybrid work model with intentional in-office rhythms and flexibility
- Expectations about work setup will be clear before joining