Responsibilities
- Design and implement robust, high-throughput systems for serving generative AI models, managing data pipelines, and enabling distributed computation
- Develop and expand infrastructure supporting embeddings, retrieval mechanisms, data discovery, and evaluation processes
- Create and sustain secure, scalable APIs with a focus on usability and developer efficiency
- Improve deployment automation, monitoring tools, and operational standards to strengthen system reliability
- Work across teams to assess new generative AI research, performance benchmarks, and emerging methods, integrating high-value advancements into production
- Influence platform architecture decisions related to cluster management, service orchestration, and system dependability
Compensation
Competitive salary and equity in a well-funded startup
Work Arrangement
Remote-first, with expectation to collaborate during U.S. business hours
Team
Small, agile team with a culture of curiosity, humility, and long-term thinking
Why Join Us?
- Make a meaningful contribution to the advancement of generative AI in enterprise environments
- Work remotely from any location, supported with the necessary tools and flexibility
- Be guided by founders who have led successful tech ventures and are deeply passionate about AI innovation
- Operate in a flat, transparent organization with minimal bureaucracy and high ownership
- Receive competitive pay, equity stake, and comprehensive benefits package
- Enjoy medical, dental, vision, disability, and life insurance, 401(k) with 4% match, FSA, HSA, PTO, and remote work support
Who We Are
- A compact, mission-driven team creating a seamless platform for deploying generative AI
- Our values: inquisitive, grounded, and persistent in pursuit of excellence
- We are collaborative, respectful, and talented—focused on impact without ego
- We act quickly and decisively while maintaining a strategic, long-range perspective
How to Apply
- Showcase your technical depth with code samples from GitHub or similar platforms
- Include documentation of a system design you led or contributed to significantly
- Share any benchmarking or evaluation projects that highlight analytical rigor
Other
- Applicants must be legally permitted to work in the U.S. without sponsorship now or in the future
- Remote applicants accepted if based in the U.S. and able to align with U.S. working hours
- Submit code repositories, system designs, and technical evaluations as part of your application
- If you used generative AI in preparing your application, we’re interested in how you applied it
Not available; candidates must be authorized to work in the United States without sponsorship