Coram AI is looking for an AI Research Engineer to build production-grade AI agents powered by the latest LLMs and Claude Code. You will be responsible for turning foundation models into reliable, high-performance systems that operate in real-world environments.
What You'll Do
- Design and build autonomous agents using state-of-the-art LLMs.
- Implement tool use, retrieval pipelines, memory systems, and multi-step reasoning flows.
- Engineer prompts and system instructions for robustness, reliability, and speed.
- Optimize latency, cost, and throughput in production.
- Build evaluation frameworks to measure agent accuracy, tool correctness, and failure modes.
- Create high-quality datasets for training, fine-tuning, and benchmarking.
- Develop introspection tooling to debug reasoning chains, hallucinations, and tool misuse.
- Run structured experiments to improve agent performance through iterative testing.
What We're Looking For
- BS, MS, or PhD in Computer Science, Engineering, Machine Learning, or a related technical field from a top university.
- 2+ years of experience building software systems.
- Strong programming ability in Python.
- Experience working with modern LLM APIs (OpenAI, Anthropic, etc.) and building applications powered by foundation models.
- Experience building or contributing to production systems that must be reliable, observable, and scalable.
- Ability to diagnose and mitigate LLM failure modes such as hallucinations, tool misuse, and reasoning errors.
- Strong experimental mindset with a data-driven approach to improving system performance.
- Excellent communication skills (written and verbal) in English.
- Passion for building cutting-edge AI systems at the speed of a fast-growing startup.
- Resilient and adaptable in challenging, fast-paced environments.
- Ability to work in an onsite environment.
Nice to Have
- Experience building evaluation harnesses or LLM benchmarking systems.
- Background in machine learning, applied research, or systems performance optimization.
- Experience optimizing inference latency and cost at scale.
- Experience debugging complex agent behaviors in real-world environments.
- Experience in Go or TypeScript.
Technical Stack
- Python, Go, TypeScript
- LLMs, Claude Code
Team & Environment
You will join a small, fast-moving team at Coram AI.
Benefits & Compensation
- Competitive compensation package.
- 100% Employer-paid medical, dental, vision, and base life insurance.
- Flexible paid time off and 9 paid holidays.
- 401(k) with both Traditional and Roth options.
- Equity in a rapidly growing company.
- Referral bonuses.
- Daily team dinners and regular team off-sites.
- The latest Apple tech and unlimited tools.
- Unlimited Cursor and Claude Code credits.
- Direct exposure to our AI-native GTM machinery.
Work Mode
This role is onsite.
Coram AI values clarity, craftsmanship, and impact. Every person has a voice, ships meaningful work, and helps shape how AI can make the world safer and more connected.




