Responsibilities
- Build and maintain data pipelines for LLM (Large Language Model) training and evaluation, curate user-understanding signals (such as intents, preferences, and behavioral features), and ensure data quality, privacy, and proper dataset management.
- Develop and manage labeling and feedback loops, including heuristics, annotation jobs, and prompt-based labeling, to create high-quality corpora, collaborating with Data Engineering and Applied Science partners to improve data coverage and reduce noise.
- Design, prototype, and ship to production agentic AI solutions, including multi-agent systems using frameworks like LangGraph, and implement context-aware features in partnership with senior engineers.
- Implement an evaluation framework to measure model quality on offline test sets (accuracy, bias, safety, user-intent coverage), and build dashboards to track improvements over time.
- Lead and contribute to experimentation by implementing metrics, A/B tests, and monitoring, helping to harden prototypes for reliable rollouts.
- Collaborate with senior engineers and cross-functional partners to select the right technologies, participate in code reviews, and share best practices (including mentoring interns or new hires as needed).
- Summarize research findings and model evaluations into clear write-ups and demos for the team and cross-functional stakeholders.
- Stay current on emerging agentic AI paradigms, implement paper-inspired proofs of concept, and contribute insights to the team roadmap.
Requirements
- A master’s degree or above, or equivalent experience in Computer Science, Electrical Engineering, or a related field, with an emphasis on building products using frontier multimodal LLMs (Large Language Models).
- Expertise in agentic AI, pretraining, fine-tuning, and reinforcement learning of large language models.
- 3+ years of hands-on experience building large-scale, high-impact solutions, ideally with recent experience in agent-based systems, multi-agent collaboration, or similar paradigms.
- Experience deploying and scaling AI services capable of handling hundreds of millions of daily interactions with high availability, low latency, and robust fault tolerance.
Nice to Have
- A track record of writing articles and publishing high-impact research in top AI conferences is a big plus.
Benefits
- equity awards based on factors such as experience, performance and location
Team
Structure: lean, customer-focused team comprises scientists and engineers working together to deliver a delightful customer experience.
Additional Information
- Employees in this role will not be paid below the salary threshold for exempt employees in the state where they reside.
- If you have a disability or special need that requires accommodation, please contact your recruiter directly.
- Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable state and local law.


