Responsibilities
- Design and develop novel agentic solutions
- Improve upon SOTA on hard agentic tasks
- Research the next generation of online self-improvement through learning from experience
- Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve the performance of agentic systems
- Work with an amazing team of researchers and engineers pushing the boundaries
Requirements
- Strong software engineering skills
- Proficiency in Python and some experience with ML-related code (e.g., PyTorch, NumPy)
Nice to Have
- Experience with LLMs and agentic frameworks
- Experience with post-training LLMs (SFT, PEFT, or RL*)
- Experience with building synthetic data generation pipelines
Benefits
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, with offices in Toronto, New York, San Francisco, London, and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days)
Additional Information
- Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
