Thomson Reuters is looking for a Manager, Lead Research Scientist, Training Data (Foundational Research) to lead a diverse global team of experts in foundational machine learning research. You will be involved in strategic planning, hiring, hands-on research, and translating findings into concrete deliverables, focusing on advanced algorithms and training techniques for Large Language Models (LLMs).
What You'll Do
- Lead strategic planning, hiring, and management in foundational research.
- Mentor, lead, and help direct reports grow and contribute to the wider group.
- Innovate at the cutting edge of AI Research, making best use of rich data sources.
- Develop novel performance-driven data sub-selection methods and training insights.
- Participate in the entire research & model development lifecycle: brainstorming, coding, testing, and delivering high-quality reports.
- Collaborate with a global team of research engineers and academic partners.
- Communicate technical findings through seminars, lectures, conferences, publications, and sharing of technical assets.
What We're Looking For
- PhD in a relevant discipline.
- 3+ years of hands-on experience leading teams building advanced ML/NLP/AI systems in academia or industry.
- Strong publication record in top-tier conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL) with specific focus on training data curation, synthetic data generation, etc.
- Familiarity with one or more deep learning frameworks (e.g., pytorch, jax, tensorflow).
- Experience in ML Research beyond completing a PhD (e.g., supervision, industry experience, leading academic initiatives).
- Excellent communication skills to report and present research findings clearly, both orally and in writing.
- Curious and innovative disposition capable of devising novel, well-founded algorithmic solutions.
- Good social skills and ability to motivate, inspire and mentor team members.
- Comfortable working in fast-paced, agile environments, managing uncertainty and ambiguity.
Nice to Have
- High-impact publications in top-tier conferences or other influence in the research community.
- 5+ years of hands-on experience leading teams building advanced ML/NLP/AI systems in academia or industry.
- Extensive experience with deep learning and large-scale model training.
- Extensive experience working with LLM training-data, ideally involved in training large-scale foundation models.
- Strong software and/or infrastructure engineering skills, evidenced by code contributions to popular open-source libraries or writing production code.
- Experience training large-scale models over distributed nodes with cloud tools such as Amazon AWS, MS Azure, or Google Cloud.
Technical Stack
- pytorch
- jax
- tensorflow
- Amazon AWS
- MS Azure
- Google Cloud
Team & Environment
You will manage a diverse global team of experts within the Foundational Research division of Thomson Reuters Labs.
Benefits & Compensation
- On-the-job coaching and learning.
- Opportunity to work with cutting-edge methods and technologies.
- Access to large datasets (over 60,000 TBs of data) and all major cloud computing platforms.
- Flexible hybrid working environment (2-3 days a week in the office).
- Flex My Way policies for work-life balance, including work from anywhere for up to 8 weeks per year.
- Career Development and Growth programming (Grow My Way).
- Comprehensive benefit plans including flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs.
- Two paid volunteer days off annually and opportunities for pro-bono consulting and ESG initiatives.
Work Mode
This role follows a hybrid work model, expected to work from the office 2-3 days per week.
Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace and makes reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs.

