Thomson Reuters is looking for a Manager, Lead Research Scientist, Training Data (Foundational Research) to lead a diverse global team of experts in foundational machine learning research. You will be involved in strategic planning, hiring, hands-on research, and translating findings into concrete deliverables, focusing on advanced algorithms and training techniques for Large Language Models (LLMs).

What You'll Do

Lead strategic planning, hiring, and management in foundational research.
Mentor, lead, and help direct reports grow and contribute to the wider group.
Innovate at the cutting edge of AI Research, making best use of rich data sources.
Develop novel performance-driven data sub-selection methods and training insights.
Participate in the entire research & model development lifecycle: brainstorming, coding, testing, and delivering high-quality reports.
Collaborate with a global team of research engineers and academic partners.
Communicate technical findings through seminars, lectures, conferences, publications, and sharing of technical assets.

What We're Looking For

PhD in a relevant discipline.
3+ years of hands-on experience leading teams building advanced ML/NLP/AI systems in academia or industry.
Strong publication record in top-tier conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL) with specific focus on training data curation, synthetic data generation, etc.
Familiarity with one or more deep learning frameworks (e.g., pytorch, jax, tensorflow).
Experience in ML Research beyond completing a PhD (e.g., supervision, industry experience, leading academic initiatives).
Excellent communication skills to report and present research findings clearly, both orally and in writing.
Curious and innovative disposition capable of devising novel, well-founded algorithmic solutions.
Good social skills and ability to motivate, inspire and mentor team members.
Comfortable working in fast-paced, agile environments, managing uncertainty and ambiguity.

Nice to Have

High-impact publications in top-tier conferences or other influence in the research community.
5+ years of hands-on experience leading teams building advanced ML/NLP/AI systems in academia or industry.
Extensive experience with deep learning and large-scale model training.
Extensive experience working with LLM training-data, ideally involved in training large-scale foundation models.
Strong software and/or infrastructure engineering skills, evidenced by code contributions to popular open-source libraries or writing production code.
Experience training large-scale models over distributed nodes with cloud tools such as Amazon AWS, MS Azure, or Google Cloud.

Technical Stack

pytorch
jax
tensorflow
Amazon AWS
MS Azure
Google Cloud

Team & Environment

You will manage a diverse global team of experts within the Foundational Research division of Thomson Reuters Labs.

Benefits & Compensation

On-the-job coaching and learning.
Opportunity to work with cutting-edge methods and technologies.
Access to large datasets (over 60,000 TBs of data) and all major cloud computing platforms.
Flexible hybrid working environment (2-3 days a week in the office).
Flex My Way policies for work-life balance, including work from anywhere for up to 8 weeks per year.
Career Development and Growth programming (Grow My Way).
Comprehensive benefit plans including flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs.
Two paid volunteer days off annually and opportunities for pro-bono consulting and ESG initiatives.

Work Mode

This role follows a hybrid work model, expected to work from the office 2-3 days per week.

Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace and makes reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs.