About the Role

The position involves designing and implementing pre-training approaches for large language models, with an emphasis on improving model capabilities, safety, and interpretability through empirical research and engineering.

Compensation

Competitive salary and benefits package

Work Arrangement

Hybrid or remote options available

Team

Part of a research-focused team advancing AI model development

Responsibilities

Design and run experiments to improve pre-training methodologies
Develop scalable training pipelines for large language models
Analyze model behavior during pre-training phases
Collaborate with researchers to test hypotheses about model scaling
Implement and refine data filtering and curation techniques
Optimize training efficiency and compute utilization
Debug and resolve issues in distributed training setups
Contribute to internal tools for monitoring model training
Evaluate the impact of architectural choices on pre-training outcomes
Document findings and share insights across teams

Qualifications

Strong programming skills in Python and machine learning frameworks
Experience with deep learning and transformer-based models
Familiarity with large-scale training infrastructure
Background in natural language processing or related field
Proven ability to conduct independent research
Experience with distributed training systems
Knowledge of data handling at scale
Understanding of model evaluation techniques
Ability to analyze and interpret training dynamics
Clear communication skills for technical collaboration

Preferred Qualifications

PhD in computer science, machine learning, or related discipline
Prior work on language model pre-training
Experience with reinforcement learning from human feedback
Contributions to open-source machine learning projects
Publications in relevant research venues
Familiarity with safety-focused AI research
Knowledge of interpretability methods
Experience optimizing training for robustness

What We Value

Rigorous empirical approach to research
Focus on long-term model safety
Collaborative problem-solving
Transparency in experimental design
Commitment to reproducible results
Curiosity about model internals
Attention to ethical implications
Willingness to iterate on failed hypotheses

Visa sponsorship may be available for qualified candidates

Anthropic is hiring a Research Engineer/Research Scientist, Pre-training

About the Role

Compensation

Work Arrangement

Team

Responsibilities

Qualifications

Preferred Qualifications

What We Value

Focus on work, not paperwork

Similar Jobs

Board Certified Behavior Analyst (BCBA)

Remote Scoop Operator

Bilingual Customer Service Representative - In office/hybrid

Executive Assistant

Enterprise Core Sales Engineer - Midwest

Sales Associate - Immigration Law Firm