San Francisco, CA Hybrid Employment $350,000 - $850,000 USD

Anthropic is hiring a Research Engineer, Performance RL

Responsibilities

  • Invent, design and implement RL environments and evaluations.
  • Conduct experiments and shape our research roadmap.
  • Deliver your work into training runs.
  • Collaborate with other researchers, engineers, and performance engineering specialists across and outside Anthropic.

Requirements

  • expertise with accelerators (CUDA, ROCm, Triton, Pallas)
  • ML framework programming (JAX or PyTorch)
  • worked across the stack – kernels, model code, distributed systems
  • balance research exploration with engineering implementation
  • passionate about AI's potential and committed to developing safe and beneficial systems

Nice to Have

  • Experience with reinforcement learning.
  • Experience porting ML workloads between different types of accelerators.
  • Familiarity with LLM training methodologies.

Benefits

  • competitive compensation and benefits
  • optional equity donation matching
  • generous vacation and parental leave
  • flexible working hours
  • lovely office space in which to collaborate with colleagues

Team

Structure: Reinforcement Learning teams

Additional Information

  • Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.
Required Skills
acceleratorsreinforcement learning.LLM training methodologies. acceleratorsreinforcement learning.LLM training methodologies.
Your first international client?

Don't lose them over invoicing

Clients ghost freelancers with unprofessional invoicing. Glopay gives you a real EU company partnership so they take you seriously from invoice #1.

Instant EU company partnership
Invoice builder with your branding
Automated payment reminders
Real-time payment tracking
Get EU company now
Ready in 24 hours
About company
Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
All jobs at Anthropic Visit website
Job Details
Department RL Teams
Category other
Posted 2 hours ago