Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY Hybrid Employment $350,000 - $500,000 USD

Anthropic is hiring a Research Engineer, Reward Models Platform

Responsibilities

  • Construct infrastructure to support rapid experimentation with reward signals, including tools for creating evaluation rubrics and analyzing human feedback data
  • Build automated systems to assess reward quality and detect anomalies such as reward hacking or unintended behaviors
  • Develop software that enables side-by-side comparison of different reward modeling approaches and their impact
  • Design end-to-end pipelines that streamline reward model development, from data collection to deployment
  • Implement observability tools to monitor reward signal integrity during training processes
  • Work closely with research teams to convert scientific objectives into scalable technical solutions
  • Improve existing platforms for better speed, stability, and usability
  • Help establish and document standardized practices for reward model development

Team

a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems

Want to work from Thailand?

Join a remote network built for tech talent

Iglu gives you real employment in Southeast Asia — visa, work permit, and projects included. Pick what you work on, earn performance-based pay, and live where you want.

Legal employment in Thailand & Vietnam
Choose your own projects
Performance-based revenue sharing
Relocation support available
Join Iglu
200+ professionals worldwide
About company
Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
All jobs at Anthropic Visit website
Job Details
Department Fine-Tuning
Category data
Posted 2 hours ago