San Francisco, CA | New York City, NY Hybrid Employment $300,000 - $405,000 USD

Anthropic is hiring a Full-Stack Software Engineer, Reinforcement Learning

About the Role

This position involves building and maintaining software systems that enable the development and deployment of reinforcement learning models, bridging research and production environments through robust full-stack engineering practices.

Responsibilities

  • Design and implement full-stack systems that support reinforcement learning workflows
  • Collaborate with research teams to translate experimental prototypes into production-grade code
  • Build scalable backend services to manage training pipelines and model evaluation
  • Develop user interfaces for monitoring and interacting with AI training processes
  • Optimize system performance for large-scale distributed training runs
  • Ensure robustness, reliability, and observability across all system components
  • Integrate machine learning models into end-to-end software platforms
  • Work closely with infrastructure teams to align with security and compliance standards
  • Maintain clean, well-documented codebases with comprehensive testing
  • Troubleshoot and resolve issues across the stack in production environments
  • Contribute to architectural decisions for new features and systems
  • Support deployment automation and continuous integration pipelines
  • Help define best practices for software engineering within the AI research context
  • Iterate quickly based on feedback from researchers and product stakeholders
  • Balance rapid prototyping with long-term maintainability and scalability

Nice to Have

  • Experience contributing to machine learning training pipelines
  • Background in reinforcement learning or related AI subfields
  • Prior work on systems supporting human-in-the-loop training
  • Familiarity with frontend frameworks such as React or Angular
  • Knowledge of authentication, authorization, and security best practices
  • Experience with large-scale data processing systems
  • Contributions to open-source projects in AI or systems engineering
  • MS or PhD in Computer Science or related discipline

Compensation

Competitive salary and benefits package commensurate with experience

Work Arrangement

Hybrid or remote options available; some roles may require office presence depending on team needs

Team

A rapidly expanding team of dedicated researchers, engineers, policy specialists, and business leaders focused on advancing AI safety and capabilities

Research Collaboration

Engineers regularly partner with research teams to implement novel algorithms and evaluate system performance in real-world settings

Focus on Safety

All systems are designed with safety, interpretability, and alignment considerations integrated throughout the development lifecycle

Engineering Culture

Emphasis on code quality, peer review, documentation, and iterative improvement

Impact

Work directly contributes to advancing the state of AI systems while maintaining rigorous safety standards

Growth Opportunities

Opportunities to lead projects, mentor junior engineers, and shape technical direction as the team scales

Visa sponsorship available for qualified candidates

Your first international client?

Don't lose them over invoicing

Clients ghost freelancers with unprofessional invoicing. Glopay gives you a real EU company partnership so they take you seriously from invoice #1.

Instant EU company partnership
Invoice builder with your branding
Automated payment reminders
Real-time payment tracking
Get EU company now
Ready in 24 hours
About company
Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
All jobs at Anthropic Visit website
Job Details
Department Reinforcement Learning
Category fullstack
Posted 2 hours ago