Responsibilities
- help to safely advance the capabilities of our models in secure coding, vulnerability remediation, and other areas of defensive cybersecurity.
- develop novel approaches and realize them in code.
- design and implement RL environments
- conduct experiments and evaluations
- deliver your work into production training runs
- collaborate with other researchers, engineers, and cybersecurity specialists across and outside Anthropic.
Requirements
- Have experience in cybersecurity research.
- Have experience with machine learning.
- Have strong software engineering skills.
- Can balance research exploration with engineering implementation.
- Are passionate about AI's potential and committed to developing safe and beneficial systems.
Nice to Have
- Professional experience in security engineering, fuzzing, detection and response, or other applied defensive work.
- Experience participating in or building CTF competitions and cyber ranges.
- Academic research experience in cybersecurity.
- Familiarity with RL techniques and environments.
- Familiarity with LLM training methodologies.
Team
Structure: The Horizons team leads Anthropic's reinforcement learning (RL) research and development, playing a critical role in advancing our AI systems. We've contributed to every Claude release, with significant impact on the autonomy, coding, and reasoning capabilities of Anthropic's models.