About the Role
The engineer will simulate real-world attacks on AI systems to uncover weaknesses, improve model robustness, and ensure safe deployment, particularly in Japanese-language contexts.
Responsibilities
- Conduct adversarial testing on AI models to expose potential risks
- Design and execute attack simulations targeting language model behavior
- Analyze model outputs for safety violations or policy breaches
- Collaborate with research teams to interpret findings and recommend improvements
- Develop tools and methodologies for systematic red teaming
- Focus on Japanese language patterns and cultural nuances in testing
- Document vulnerabilities and provide detailed reports
- Assist in creating benchmarks for model safety evaluation
- Stay current with emerging threats in AI systems
- Evaluate responses to harmful or misleading prompts
- Test multilingual capabilities with emphasis on Japanese
- Identify biases or inconsistencies in model behavior
- Work closely with engineering to validate fixes
- Contribute to internal frameworks for automated red teaming
- Support evaluation of model updates and new releases
- Maintain confidentiality of sensitive testing procedures
- Use creative prompting strategies to probe model boundaries
- Assess model adherence to ethical guidelines
- Provide feedback on user safety experience
- Participate in cross-team discussions on AI risk mitigation
- Track and categorize discovered vulnerabilities
- Help prioritize risks based on severity and impact
- Ensure testing aligns with regulatory and compliance standards
- Contribute to documentation of red team protocols
- Support training for other team members on red team methods
Compensation
Competitive salary and benefits package
Work Arrangement
Remote with flexible hours
Team
Collaborative team focused on AI safety and evaluation
Language Focus
This position requires native or near-native fluency in Japanese to effectively test language-specific behaviors, cultural references, and regional nuances in AI outputs.
Security and Ethics
Candidates must demonstrate a responsible approach to probing AI systems, ensuring tests are conducted ethically and without unnecessary risk.
Available for qualified candidates
