Responsibilities
- Evaluate LLM Architecture Logic: review AI-generated explanations of model architectures, loss functions, and backpropagation for technical accuracy
- Audit Code & Notebooks: validate ML-specific code (e.g., training loops, data preprocessing scripts, or model evaluations) for efficiency and correctness
- Refine RLHF Frameworks: provide the high-quality human feedback necessary to align models with human intent, safety, and helpfulness
- Analyze Model Reasoning: critically assess how an AI model navigates complex chain-of-thought (CoT) prompts and identify where the reasoning breaks down
- Benchmark Performance: conduct comparative testing between different model outputs based on specific technical taxonomies and performance metrics
Benefits
- competitive pay rates
- flexible hours
- ability to work from home
Work Arrangement
Hybrid
Additional Information
- You must be prepared to complete paid tasks that require one hour of uninterrupted work, though many are shorter.
- By submitting your application, you agree that Prolific may collect your personal data for recruiting and global organisation planning. Prolific's Candidate Privacy Notice explains what personal information Prolific may process, where Prolific may process your personal information, its purposes for processing your personal information, and the rights you can exercise over Prolific use of your personal personal information.