The Machine Learning Evaluation Specialist is a remote research role at G2i Inc., where domain experts create and evaluate complex machine learning challenges that test the limits of current AI systems. This is not an engineering position but a research-intensive role requiring advanced knowledge in a technical or scientific field intersecting with machine learning.
What You'll Do
- Propose and frame original, research-grade ML problems rooted in your domain
- Design evaluation tasks that require specialized knowledge well beyond standard pipelines
- Assess AI-generated solutions for correctness, creativity, and methodological rigor — and explain exactly where and why they fall short
- Document problem difficulty, required domain knowledge, and expected failure modes
What We're Looking For
- Graduate-level expertise (MS or PhD preferred) in a scientific or technical domain that intersects with ML
- Strong working knowledge of ML methods — model selection, feature engineering, evaluation metrics
- Deep familiarity with active research problems in your field — you know where general ML knowledge runs out
- Excellent written communication is essential: you must be able to articulate complex problems clearly and precisely
- Self-motivated and comfortable working independently on intellectually demanding tasks
Benefits & Compensation
- Fully remote — work from anywhere
- Assessment required — paid if approved
- Independent contractor (1099) — not compatible with F-1 OPT, STEM OPT, or visa statuses requiring W-2 employment or employer sponsorship
- ⚠️ This is a project-based, freelance opportunity with no guaranteed hours. We recommend keeping other work options open while waiting for project assignment.
All qualified applicants will receive consideration without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.





