Remote Remote (Global) Employment

Pathway is hiring an AI Benchmark & Datasets Engineer/ Researcher Internship

About the Role

The intern will work on creating, refining, and analyzing datasets used for benchmarking artificial intelligence models, contributing to research that supports more accurate and robust AI evaluation.

Responsibilities

  • Design and implement evaluation frameworks for AI models
  • Develop high-quality datasets tailored to specific AI tasks
  • Collaborate with researchers to identify data requirements for benchmarking
  • Assess dataset quality, bias, and representativeness
  • Optimize data pipelines for efficient dataset generation
  • Contribute to documentation for datasets and benchmarks
  • Evaluate model performance using standardized test sets
  • Identify limitations in existing benchmarks
  • Propose improvements to current evaluation methodologies
  • Work with engineering teams to integrate benchmarks into testing workflows
  • Analyze results from model evaluations to inform future iterations
  • Ensure datasets comply with ethical and privacy standards
  • Support version control and reproducibility of datasets
  • Participate in peer reviews of dataset design and usage
  • Stay current with advancements in AI evaluation techniques

Nice to Have

  • Prior research or engineering experience in AI evaluation
  • Contributions to open-source AI or dataset projects
  • Experience with large-scale dataset management
  • Publication record in machine learning or AI-related venues
  • Familiarity with reproducibility challenges in AI research

Compensation

Competitive hourly rate or stipend based on experience and location

Work Arrangement

Hybrid or remote options available; some team meetings may require time zone alignment

Team

Collaborative research and engineering team focused on AI evaluation and dataset development

Application Process

  • Interested candidates should submit a resume, cover letter, and a sample of prior work involving data or AI projects
  • Shortlisted applicants will be asked to complete a technical assessment related to dataset design or model evaluation

Internship Duration

  • This is a summer internship typically lasting 10 to 12 weeks
  • Start date is flexible within a June to July window

No visa sponsorship available for this internship position

Relocating to Thailand?

Visa and work permit handled by experts

SVBL manages your entire visa process — from application to approval. Work permits, extensions, and compliance all covered. One partner for legal, immigration, and settling in.

Work permit processing
Visa extensions & renewals
Immigration compliance
Banking & housing guidance
Get free consultation
Free initial consultation
About company
Pathway

The first post-transformer frontier model that solved continual learning.

Pathway has developed BDH, a massively parallel post-Transformer reasoning architecture enabling generalization over time. Created by scientists and researchers, the company is led by CEO Zuzanna Stamirowska, CTO Jan Chorowski, and CSO Adrian Kosowski. The team has previously built AI tooling that has amassed 126k stars on GitHub.

Pathway is backed by prominent figures including Lukasz Kaiser (co-inventor of Transformers), Martin Farach-Colton (NYU), and Jacques Attali, with support from investors such as TQ, Kadmos, ID4 Ventures, RBV, Inovo, and Market One Capital.

All jobs at Pathway Visit website
Job Details
Department R&D
Category other
Posted 2 months ago