Germany (Remote) Remote (Global) Contract $50 – $75 per hour

LILT is hiring an AI Benchmark Engineer | Native Language Specialist - German

About the Role

The role involves assessing the performance of artificial intelligence models in German language tasks, ensuring accuracy and fluency through structured evaluation and feedback loops.

Responsibilities

  • Assess German language outputs from AI systems
  • Apply standardized evaluation criteria consistently
  • Report on model performance trends
  • Contribute to test case development
  • Support refinement of training datasets
  • Maintain detailed records of evaluation results
  • Collaborate with engineering and research teams
  • Adapt to evolving evaluation requirements
  • Ensure linguistic authenticity in AI responses
  • Participate in calibration sessions

Nice to Have

  • Experience with NLP applications
  • Knowledge of translation technologies
  • Exposure to AI testing environments
  • Sensitivity to regional language variations
  • Familiarity with quality scoring systems
  • Adaptability to technical workflows
  • Proven remote work experience

Benefits

  • Flexible work schedule
  • Remote-first culture
  • Professional development opportunities
  • Collaborative team environment
  • Access to cutting-edge AI tools
  • Competitive compensation
  • Health and wellness benefits
  • Paid time off
  • Inclusive workplace policies
  • Support for continuous learning

Compensation

Competitive salary and benefits package

Work Arrangement

Remote position with flexible hours

Team

Collaborative team focused on AI-driven language technologies

Responsibilities

  • Evaluate AI-generated German text for linguistic accuracy and naturalness
  • Develop and apply benchmarking frameworks for machine translation systems
  • Identify patterns in model output to guide improvements
  • Collaborate with engineers to refine AI training data
  • Document evaluation findings with detailed annotations
  • Ensure consistency across language quality assessments
  • Provide feedback on model behavior in real-world scenarios
  • Support the development of evaluation guidelines
  • Work with cross-functional teams to align on quality standards
  • Track performance metrics over time

Qualifications

  • Native proficiency in German with strong command of grammar and idiomatic usage
  • Fluency in English for technical collaboration
  • Experience in linguistics, translation, or language technology
  • Familiarity with AI or machine learning concepts
  • Detail-oriented with strong analytical skills
  • Ability to follow structured evaluation protocols
  • Prior experience in language quality assessment preferred
  • Comfortable working with technical interfaces and data
  • Bachelor’s degree in a relevant field or equivalent experience
  • Proven ability to work independently and meet deadlines

Preferred Skills

  • Knowledge of natural language processing tools
  • Experience with translation memory systems
  • Exposure to AI benchmarking or testing frameworks
  • Understanding of cross-cultural communication nuances
  • Familiarity with German dialects or regional variations
  • Technical aptitude for learning new software platforms
  • Experience in remote work environments

Available for qualified candidates

Freelancing without stability?

Get steady projects, keep your freedom

Iglu connects you with international clients and handles contracts, payments, and admin. You get consistent work and flexibility — no more chasing invoices or worrying about gaps.

Consistent client projects
Contract & payment management
Flexible work schedule
Revenue-sharing compensation
See open positions
Work from anywhere
About company
LILT
LILT builds multilingual AI and human-verified services that make the world's information available to everyone, regardless of language. The company serves Enterprises, Governments, and AI Developers worldwide.
All jobs at LILT Visit website
Job Details
Department LiltLancer Community, AI Data Services
Category other
Posted 2 months ago