About the Role

The role involves assessing the performance of artificial intelligence models in German language tasks, ensuring accuracy and fluency through structured evaluation and feedback loops.

Responsibilities

Assess German language outputs from AI systems
Apply standardized evaluation criteria consistently
Report on model performance trends
Contribute to test case development
Support refinement of training datasets
Maintain detailed records of evaluation results
Collaborate with engineering and research teams
Adapt to evolving evaluation requirements
Ensure linguistic authenticity in AI responses
Participate in calibration sessions

Nice to Have

Experience with NLP applications
Knowledge of translation technologies
Exposure to AI testing environments
Sensitivity to regional language variations
Familiarity with quality scoring systems
Adaptability to technical workflows
Proven remote work experience

Benefits

Flexible work schedule
Remote-first culture
Professional development opportunities
Collaborative team environment
Access to cutting-edge AI tools
Competitive compensation
Health and wellness benefits
Paid time off
Inclusive workplace policies
Support for continuous learning

Compensation

Competitive salary and benefits package

Work Arrangement

Remote position with flexible hours

Team

Collaborative team focused on AI-driven language technologies

Responsibilities

Evaluate AI-generated German text for linguistic accuracy and naturalness
Develop and apply benchmarking frameworks for machine translation systems
Identify patterns in model output to guide improvements
Collaborate with engineers to refine AI training data
Document evaluation findings with detailed annotations
Ensure consistency across language quality assessments
Provide feedback on model behavior in real-world scenarios
Support the development of evaluation guidelines
Work with cross-functional teams to align on quality standards
Track performance metrics over time

Qualifications

Native proficiency in German with strong command of grammar and idiomatic usage
Fluency in English for technical collaboration
Experience in linguistics, translation, or language technology
Familiarity with AI or machine learning concepts
Detail-oriented with strong analytical skills
Ability to follow structured evaluation protocols
Prior experience in language quality assessment preferred
Comfortable working with technical interfaces and data
Bachelor’s degree in a relevant field or equivalent experience
Proven ability to work independently and meet deadlines

Preferred Skills

Knowledge of natural language processing tools
Experience with translation memory systems
Exposure to AI benchmarking or testing frameworks
Understanding of cross-cultural communication nuances
Familiarity with German dialects or regional variations
Technical aptitude for learning new software platforms
Experience in remote work environments

Available for qualified candidates

LILT is hiring an AI Benchmark Engineer | Native Language Specialist - German

About the Role

Responsibilities

Nice to Have

Benefits

Compensation

Work Arrangement

Team

Responsibilities

Qualifications

Preferred Skills

Similar Jobs

Field Service Engineer (field-based across South West Ireland)

Internship Content Creative Strategist Start it @CBC

Bilingual In-Home Mental Health Therapist (BA or MA) - Kane County (Aurora & Elgin Towns)

Senior Sales Manager (video surveillance, security solutions)

Senior IT Specialist

Senior, Software Engineer - Data Pipeline