Remote; Argentina (Remote); Berlin, Germany; Brazil (Remote); China (Remote); Egypt (Remote); France (Remote); India (Remote); Indonesia (Remote); Japan (Remote); Kenya (Remote); Korea (Remote); Mexico (Remote); Russia; Taiwan (Remote); Thailand (Remote); Turkey (Remote); Ukraine (Remote); Vietnam (Remote) Remote (Global) Contract

LILT is hiring a Customer Service & Support AI Rater & Evaluator

About the Role

This role involves assessing AI-generated customer support content for quality, correctness, and appropriateness. Evaluators follow structured guidelines to rate responses and help refine machine learning models used in customer-facing applications.

Responsibilities

  • Review AI-generated replies in customer service contexts for clarity and correctness
  • Rate responses based on accuracy, tone, and adherence to support best practices
  • Identify and flag inappropriate, off-topic, or factually incorrect outputs
  • Follow detailed evaluation criteria for consistency across assessments
  • Provide feedback that contributes to model improvement
  • Handle multiple evaluation tasks within defined timeframes
  • Maintain high attention to detail during repetitive assessment work
  • Report anomalies or systemic issues in AI behavior
  • Ensure evaluations reflect diverse customer perspectives
  • Adhere to data privacy and confidentiality standards
  • Work independently with minimal supervision
  • Meet quality and productivity benchmarks
  • Stay updated on evolving evaluation guidelines
  • Collaborate with team leads on edge cases
  • Contribute to training data refinement for AI systems
  • Evaluate multilingual support responses where applicable
  • Assess empathy and professionalism in AI tone
  • Judge response relevance to specific customer intents
  • Identify cultural or contextual misalignments in replies
  • Support efforts to reduce bias in AI-generated content
  • Use annotation tools to submit evaluations
  • Complete tasks according to project timelines
  • Maintain consistent performance across evaluation cycles
  • Participate in calibration exercises with peers
  • Follow ethical guidelines when rating sensitive content

Nice to Have

  • Prior work in customer support or technical support roles
  • Experience evaluating AI or NLP systems
  • Background in language quality assessment
  • Familiarity with rating interfaces for machine learning
  • Knowledge of linguistic evaluation metrics
  • Experience with low-code or annotation platforms
  • Understanding of sentiment analysis concepts
  • Exposure to human-in-the-loop AI systems
  • Work history in remote, asynchronous environments
  • Demonstrated ability to follow complex instructions

Compensation

Competitive hourly rate based on experience and location

Work Arrangement

Remote

Team

Distributed team supporting AI training and evaluation initiatives

What You’ll Do

  • Evaluate AI-generated customer service replies across various scenarios
  • Apply scoring rubrics to assess response quality and safety
  • Help train AI models by identifying strengths and weaknesses in outputs
  • Contribute to improving the realism and effectiveness of support bots
  • Work on tasks that require consistent, thoughtful judgment

Who You Are

  • Detail-oriented with a strong sense of accuracy
  • Able to articulate why a response works or fails
  • Comfortable working with structured feedback systems
  • Sensitive to tone, empathy, and customer needs
  • Reliable and committed to high-quality output

Not applicable

About company
LILT
LILT builds multilingual AI and human-verified services that make the world's information available to everyone, regardless of language. The company serves Enterprises, Governments, and AI Developers worldwide.
All jobs at LILT Visit website
Job Details
Department LiltLancer Community, AI Data Services
Category other
Posted 2 months ago