About the Role
This position involves transcribing audio recordings in Somali and reviewing transcriptions for accuracy, clarity, and consistency. The work supports the development of high-quality language datasets used in artificial intelligence systems.
Responsibilities
- Transcribe spoken Somali from audio recordings with high accuracy
- Identify and label speakers in multi-person audio clips
- Follow detailed guidelines for punctuation, formatting, and style
- Review existing transcriptions for errors in spelling, grammar, or timing
- Flag inappropriate or sensitive content according to protocol
- Maintain consistent transcription quality across diverse dialects
- Adapt to feedback and revise work as needed
- Meet productivity targets without compromising accuracy
- Report technical issues with audio files when encountered
- Handle confidential data in compliance with privacy standards
- Work independently with minimal supervision
- Use transcription software tools efficiently
- Manage time effectively to meet deadlines
- Communicate clearly about project challenges
- Stay updated on changes to transcription guidelines
- Ensure transcriptions reflect the cultural and linguistic nuances of Somali
- Transcribe content across a range of topics and registers, from casual conversation to formal speech
- Verify timestamps for spoken segments
- Distinguish between similar-sounding words in context
- Preserve original meaning while transcribing idiomatic expressions
Nice to Have
- Formal education in linguistics or language studies
- Prior experience in speech transcription for technology projects
- Familiarity with African languages beyond Somali
- Experience working in remote, asynchronous teams
- Background in quality assurance or editing
- Knowledge of phonetics or dialectology
- Experience with AI or machine learning data preparation
- Previous work in multilingual environments
- Training in language instruction or interpretation
- Exposure to digital accessibility standards
Compensation
Hourly
Work Arrangement
Remote
Team
Distributed team across multiple time zones
Project Focus
- This project emphasizes accurate representation of spoken Somali in written form to support the training of speech recognition systems.
- Work contributes to improving language technology for underrepresented languages.
Work Schedule
- Flexible hours with an expected weekly commitment.
- Tasks are assigned in batches and must be completed within set deadlines.
Not offered