About the Role
The role involves processing and annotating Portuguese text data to help train and refine large language models, ensuring high accuracy and consistency in linguistic patterns and outputs.
Responsibilities
- Review and label Portuguese text according to defined guidelines
- Identify linguistic features in written content for model training
- Ensure annotations follow established quality standards
- Flag ambiguous or problematic language samples
- Collaborate with team members to resolve edge cases
- Maintain consistency across annotation tasks
- Provide feedback on annotation guidelines
- Participate in training for new project requirements
- Meet productivity and accuracy targets
- Adapt to updates in annotation protocols
- Assist in validating model-generated Portuguese text
- Report recurring issues in data sets
- Support quality assurance checks
- Contribute to improving data categorization systems
- Work with structured and unstructured text formats
- Handle sensitive language content with care
- Follow data privacy and security practices
- Track progress using internal tools
- Communicate with project leads on challenges
- Complete tasks within assigned timelines
Nice to Have
- Formal education in linguistics or related field
- Previous work in NLP data preparation
- Experience with Portuguese from multiple regions
- Familiarity with machine learning concepts
- Background in translation or editing
Compensation
Competitive salary based on experience
Work Arrangement
Remote
Team
Collaborative team focused on language data quality
Language Focus
- Primary language for annotation is European and Brazilian Portuguese
- Tasks involve distinguishing regional variations and dialects
Project Duration
- Initial assignment may be time-bound with potential for extension
- Workload varies based on project phase
Not specified