BJAK is looking for a Data Engineer to join our team developing impactful AI features. You will be tasked with fine-tuning state-of-the-art models, designing evaluation frameworks, and bringing these systems into production. Your work will ensure our models are intelligent, safe, trustworthy, and effective at scale.
What You'll Do
- Collect, clean, and preprocess user-generated text and image data for fine-tuning large models.
- Design and manage scalable data labeling pipelines, leveraging both crowdsourcing and in-house labeling teams.
- Build and maintain automated datasets for content moderation, such as safe versus unsafe content.
- Collaborate with researchers and engineers to ensure datasets are high-quality, diverse, and aligned with model training needs.
What We're Looking For
- Proven experience preparing datasets for machine learning or fine-tuning large models.
- Strong skills in data cleaning, preprocessing, and transformation for both text and image data.
- Hands-on experience with data labeling workflows and quality assurance for labeled data.
- Familiarity with building and maintaining moderation datasets for safety, compliance, and filtering.
- Proficiency in scripting with Python and SQL and working with large-scale data pipelines.
Technical Stack
- Python
- SQL
Team & Environment
You'll join a flat structure where you'll have real ownership. You'll be fully involved in direction setting and consensus decision making.
Benefits & Compensation
- Health, dental & vision insurance.
- Global travel insurance for you and your dependents.
- Unlimited, flexible time off.
- Housing rental subsidies.
- Quality company cafeteria.
- Overtime meals.
- High-impact role with visibility across product, data, and engineering.
- Global exposure to product development.
- Top-of-market compensation and performance-based bonuses.
Work Mode
This is a hybrid position open to candidates in Malaysia, Thailand, Taiwan, and Japan.
BJAK is an equal opportunity employer.


