Tavus is hiring a Senior Data Engineer to own our entire data strategy, from sourcing and curation to structuring and optimization, powering our AI models and products. You will build and scale the data pipelines that feed AI model training, and you will ensure data quality across multimodal datasets, with a particular focus on video and audio.
What You'll Do
- Anticipate future data needs and curate diverse, high-quality datasets to ensure AI models reach their full potential.
- Collaborate closely with ML engineers to optimize datasets for maximum model performance, efficiency, and inference accuracy.
- Own, build, and scale the data pipeline, including data sourcing, curation, filtering, and preprocessing across various data modalities.
- Source, collect, and curate the best multimodal data from web scraping, third-party deals, and unconventional sources.
- Own the structuring of video and audio datasets so they are ready for AI model training.
- Own the data labeling process and build automated workflows for cleaning, labeling, and structuring data efficiently.
- Work closely with data annotation teams to ensure high-quality labeled data for ML models.
- Unlock and use internal platform data to drive smarter decisions and supercharge growth.
- Build tight, efficient, and lasting pipelines, datasets, and workflows.
What We're Looking For
- Extreme ownership of data strategy end-to-end.
- Strategic mindset to anticipate data needs and help shape AI development.
- Automation expertise for data cleaning, structuring, and labeling workflows.
- ML-first mindset to structure datasets that maximize AI model accuracy.
- Ability to move fast while maintaining accuracy.
- Ability to create best practices in new domains.
- Strong experience with Python, SQL, and large-scale data processing tools.
Nice to Have
- Previous work with LLMs and multimodal data.
- Experience with in-house video data collection and relevant studio setups.
- Knowledge of best practices for multimodal video and audio data collection.
Technical Stack
- Python
- SQL
- Large-scale data processing tools
Benefits & Compensation
- Flexible work schedule
- Unlimited PTO
- Extremely competitive healthcare
- Gear stipends
Work Mode
This is a global position with a flexible work schedule.
Tavus is driven by people, with a diverse and supportive team where success is shared by all. We are inclusive of all, and diversity is at the core of our hiring, communication, and work.


