Remote (Global) Full-time

Tavus is hiring a Senior Data Engineer

About the Role

Tavus is hiring a Senior Data Engineer to own our entire data strategy, from sourcing and curation to structuring and optimization, powering our AI models and products. You will be responsible for building and scaling data pipelines that influence AI model training, ensuring data quality across multimodal datasets with a particular focus on video and audio.

What You'll Do

  • Anticipate future data needs and curate diverse, high-quality datasets to ensure AI models reach their full potential.
  • Collaborate closely with ML engineers to optimize datasets for maximum model performance, efficiency, and inference accuracy.
  • Own, build, and scale the data pipeline, including data sourcing, curation, filtering, and preprocessing across various data modalities.
  • Source, collect, and curate the best multimodal data from web scraping, third-party deals, and unconventional sources.
  • Own the challenge of structuring video and audio datasets for AI success.
  • Own the data labeling process and build automated workflows for cleaning, labeling, and structuring data efficiently.
  • Work closely with data annotation teams to ensure high-quality labeled data for ML models.
  • Unlock and use internal platform data to drive smarter decisions and supercharge growth.
  • Build tight, efficient, and lasting pipelines, datasets, and workflows.

What We're Looking For

  • Extreme ownership of data strategy end-to-end.
  • Strategic mindset to anticipate data needs and help shape AI development.
  • Automation expertise for data cleaning, structuring, and labeling workflows.
  • ML-first mindset to structure datasets that maximize AI model accuracy.
  • Ability to move fast while maintaining accuracy.
  • Ability to create best practices in new domains.
  • Strong experience with Python, SQL, and large-scale data processing tools.

Nice to Have

  • Previous work with LLMs and multimodal data.
  • Experience with in-house video data collection and relevant studio setups.
  • Knowledge of best practices for multimodal video and audio data collection.

Technical Stack

  • Python
  • SQL
  • Large-scale data processing tools

Benefits & Compensation

  • Flexible work schedule
  • Unlimited PTO
  • Extremely competitive healthcare
  • Gear stipends

Work Mode

This is a global position with a flexible work schedule.

Tavus is driven by people, with a diverse and supportive team where success is shared by all. We are inclusive to all, and diversity is at the core of our hiring, communication, and work.

Required Skills
PythonSQLlarge-scale data processingdata engineeringdata pipelinesdata modelingcloud data warehousesETL/ELTdata qualitydata governancecollaborationcommunicationproject management
Earn more as a remote developer

Performance pay that rewards your skills

Iglu's revenue-sharing model means top performers earn significantly more than traditional salaries. Choose your projects, deliver great work, and see it reflected in your pay.

Revenue-sharing compensation
Project choice & autonomy
International client base
Career growth support
Check compensation
Top earners exceed market rate
About company
Tavus

Tavus builds the human layer of AI, making human-AI interaction as natural as face-to-face interaction through pioneering research in multi-modal AI models for human perception and avatar rendering. Their models power text-to-video AI avatars and real-time conversational video experiences across industries like healthcare, recruiting, sales, and education.

Visit website
Job Details
Category data
Posted 8 months ago