Hybrid Full-time

Tavus is hiring an AI Researcher (Voice)

About the Role

Tavus is looking for an AI Researcher (Voice) to lead our research efforts on generative video and audio models for lifelike, expressive avatars in real-time applications. You will be a key member of our Core AI team, helping to build the human layer of AI to make human-AI interaction as natural as face-to-face interaction.

What You'll Do

  • Lead research efforts on generative video and audio models, such as text-to-speech, speech-to-speech, and audio-to-expression.
  • Work with the Applied ML team to help productionize research.
  • Stay relevant with the latest advancements in AI and help create them.

What We're Looking For

  • Proven experience with flow matching, diffusion models, and auto regressive networks in the audio domain.
  • Experience training deep learning models from medium-sized to large models.
  • Experience building streaming text-to-speech models or speech-to-speech models.
  • Strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping.
  • Knowledge of state-of-the-art architectures in representation learning for audio or image domains, including face animation.
  • Excellent programming skills and fluency in PyTorch.
  • Evidence of original research, with publications in top-tier or solid second-tier venues such as CVPR, NeurIPS, or BMVC.
  • Excitement about building lifelike, expressive avatars for real-time applications.

Nice to Have

  • Skills in 3D graphics and Gaussian splatting.
  • Additional experience with generative models.
  • PhD or equivalent research experience.
  • Experience leading research teams.
  • Knowledge of best practices in software development.

Technical Stack

  • PyTorch

Team & Environment

You will be part of the Core AI team at Tavus, a group dedicated to advancing the state of generative media. Our work is driven by our team, and our success is shared by all.

Benefits & Compensation

  • Flexible work schedule
  • Unlimited PTO
  • Extremely competitive healthcare plans
  • Gear stipends

Work Mode

This is a hybrid position based in San Francisco.

Tavus is an equal opportunity employer. Diversity is at the core of how we hire, communicate, and work, and we are looking for culture creators.

Required Skills
PyTorchMachine LearningDeep LearningVoice SynthesisGenerative AIAudio ProcessingPythonResearchModel DevelopmentExperimentationData AnalysisNeural Networks
Visa expiring soon?

Extend or switch without leaving Thailand

Running out of time on your current visa? SVBL identifies your best option — extension, category switch, or long-term visa — and handles the entire process.

Visa extensions & category switches
LTR & DTV visa applications
90-day reporting managed
Overstay prevention
Check your options
Prevent overstay issues
About company
Tavus

Tavus builds the human layer of AI, making human-AI interaction as natural as face-to-face interaction through pioneering research in multi-modal AI models for human perception and avatar rendering. Their models power text-to-video AI avatars and real-time conversational video experiences across industries like healthcare, recruiting, sales, and education.

Visit website
Job Details
Category data
Posted 8 months ago