Remote, California, United States Remote (Global) USD 150,000 – 250,000 / year

Deepgram is hiring a Research Staff, Voice AI Foundations

About the Role

The role involves conducting foundational research in voice AI, designing novel models for speech processing, and contributing to the scientific and technical advancement of core platform capabilities.

Responsibilities

  • Develop and refine machine learning models for speech recognition
  • Conduct experiments to evaluate model accuracy and efficiency
  • Publish findings in peer-reviewed journals and conferences
  • Collaborate with engineering teams to integrate research into production systems
  • Analyze large-scale audio datasets to identify patterns and insights
  • Improve training pipelines for deep neural networks
  • Explore new architectures for end-to-end speech processing
  • Optimize inference speed and computational resource usage
  • Investigate domain adaptation techniques for diverse speaker populations
  • Enhance robustness of models under noisy conditions
  • Contribute to open-source projects related to speech technology
  • Stay current with advancements in natural language and audio processing
  • Design evaluation frameworks for model comparison
  • Participate in technical planning and roadmap discussions
  • Mentor junior researchers and interns
  • Ensure ethical use of data and model transparency
  • Support patent disclosures for novel inventions
  • Present work internally and at external events
  • Troubleshoot model failures and diagnose edge cases
  • Collaborate across disciplines to solve complex problems

Nice to Have

  • Postdoctoral research experience
  • Industry research lab experience
  • Contributions to major speech recognition frameworks
  • Experience with low-resource languages
  • Work on speaker diarization or voice activity detection
  • Knowledge of self-supervised learning methods
  • Experience deploying models in production
  • Background in multilingual speech processing
  • Familiarity with edge-device constraints
  • Involvement in benchmarking initiatives

Compensation

Competitive salary with performance-based incentives

Work Arrangement

Hybrid work model with flexible scheduling options

Team

Collaborative research environment within a specialized AI engineering unit

Research Focus

  • Primary emphasis on fundamental improvements in speech representation and modeling
  • Exploration of transformer-based and recurrent architectures for audio
  • Investigation into unsupervised and semi-supervised learning for voice data

Impact Goals

  • Deliver research that directly enhances product capabilities
  • Advance state-of-the-art in accuracy, latency, and scalability
  • Enable new use cases through foundational breakthroughs

Available for qualified candidates

About company
Deepgram
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale. More than 200,000 developers and 1,300+ organizations build voice offerings that are ‘Powered by Deepgram’.
All jobs at Deepgram Visit website
Job Details
Department Research
Category other
Posted 2 months ago