About the Role

The role involves conducting foundational research in voice AI, designing novel models for speech processing, and contributing to the scientific and technical advancement of core platform capabilities.

Responsibilities

Develop and refine machine learning models for speech recognition
Conduct experiments to evaluate model accuracy and efficiency
Publish findings in peer-reviewed journals and conferences
Collaborate with engineering teams to integrate research into production systems
Analyze large-scale audio datasets to identify patterns and insights
Improve training pipelines for deep neural networks
Explore new architectures for end-to-end speech processing
Optimize inference speed and computational resource usage
Investigate domain adaptation techniques for diverse speaker populations
Enhance robustness of models under noisy conditions
Contribute to open-source projects related to speech technology
Stay current with advancements in natural language and audio processing
Design evaluation frameworks for model comparison
Participate in technical planning and roadmap discussions
Mentor junior researchers and interns
Ensure ethical use of data and model transparency
Support patent disclosures for novel inventions
Present work internally and at external events
Troubleshoot model failures and diagnose edge cases
Collaborate across disciplines to solve complex problems

Nice to Have

Postdoctoral research experience
Industry research lab experience
Contributions to major speech recognition frameworks
Experience with low-resource languages
Work on speaker diarization or voice activity detection
Knowledge of self-supervised learning methods
Experience deploying models in production
Background in multilingual speech processing
Familiarity with edge-device constraints
Involvement in benchmarking initiatives

Compensation

Competitive salary with performance-based incentives

Work Arrangement

Hybrid work model with flexible scheduling options

Team

Collaborative research environment within a specialized AI engineering unit

Research Focus

Primary emphasis on fundamental improvements in speech representation and modeling
Exploration of transformer-based and recurrent architectures for audio
Investigation into unsupervised and semi-supervised learning for voice data

Impact Goals

Deliver research that directly enhances product capabilities
Advance state-of-the-art in accuracy, latency, and scalability
Enable new use cases through foundational breakthroughs

Available for qualified candidates

Deepgram is hiring a Research Staff, Voice AI Foundations