Inworld AI is looking for a Senior AI Engineer to build the engine for the next generation of AI-driven software. Voice is a key interface for human-AI interaction, and you will focus on pushing the boundaries of speech modeling.
What You'll Do
- Research, build, optimize, and deploy production ML systems for speech modeling (STT & TTS).
- Solve challenges related to audio data collection, efficient training infrastructure, RL alignment environments, and ultra-low-latency inference optimization.
- Work on the difficult research and engineering problems of building the engine for the next generation of AI-driven software.
What We're Looking For
- BA/BS, MS, or PhD in a technical field (CS, Math, Physics) with a strong foundation in Machine Learning.
- 3+ years of combined experience in software development (e.g., with Python or C++) and applied ML engineering.
- Demonstrated experience applying or researching Machine Learning in speech/video processing, NLP, or action planning.
- Strong foundation in data structures, algorithms, and neural network architectures.
- Proficiency with ML frameworks such as PyTorch.
- Power user of AI agents for work automation.
Nice to Have
- A passion for learning and staying up-to-date with the latest advancements in ML/Voice AI research and its applications.
- Ability to work collaboratively in a fast-paced environment with shifting priorities.
- Familiarity with pre-training, fine-tuning, RLHF, and evaluation of large language and speech models.
- Experience working with embedded systems and/or running ML on edge devices.
- Strong background in mathematics and/or physics.
Technical Stack
- Python
- C++
- PyTorch
Work Mode
This is a local-country position, open only to candidates based in the United Kingdom.