We are seeking a Senior AI Research Engineer with expertise in developing and optimizing multimodal AI models. The role will be central to advancing Axelera AI’s platform capabilities in inference for Generative AI, working on state-of-the-art models that integrate multiple data modalities (e.g., text, vision, and audio) for a broad range of applications.
What You'll Do
- Design, develop, and optimize multimodal AI models for real-time, high-efficiency inference across a variety of deployment environments (edge, server-side, and embodied AI).
- Work closely with cross-functional teams, including AI researchers, hardware engineers, and software engineers to integrate AI models into the broader platform.
- Focus on optimizing models for memory efficiency, low-latency inference, and high throughput.
- Stay up-to-date with the latest research in multimodal AI, proposing and implementing new techniques to push the boundaries of what's possible in generative AI.
- Implement best practices for model testing, deployment, and continuous improvement to ensure models scale effectively in production environments.
What We're Looking For
- Proven experience (for all levels) in developing and deploying multimodal models, including text, image, and/or audio data.
- Strong background in deep learning frameworks (e.g., TensorFlow, PyTorch, JAX).
- Proficiency in natural language processing (NLP), computer vision (CV), and speech processing techniques.
- Experience with model optimization techniques (e.g., quantization, pruning, distillation).
- Familiarity with distributed computing, in-memory computing platforms, or high-performance computing.
- A strong understanding of the latest advancements in AI/ML research, particularly in generative models (e.g. transformers and diffusion models).
- Ability to work in a highly collaborative, fast-paced startup environment and communicate complex technical concepts clearly.
Nice to Have
- PhD or advanced degree in Computer Science, Machine Learning, AI, or related fields.
- 5+ years of post-graduation relevant work experience.
- Experience in deploying models on edge devices or in-memory computing systems.
- Familiarity with model deployment frameworks like TensorRT, ONNX, or similar.
- A passion for solving real-world challenges with AI in dynamic, high-performance environments.
Technical Stack
TensorFlow, PyTorch, JAX, TensorRT, ONNX
Team & Environment
Join a world-class team of 220+ employees, including 49+ PhDs, working remotely from 18 different countries, with offices in Belgium, France, Switzerland, Italy, and the UK, and headquartered in Eindhoven, Netherlands.
Benefits & Compensation
- Relocation support to Leuven, Bologna, Florence or Milan for talent based abroad
- Work on groundbreaking technology that will power the next wave of AI applications, from edge computing to embodied AI systems
- Join a diverse, driven team that values innovation, collaboration, and continuous learning
- Significant growth opportunities, including the chance to shape the direction of the product and AI strategy
- Competitive salary
- Equity options
- Benefits package
Work Mode
Hybrid and remote options available in Italy; hybrid or on-site in Belgium; relocation supported. Locations include Leuven, Bologna, Florence, and Milan.
At Axelera AI, we wholeheartedly embrace equal opportunity and hold diversity in the highest regard. Our steadfast commitment is to cultivate a warm and inclusive environment that empowers and celebrates every member of our team. We welcome applicants from all backgrounds to join us in shaping the future of AI.






