The ML Runtime Optimization Engineer at Applied Intuition will focus on optimizing machine learning models and deploying them on production-grade embedded systems. This role involves working across the full ML framework stack to enhance performance for ADAS/AD applications on various embedded compute platforms.
What You'll Do
- Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms
- Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
- Work on model pruning and quantization, and support deployment on memory constrained platforms
- Collaborate closely with ML engineers and software developers on technical efforts to find and optimize efficient model architecture solutions
- Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration
What We're Looking For
- Bachelors in Electrical Engineering or Computer Science, OR B.Sc. in Computer Science, Mathematics, Physics or a related field
- 3+ years of experience with ML accelerators, GPU, CPU, SoC architecture and micro-architecture
- Strong software development skills with the focus on embedded programming
- Experience profiling and optimizing model performance on embedded compute platforms
- Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX, etc.)
Nice to Have
- M.Sc or PhD in a ML related area
- Built an ML optimization framework from scratch before
- Deployed ML solutions to embedded chips for real time robotics applications
Technical Stack
- PyTorch
- JAX
- ONNX
- TensorRT
- CUDA
- XLA
- Triton
Benefits & Compensation
- Comprehensive health, dental, vision, life and disability insurance coverage
- 401k retirement benefits with employer match
- Learning and wellness stipends
- Paid time off
Compensation: $159,053 - $199,295 USD annually. Equity in the form of options and/or restricted stock units. Base salary is a single component of the total compensation package.
Work Mode
Employees primarily work from the office 5 days a week, but occasional remote work is allowed. Flexibility includes starting the day with morning meetings from home before heading to the office or leaving earlier when needed to accommodate family commitments. Office locations include Sunnyvale, California; Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo.
Applied Intuition is an equal opportunity employer and federal contractor or subcontractor. The company complies with regulations prohibiting discrimination based on race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability. It requires affirmative action to employ and advance individuals without regard to these factors.
