Responsibilities
- Evaluate cutting-edge machine learning workloads, including multimodal large language models, chain-of-thought reasoning systems, and video and audio generation models
- Help define hardware and software capabilities that drive next-generation inference acceleration solutions for data center environments
- Stay current with advancements in machine learning architectures and algorithmic research
- Work closely with cross-functional teams such as Product, Hardware Design, Compiler, Inference Server, and Kernels
- Study the characteristics of emerging ML algorithms and workloads to assess their functional and performance impact on hardware and software
- Develop analytical models to forecast performance on existing and upcoming hardware platforms
- Propose innovative hardware and software features that support or improve algorithm execution
