Santa Clara, California, United States Hybrid USD 190,000 – 280,000 / year

d-Matrix is hiring a Principal Architect, Performance Analysis and Modeling

Responsibilities

  • Evaluate cutting-edge machine learning workloads, including multi-modal large language models, chain-of-thought reasoning systems, and video or audio generation models
  • Help define hardware and software capabilities that drive next-generation inference acceleration solutions for data center environments
  • Stay current with advancements in machine learning architectures and algorithmic research
  • Work closely with cross-functional teams such as Product, Hardware Design, Compiler, Inference Server, and Kernels
  • Study characteristics of emerging ML algorithms and workloads to assess functional and performance impacts
  • Develop analytical models to forecast performance on existing and upcoming hardware platforms
  • Suggest innovative hardware and software features to support or enhance algorithm execution
Required Skills
Machine LearningC++Python
About company
d-Matrix
d-Matrix is focused on unleashing the potential of generative AI to power the transformation of technology. They are at the forefront of software and hardware innovation, pushing the boundaries of what is possible.
All jobs at d-Matrix Visit website
Job Details
Department Research and Development (R&D)
Category other
Posted 3 months ago