NVIDIA is looking for a Senior Performance Compiler Engineer - Triton to advance the open-source Triton compiler project and improve AI performance on NVIDIA GPUs. In this role, you will enable breakthroughs in large language models, agents, and other high-impact AI applications, accelerating both training and inference.
What You'll Do
- Investigate the latest and future NVIDIA GPU hardware architecture and programming models.
- Work on the frontier of AI by understanding advanced algorithms and numerics to identify new opportunities for optimization.
- Design and implement compiler technology using MLIR to optimize high-level kernel descriptions written in Triton's Python DSL.
- Use inline PTX to hand-tune critical code paths and extract peak performance from the hardware when vital.
- Engage in a dynamic, iterative process of optimization to find the most efficient path to peak performance.
- Collaborate with teams across NVIDIA, including hardware architects and the CUDA compiler team, to influence future products and ensure maximum efficiency.
What We're Looking For
- Bachelor, Masters or Ph.D. degree or equivalent experience in Computer Science, Computer Engineering, Applied Math, or a related field.
- 6+ years of relevant industry experience in software development.
- Demonstrated strong C++ programming and software design skills, with an emphasis on performance analysis and debugging.
- Experienced in parallel programming, including CUDA/OpenCL GPU programming or other parallel models such as OpenMP.
- Solid understanding of computer architecture and hands-on experience with assembly-level programming.
Nice to Have
- Experience in tuning BLAS or deep learning library kernels.
- Background in numerics and linear algebra.
- Experience with machine learning compilers like TVM or MLIR.
- Contributions to open-source projects, especially in the AI/ML or compiler space.
- Familiarity with the latest research in AI algorithms and numerics.
Technical Stack
- C++
- MLIR
- CUDA
- OpenCL
- OpenMP
- PTX
- Triton Python DSL
Team & Environment
Collaborate with teams across NVIDIA, including hardware architects and the CUDA compiler team.
Benefits & Compensation
- Competitive salaries
- Generous benefits package
- Equity
- Compensation: $184,000 USD - $287,500 USD for Level 4, and $224,000 USD - $356,500 USD for Level 5.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
