Full-time

Nvidia is hiring a Senior Performance Compiler Engineer - Triton

About the Role

NVIDIA is looking for a Senior Performance Compiler Engineer - Triton to advance the open-source Triton compiler project and improve AI performance on NVIDIA GPUs. In this role, you will enable breakthroughs in large language models, agents, and other high-impact AI applications, accelerating both training and inference.

What You'll Do

  • Investigate the latest and future NVIDIA GPU hardware architecture and programming models.
  • Work on the frontier of AI by understanding advanced algorithms and numerics to identify new opportunities for optimization.
  • Design and implement compiler technology using MLIR to optimize high-level kernel descriptions written in Triton's Python DSL.
  • Use inline PTX to hand-tune critical code paths and extract peak performance from the hardware when vital.
  • Engage in a dynamic, iterative process of optimization to find the most efficient path to peak performance.
  • Collaborate with teams across NVIDIA, including hardware architects and the CUDA compiler team, to influence future products and ensure maximum efficiency.

What We're Looking For

  • Bachelor, Masters or Ph.D. degree or equivalent experience in Computer Science, Computer Engineering, Applied Math, or a related field.
  • 6+ years of relevant industry experience in software development.
  • Demonstrated strong C++ programming and software design skills, with an emphasis on performance analysis and debugging.
  • Experienced in parallel programming, including CUDA/OpenCL GPU programming or other parallel models such as OpenMP.
  • Solid understanding of computer architecture and hands-on experience with assembly-level programming.

Nice to Have

  • Experience in tuning BLAS or deep learning library kernels.
  • Background in numerics and linear algebra.
  • Experience with machine learning compilers like TVM or MLIR.
  • Contributions to open-source projects, especially in the AI/ML or compiler space.
  • Familiarity with the latest research in AI algorithms and numerics.

Technical Stack

  • C++
  • MLIR
  • CUDA
  • OpenCL
  • OpenMP
  • PTX
  • Triton Python DSL

Team & Environment

Collaborate with teams across NVIDIA, including hardware architects and the CUDA compiler team.

Benefits & Compensation

  • Competitive salaries
  • Generous benefits package
  • Equity
  • Compensation: $184,000 USD - $287,500 USD for Level 4, and $224,000 USD - $356,500 USD for Level 5.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required Skills
C++MLIRCUDATritonPTXOpenCLOpenMPCompiler DesignPerformance OptimizationParallel ProgrammingLLVMDeep Learning CompilersPython
Starting a business in Thailand?

Company registration done right

Foreign ownership rules, licenses, tax registration — Thai business setup has many moving parts. SVBL guides you through every step with full legal compliance.

Company registration & structure
Foreign ownership solutions
License & tax registration
BOI promotion eligibility
Start your business
100% foreign ownership possible
About company
Nvidia

NVIDIA's invention of the GPU sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing for science and engineering. Today, the company is known as 'the AI computing company,' with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.

Visit website
Job Details
Category data
Posted 7 months ago