Santa Clara or United States Employment USD 184,000 - 287,500 Yearly

NVIDIA is hiring a Senior Software Engineer

About the Role

NVIDIA is looking for a Senior Software Engineer to develop groundbreaking technologies in the inference systems software stack. You will build innovative AI systems software to accelerate AI inference and develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture.

What You'll Do

  • Innovate and develop new AI systems technologies for efficient inference
  • Design, implement, and optimize kernels for high impact AI workloads
  • Design and implement extensible abstractions for LLM serving engines
  • Build efficient just-in-time domain specific compilers and runtimes
  • Collaborate closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
  • Contribute to open source communities like FlashInfer, vLLM, and SGLang

What We're Looking For

  • Master's degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience)
  • 6+ years of academic or industry experience with ML/DL systems development preferred
  • Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
  • Strong Python and C/C++ programming skills
  • Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)

Nice to Have

  • PhD preferred
  • Background in domain specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
  • Expertise in inference engines like vLLM and SGLang
  • Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
  • Open source project ownership or contributions

Technical Stack

  • PyTorch, JAX, TensorFlow, ONNX, vLLM, SGLang, MLC, Python, C/C++, CUDA C/C++, cuTile, Triton, FlashInfer, Flash Attention, Apache TVM, MLIR

Team & Environment

Collaborate closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams.

Benefits & Compensation

  • Compensation: $184,000 - $287,500 USD per year
  • Equity: eligible
  • Benefits

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required Skills

  • PyTorch, JAX, TensorFlow, ONNX, vLLM, SGLang, MLC, Python, C/C++, CUDA C/C++, GPU kernel development, performance optimizations, cuTile, Triton, ML/DL systems

About company
NVIDIA

NVIDIA is the platform upon which every new AI‑powered application is built.

Job Details
Department Software Development
Category data
Posted 14 days ago