NVIDIA is looking for a Senior Software Engineer to develop groundbreaking technologies in the inference systems software stack. You will build innovative AI systems software to accelerate AI inference and develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture.
What You'll Do
- Innovate and develop new AI systems technologies for efficient inference
- Design, implement, and optimize kernels for high impact AI workloads
- Design and implement extensible abstractions for LLM serving engines
- Build efficient just-in-time domain specific compilers and runtimes
- Collaborate closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
- Contribute to open source communities like FlashInfer, vLLM, and SGLang
What We're Looking For
- Masters degree in Computer Science, Electrical Engineering, or related field (or equivalent experience)
- 6+ years (academic/industry) experience with ML/DL systems development preferable
- Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
- Strong Python and C/C++ programming skills
- Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)
Nice to Have
- PhD are preferred
- Background in domain specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
- Expertise in inference engines like vLLM and SGLang
- Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
- Open source project ownership or contributions
Technical Stack
- PyTorch, JAX, TensorFlow, ONNX, vLLM, SGLang, MLC, Python, C/C++, CUDA C/C++, cuTile, Triton, FlashInfer, Flash Attention, Apache TVM, MLIR
Team & Environment
Collaborate closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams.
Benefits & Compensation
- Compensation: $184,000 USD - $287,500 USD + equity: Eligible
- Equity
- Benefits
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.





