Full-time

Nvidia is hiring a Principal Artificial Intelligence Algorithms Engineer

About the Role

NVIDIA is looking for a Principal Artificial Intelligence Algorithms Engineer to join our core AI Frameworks team. You will design, develop, and optimize diverse real-world AI workloads, expanding the capabilities of the open-source Megatron Core and NeMo Framework. This role tackles large-scale, end-to-end AI training and inference challenges across the full model lifecycle.

What You'll Do

  • Develop advanced algorithms for AI, deep learning, data analytics, machine learning, or scientific computing.
  • Contribute to and advance the open-source Megatron Core and NeMo Framework.
  • Solve large-scale, end-to-end AI training and inference challenges spanning orchestration, data pre-processing, model training, tuning, and deployment.
  • Work at the intersection of computer architecture, libraries, frameworks, AI applications, and the entire software stack.
  • Innovate and improve model architectures, distributed training algorithms, and model parallel paradigms.
  • Conduct performance tuning, optimizations, and model training/finetuning with mixed precision recipes on next-gen NVIDIA GPU architectures.
  • Research, prototype, and develop robust and scalable AI tools and pipelines.

What We're Looking For

  • MS, PhD or equivalent experience in Computer Science, AI, Applied Math, or a related field.
  • 10+ years of relevant industry experience.
  • Experience with AI Frameworks (e.g., PyTorch, JAX) and/or inference and deployment environments (e.g., TRTLLM, vLLM, SGLang).
  • Proficient in Python programming, software design, debugging, performance analysis, test design, and documentation.
  • Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.
  • Strong understanding of AI/Deep-Learning fundamentals and their practical applications.

Nice to Have

  • Hands-on experience in large-scale AI training, with a deep understanding of core compute system concepts and demonstrated excellence in performance analysis and tuning.
  • Expertise in distributed computing, model parallelism, and mixed precision training.
  • Prior experience with Generative AI techniques applied to LLM and Multi-Modal learning (Text, Image, and Video).
  • Knowledge of GPU/CPU architecture and related numerical software.
  • Contributions to open source deep learning frameworks.

Technical Stack

  • PyTorch, JAX, TRTLLM, vLLM, SGLang
  • Python
  • NVIDIA GPU architectures

Team & Environment

You will join the Core AI Frameworks (Megatron Core and NeMo Framework) team, operating in a creative and autonomous environment with forward-thinking colleagues.

Benefits & Compensation

  • Compensation range: $272,000 USD - $425,500 USD + eligible equity.
  • Equity and comprehensive benefits package.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required Skills
PyTorchJAXTRTLLMvLLMSGLangPythonNVIDIA GPU ArchitecturesMachine LearningDeep LearningLarge Language ModelsAI AlgorithmsDistributed SystemsPerformance Optimization
Planning long-term in Thailand?

Full relocation support, start to finish

From visa strategy to housing, banking, and schools for your family — SVBL plans and manages every detail of your move to Thailand so nothing falls through the cracks.

Complete relocation planning
Family visa & school enrollment
Banking & insurance setup
Cultural integration support
Plan your move
One partner for everything
About company
Nvidia

NVIDIA's invention of the GPU sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing for science and engineering. Today, the company is known as 'the AI computing company,' with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.

Visit website
Job Details
Category data
Posted 7 months ago