Full-time

NVIDIA is hiring a Senior Generative AI Research Engineer

About the Role

NVIDIA is looking for a Senior Generative AI Research Engineer to design, train, and optimize foundation models for real-world applications. You will be a key contributor on our Cosmos generative AI engineering team, working to advance the state of AI and bring world models from research to deployment.

What You'll Do

  • Design, post-train, and optimize foundation models like LLMs, diffusion video models, VLMs, and VLAs for real-world use.
  • Contribute to the development of large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Work collaboratively with research, software, and product teams to transition world models from idea to deployment.
  • Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers.
  • Prototype and iterate rapidly on experiments across domains like agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to achieve production-ready performance.

What We're Looking For

  • A minimum of 8 years of industry experience, or 5+ years of research or postdoc experience building and deploying generative AI systems.
  • Proficiency in PyTorch, JAX, or other deep learning frameworks.
  • Expertise in one or more areas: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Intimate familiarity with all variants of transformer attention mechanisms.
  • Hands-on experience with large-scale training techniques (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing frameworks (e.g., Ray, Spark).
  • Production-quality software engineering skills in Python.
  • An MS, PhD, or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field.
  • 12+ years of relevant software development experience.

Nice to Have

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries or publications at major conferences like NeurIPS, ICML, CVPR, or ICLR.
  • Experience working with multimodal data such as vision-language models, VLAs, or audio.
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Technical Stack

  • PyTorch, JAX, Python
  • ZeRO, DDP, FSDP, TP, CP
  • Ray, Spark

Team & Environment

You will join the Cosmos generative AI engineering team at NVIDIA, working closely with teams in research, software, and product. The team is creative, passionate, and self-motivated, made up of forward-thinking and hardworking people.

Benefits & Compensation

  • Compensation ranges from $224,000 to $356,500 USD for Level 5, and from $272,000 to $425,500 USD for Level 6.
  • Eligible for equity.
  • Comprehensive benefits package.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required Skills
PyTorch, JAX, Python, ZeRO, DDP, FSDP, TP, CP, Ray, Spark, Generative AI, Large Language Models, Distributed Training, Machine Learning, Research
About company
NVIDIA

NVIDIA is the platform upon which every new AI‑powered application is built.

Job Details
Category: Data
Posted 8 months ago