Full-time

NVIDIA is hiring a Lead Senior Software Engineer, Agentic AI Applications

About the Role

NVIDIA is hiring a Lead Senior Software Engineer to serve as a Tech Lead, driving the design and delivery of agentic AI blueprints and reference workflows. You will craft industry-leading examples, such as the AI-Q Deep Researcher blueprint, that show enterprises how to implement agentic AI at scale.

What You'll Do

  • Design, develop, and implement agentic AI blueprints that demonstrate how enterprises can utilize and deploy this technology.
  • Lead technical reviews and provide mentorship, guiding the engineering team in building production-grade workflows and extending core GenAI SDK capabilities.
  • Develop proof-of-concept workflows rooted in first principles that apply modern data science techniques to GenAI use cases.
  • Collaborate cross-functionally with product, research, and infrastructure teams to evolve NVIDIA's agentic ecosystem, including integrations between the NeMo Agent Toolkit and other NVIDIA products.
  • Drive performance optimization for agentic applications across the data center, focusing on improving accuracy, reducing latency, and increasing efficiency.
  • Establish engineering standards and best practices for developing, testing, and deploying agentic AI applications across distributed environments.

What We're Looking For

  • BS in Computer Engineering, Computer Science, Data Science, or a related field, or equivalent experience.
  • 8+ years of software engineering experience, including 2+ years as a tech lead.
  • Proficient in Python, with 6+ years of experience building Python libraries or applications for enterprise customers.
  • Experience with GenAI application development using LLM frameworks (e.g., LangChain, LlamaIndex, or AutoGen), evaluation systems (e.g., RAGAs), and observability platforms (e.g., Arize Phoenix, W&B Weave, or LangSmith).
  • Hands-on experience with, and a solid understanding of, agentic frameworks.
  • Proficient in distributed orchestration and communication frameworks (e.g., Kafka, Ray).
  • Ability to quickly learn and apply new technologies and libraries.
  • Self-starter with a proactive work ethic, capable of working independently and successfully within a distributed team.
  • Excellent communication and collaboration skills across distributed, cross-functional teams.

Nice to Have

  • MS or PhD.
  • Demonstrated leadership in building and scaling agentic AI applications in production.
  • Experience developing your own agents in Python or a similar language (e.g., Go).
  • Concrete examples or code showing how you have profiled code to identify performance bottlenecks and how you mitigated them.
  • Experience developing for GPU platforms and familiarity with NVIDIA technologies (e.g., CUDA, TensorRT, Triton, NeMo) and LLM serving frameworks (e.g., Dynamo, vLLM, SGLang).
  • Experience with RAG systems and communication protocols (e.g., MCP, A2A).

Technical Stack

  • Primary Language: Python
  • LLM Frameworks: LangChain, LlamaIndex, AutoGen
  • Evaluation Systems: RAGAs
  • Observability Platforms: Arize Phoenix, W&B Weave, LangSmith
  • Distributed Orchestration: Kafka, Ray
  • NVIDIA Technologies: CUDA, TensorRT, Triton, NeMo
  • LLM Serving Frameworks: Dynamo, vLLM, SGLang
  • Specialized Systems: RAG systems
  • Communication Protocols: MCP, A2A

Benefits & Compensation

  • Compensation: $184,000 - $287,500 USD for Level 4, and $224,000 - $356,500 USD for Level 5.
  • Eligible for equity.
  • Generous benefits package.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Required Skills

  • Python
  • LLM Frameworks (LangChain, LlamaIndex, AutoGen)
  • LLM Evaluation (RAGAs)
  • Observability (Arize Phoenix, W&B Weave, LangSmith)
  • Distributed Orchestration (Kafka, Ray)
  • NVIDIA CUDA
  • NVIDIA TensorRT/Triton/NeMo
  • LLM Serving (Dynamo, vLLM, SGLang)
  • RAG Systems
  • Communication Protocols (MCP, A2A)
  • Agentic AI
  • Large Language Models
  • Machine Learning
  • Software Architecture
  • System Design
About company
NVIDIA

NVIDIA is the platform upon which every new AI‑powered application is built.

Job Details
Category: Data
Posted 5 months ago