Full-time

NVIDIA is hiring a Senior Distributed Storage Engineer - DGX Cloud

About the Role

NVIDIA is looking for a Senior Distributed Storage Engineer to join the DGX Cloud team. You will develop distributed storage services tailored for AI/ML applications, crafting a reliable, scalable, and efficient storage-as-a-service solution that can be deployed anywhere.

What You'll Do

  • Lead the overall architecture and design of our distributed storage service optimized for AI/ML.
  • Build features for a distributed storage service to enhance availability and reliability for large-scale deployments.
  • Engage and collaborate with NVIDIA Research, Computing, Product teams, cross-functional teams, and external customers to deliver Cloud services.
  • Automate distributed storage service end-to-end, including deployment, management, and monitoring.

What We're Looking For

  • Strong track record of delivering distributed services in a variety of distributed computing environments.
  • Experience designing, implementing, and operating distributed systems at a multi-petabyte scale.
  • Experience in implementing storage services and interfaces to ensure scalable, high-performance, and reliable solutions.
  • History of ownership of product delivery from inception to support.
  • Great communication and presentation skills.
  • Prior experience developing distributed systems with Kubernetes, Golang, Python, and Cloud Service Provider integrations.
  • Bachelor’s of Science in Computer Science, or related field (or equivalent experience) with 5+ years of industry experience.

Nice to Have

  • Architected, built, and deployed a distributed service that runs on large-scale clusters, multi-petabyte to exabyte in size, with millions of users.
  • Experience and own responsibility for all software development and delivery stages.
  • Passionate about innovating and investing in groundbreaking technologies and interested in working with accelerated Computing environments such as GPU Direct Storage, DPU, and RDMA.
  • Skilled in building and delivering cloud services, with a specific focus on distributed systems.

Technical Stack

  • Kubernetes
  • Golang
  • Python
  • Cloud Service Provider integrations

Benefits & Compensation

  • Equity and benefits.
  • Compensation: 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.

Required Skills
KubernetesGolangPythonCloud Service Provider integrationsDistributed Storage SystemsDistributed SystemsPerformance OptimizationSoftware EngineeringCloud InfrastructureNetworkingLinuxCI/CDAutomation
Ready to relocate and code from paradise?

Thailand or Vietnam — your office, your rules

Iglu offers relocation to Bangkok, Chiang Mai, Ho Chi Minh City, or Hong Kong. Full employment, legal setup, and a community of 200+ digital professionals.

Relocation to 5 countries
Full legal work setup
Developer community access
Work-life balance culture
Explore locations
Relocation support included
About company
NVIDIA

NVIDIA is the platform upon which every new AI‑powered application is built.

Visit website
Job Details
Category infrastructure
Posted 8 months ago