Hybrid Full-time

Dell Technologies is hiring a Senior GenAI & High Performance Computing (HPC) Delivery Engineer

About the Role

Dell Technologies is seeking a Senior GenAI & High Performance Computing (HPC) Delivery Engineer to deploy, configure, and validate GPU-accelerated compute clusters for AI, machine learning, and HPC workloads. This is a highly hands-on, customer-facing role involving significant travel for onsite deployments across the U.S.

What You'll Do

  • Deploy, configure, and validate GPU-accelerated compute clusters for AI, ML, and HPC using NVIDIA Base Command Manager.
  • Perform benchmarking with tools like HPL GPU, HPL MxP, STREAM, NCCL, RCCL, and OSU Microbenchmarks.
  • Produce as-built documentation and performance reports, and share best practices.
  • Configure and secure Linux distributions including RHEL, Ubuntu, and Rocky for GenAI or HPC workloads.
  • Work directly with customers onsite, traveling regionally and across the U.S.

What We're Looking For

  • 7+ years of experience with HPC or GenAI clusters, GPU-based systems, AI infrastructure, or related fields.
  • Deep hands-on experience with GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager.
  • Proficiency with benchmarking tools: HPL, HPL MxP, STREAM, NCCL, RCCL, and OSU Microbenchmarks.
  • Red Hat certification (RHCSA/RHCE) or 7+ years of relevant experience with Red Hat distributions.
  • Experience with GenAI/HPC networking (InfiniBand and/or RoCE).
  • Experience working in Linux-based parallel computing environments at scale.
  • Experience with containers and orchestration (Docker, Singularity/Apptainer, Kubernetes, Slurm).
  • Ability to travel up to 70% of the time across the U.S. as needed for projects.
  • Strong customer-facing and communication skills.

Nice to Have

  • Bachelor’s degree.
  • NVIDIA certifications (NCA, NCE, DGX).
  • Experience with NVIDIA UFM, InfiniBand, and Spectrum-X fabrics.
  • Exposure to hybrid cloud or GPU cloud environments.
  • Experience with GPU observability and performance profiling tools.

Technical Stack

  • Cluster Management: NVIDIA Base Command Manager, Warewulf, OpenHPC
  • Operating Systems: RHEL, Ubuntu, Rocky
  • Containers & Orchestration: Docker, Singularity/Apptainer, Kubernetes, Slurm
  • Networking: InfiniBand, RoCE, NVIDIA UFM, Spectrum-X fabrics

Team & Environment

You will be part of the Service Delivery Team.

Benefits & Compensation

  • Compensation range: $153,850 to $199,100.
  • Health and wellness benefits detailed at MyWellatDell.com.

Work Mode

This role operates on a hybrid basis. Primary locations are Austin, Texas, and Remote United States.

Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.

Required Skills
NVIDIA Base Command Manager, Warewulf, OpenHPC, Slurm, Kubernetes, Docker, Singularity/Apptainer, RHEL, Ubuntu, Rocky Linux, High Performance Computing, GenAI Infrastructure, Linux Systems Administration, Automation Scripting

About company
Dell Technologies

Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. They have delivered HPC solutions for 25+ years and are NVIDIA’s preferred partner for GenAI Factory systems.

Job Details
Category infrastructure
Posted 18 days ago