NVIDIA is hiring a Senior Software Engineer, DevOps - Server Infrastructure

About the Role

Design, implement, and maintain infrastructure systems that power high-performance computing environments using modern DevOps methodologies and automation tools.

Responsibilities

  • Develop and manage scalable server infrastructure for high-availability systems
  • Implement continuous integration and continuous deployment pipelines
  • Automate infrastructure provisioning and configuration management
  • Monitor system performance and ensure reliability across environments
  • Troubleshoot complex infrastructure issues across distributed systems
  • Collaborate with software engineering teams to optimize deployment workflows
  • Maintain security and compliance standards across infrastructure platforms
  • Design fault-tolerant systems to support mission-critical operations
  • Optimize resource utilization in cloud and on-premises environments
  • Support disaster recovery and business continuity planning
  • Integrate monitoring and alerting solutions for proactive issue resolution
  • Manage containerized environments and orchestration platforms
  • Ensure infrastructure aligns with software development lifecycle requirements
  • Document architecture decisions and operational procedures
  • Evaluate and deploy new technologies to improve system efficiency
  • Participate in on-call rotations for critical system support
  • Contribute to capacity planning and scalability assessments
  • Work closely with security teams to enforce best practices
  • Improve deployment velocity while maintaining system stability
  • Drive automation initiatives to reduce manual intervention

Nice to Have

  • Master’s degree in computer science or related field
  • Experience supporting GPU-accelerated computing environments
  • Background in high-performance computing or data center operations
  • Contributions to open-source infrastructure projects
  • Familiarity with service mesh technologies
  • Knowledge of infrastructure security compliance frameworks
  • Experience with large-scale distributed systems
  • Exposure to machine learning or AI workloads
  • Certifications in cloud or DevOps platforms

Compensation

Competitive salary and comprehensive benefits package

Work Arrangement

Hybrid work model with flexibility for remote and on-site collaboration

Team

Part of a high-performance engineering team focused on scalable infrastructure systems

Why Join Us

  • Opportunity to work on cutting-edge infrastructure supporting advanced computing technologies
  • Collaborative environment with access to industry-leading hardware and software platforms

What We Offer

  • Comprehensive health and wellness benefits
  • Professional development and career growth opportunities
  • Innovative projects with global impact

Work authorization sponsorship is available for qualified candidates who require it.

Required Skills

  • Python
  • Shell Scripting
  • Ansible
  • Terraform
  • Helm Template
  • Docker
  • Docker Compose
  • Elasticsearch
  • Logstash
  • Kibana
  • Linux System Administration
  • CI/CD
  • Cloud Infrastructure
  • Networking
  • Monitoring

About the Company

NVIDIA builds accelerated computing platforms and AI technologies that power advancements in areas such as generative AI, data centers, robotics, and digital twins.

Job Details

  • Category: Infrastructure
  • Posted: 7 months ago