About the Role

Design, implement, and maintain infrastructure systems that power high-performance computing environments using modern DevOps methodologies and automation tools.

Responsibilities

Develop and manage scalable server infrastructure for high-availability systems
Implement continuous integration and continuous deployment pipelines
Automate infrastructure provisioning and configuration management
Monitor system performance and ensure reliability across environments
Troubleshoot complex infrastructure issues across distributed systems
Collaborate with software engineering teams to optimize deployment workflows
Maintain security and compliance standards across infrastructure platforms
Design fault-tolerant systems to support mission-critical operations
Optimize resource utilization in cloud and on-premises environments
Support disaster recovery and business continuity planning
Integrate monitoring and alerting solutions for proactive issue resolution
Manage containerized environments and orchestration platforms
Ensure infrastructure aligns with software development lifecycle requirements
Document architecture decisions and operational procedures
Evaluate and deploy new technologies to improve system efficiency
Participate in on-call rotations for critical system support
Contribute to capacity planning and scalability assessments
Work closely with security teams to enforce best practices
Improve deployment velocity while maintaining system stability
Drive automation initiatives to reduce manual intervention

Nice to Have

Master’s degree in computer science or related field
Experience supporting GPU-accelerated computing environments
Background in high-performance computing or data center operations
Contributions to open-source infrastructure projects
Familiarity with service mesh technologies
Knowledge of infrastructure security compliance frameworks
Experience with large-scale distributed systems
Exposure to machine learning or AI workloads
Certifications in cloud or DevOps platforms

Compensation

Competitive salary and comprehensive benefits package

Work Arrangement

Hybrid work model with flexibility for remote and on-site collaboration

Team

Part of a high-performance engineering team focused on scalable infrastructure systems

Why Join Us

Opportunity to work on cutting-edge infrastructure supporting advanced computing technologies
Collaborative environment with access to industry-leading hardware and software platforms

What We Offer

Comprehensive health and wellness benefits
Professional development and career growth opportunities
Innovative projects with global impact

Available for qualified candidates requiring work authorization

NVIDIA is hiring a Senior Software Engineer, DevOps - Server Infrastructure

About the Role

Responsibilities

Nice to Have

Compensation

Work Arrangement

Team

Why Join Us

What We Offer

Similar Jobs

Senior Engineer - Cloud Platforms

Software Engineer / DevOps

Containerization Cloud Consulting

Hardware Enablement Engineer (Linux)

Senior DevOps Engineer (m/w/d)

DevOps Engineer III

Related Articles

Network Configuration as Code: CI/CD for Automation | NVIDIA

Become an AI Developer: Your Career Guide

CI/CD Testing Tools: 23 Best Options for 2026