Remote (Global)

Fluidstack is hiring an Infrastructure Engineer (Compute)

Fluidstack is hiring an Infrastructure Engineer (Compute) to design, deploy, and manage the compute infrastructure powering our GPU clusters. You will ensure the performance, scalability, and reliability of compute resources, working closely with hardware and software teams to support demanding AI workloads.

What You'll Do

  • Design and implement GPU/ASIC infrastructure at the server, rack, and system level.
  • Troubleshoot complex GPU and compute-system failures.
  • Develop and maintain hardware/firmware management services.
  • Automate all aspects of the server lifecycle.
  • Own the end-to-end compute lifecycle, including partnering with vendors on RMAs.
  • Serve as the main point of contact for hardware escalation and troubleshooting.
  • Monitor system performance, identifying and resolving bottlenecks.
  • Automate deployment and management tasks to improve efficiency.
  • Collaborate with storage and network teams to ensure cohesive infrastructure operations.

What We're Looking For

  • 5+ years of experience in compute infrastructure engineering.
  • Strong knowledge of Linux systems administration and performance tuning.
  • Experience with bare-metal provisioning tools (MaaS, Metal3, Tinkerbell, or similar).
  • Familiarity with GPU hardware and workload optimization, especially kernel- and driver-level requirements.
  • Proficiency in automation tools (e.g., Ansible, Terraform).
  • Experience operating Kubernetes and SLURM clusters.

Technical Stack

  • Linux
  • MaaS
  • Metal3
  • Tinkerbell
  • Ansible
  • Terraform
  • Kubernetes
  • SLURM

Team & Environment

You will join a small, highly motivated team focused on providing a world-class supercomputing experience. We put customers first, hold ourselves and each other to high standards, and value effectiveness, competence, and a growth mindset.

Benefits & Compensation

  • Retirement or pension plan, in line with local norms.
  • Health, dental, and vision insurance.
  • Generous PTO policy, in line with local norms.
  • Access to WeWork for remote employees.

Work Mode

This role is remote, with access to key hub offices as needed.

Fluidstack is an equal opportunity employer.

Required Skills

  • Linux
  • MaaS
  • Metal3
  • Tinkerbell
  • Ansible
  • Terraform
  • Kubernetes
  • SLURM
  • Bare Metal Provisioning
  • Infrastructure as Code
  • Automation
  • High-Performance Computing
  • Networking
  • Scripting
  • Cloud Infrastructure

About company
Fluidstack
We’re building the infrastructure for abundant intelligence. We partner with top AI labs, governments, and enterprises to unlock compute at the speed of light.
Job Details
Category: Infrastructure
Posted: 9 months ago