Hybrid Full-time

The San Francisco Compute Company is hiring a Principal Network Engineer

About the Role

The San Francisco Compute Company is hiring a Principal Network Engineer to design, provision, and manage the high-performance networks that underpin and interconnect our large-scale GPU clusters globally. You will build and automate these critical systems from the ground up.

What You'll Do

  • Design, provision, and manage networks for large-scale GPU clusters globally.
  • Stand up a 400GbE spine-leaf network from scratch, implementing eBGP.
  • Automate network configuration to deploy it across a 20k node cluster.
  • Work on network architecture design, automated fabric provisioning, and validation.
  • Handle performance monitoring and optimization.
  • Work on high performance distributed storage system interconnects.
  • Work on globally distributed cross-datacenter interconnects.

What We're Looking For

  • Experience with HPC and GPU network technologies including RoCEv2, InfiniBand, eBGP, EVPN/VXLAN, QoS, and ACLs.
  • Experience architecting resilient high performance networks including fat-tree and multi-layer spine-leaf topologies, MLAG/LACP.
  • Prior experience with network automation using Ansible, Bash, Python.
  • Comfortable operating and have opinions about Arista, Cisco, Dell, Juniper, OCP, SONiC.
  • Comfortable configuring NGFW's.
  • Appreciate and value good documentation.

Nice to Have

  • Familiarity with and/or involvement with the Ultra Ethernet consortium.
  • Experience with InfiniBand partitioning.
  • Prior experience with GPU clusters (NVIDIA A100 or newer).
  • Experience with high performance storage systems (WEKA, VAST, Ceph, etc.).

Technical Stack

  • Protocols: RoCEv2, InfiniBand, eBGP, EVPN/VXLAN, QoS, ACLs
  • Vendors & Platforms: Arista, Cisco, Dell, Juniper, OCP, SONiC
  • Automation: Ansible, Bash, Python

Benefits & Compensation

  • Generous equity grant
  • Competitive salary
  • Visa Sponsorships
  • 401(k) matching up to 4%
  • Medical, dental & vision insurance fully covered for employees and dependents
  • Unlimited paid time off
  • 10+ observed holidays
  • Paid parental leave for biological, adoptive, and foster parents
  • Daily lunch covered
  • Unlimited office book budget

Work Mode

This is a hybrid position based in San Francisco, CA.

The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment. We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethical origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law. We welcome the opportunity to consider qualified applicants with prior arrest or conviction records.

Required Skills
RoCEv2InfiniBandeBGPEVPN/VXLANQoSACLsAristaCiscoDellJuniperNetwork EngineeringData Center NetworkingNetwork ArchitectureNetwork Security
Your first international client?

Don't lose them over invoicing

Clients ghost freelancers with unprofessional invoicing. Glopay gives you a real EU company partnership so they take you seriously from invoice #1.

Instant EU company partnership
Invoice builder with your branding
Automated payment reminders
Real-time payment tracking
Get EU company now
Ready in 24 hours
About company
The San Francisco Compute Company

The company aims to make compute a tradable commodity by building a venue where compute contracts are traded in real-time, bringing traders into the supply chain. The goal is to enable buyers to get good prices for any order size and sellers to instantly book out their clusters.

Visit website
Job Details
Category infrastructure
Posted 7 months ago