Responsibilities
- Lead full-cycle technical implementations for GPU-based cloud and AI infrastructure clients, starting from bare metal setup to fully tested virtual cluster environments.
- Set up and resolve issues in bare metal GPU systems, including container networking, GPU operator deployment, distributed storage solutions, and high-speed interconnects like RDMA and InfiniBand.
- Install and verify Kubernetes and virtual cluster platforms to deliver managed Kubernetes clusters powered by GPU resources.
- Collaborate with client teams to transfer knowledge and enable independent operation and expansion of the infrastructure.
- Create detailed documentation of deployment patterns and system architectures to accelerate future implementations.
- Provide field insights to engineering and product teams, identifying common infrastructure issues and influencing product development priorities.
- Support sales efforts during pre-sales engagements that require deep technical expertise to demonstrate viable use cases.
Compensation
Not specified
Work Arrangement
Not specified
Team
Not specified
Not specified