The Staff Storage Engineer will lead strategy, architecture, and organizational influence in designing and scaling high-performance storage systems for AI/ML workloads at Lambda. This role involves driving storage solution selection, optimizing for AI workloads, improving operations, and collaborating with leadership on technical requirements and deployment plans.
What You'll Do
- Lead the RFP process and drive evidence-based storage solution selection and vendor evaluations.
- Develop an in-depth understanding of AI/ML workload profiles to influence future storage architecture and performance tuning.
- Identify and lead high-impact operational improvements and cross-functional deployment plans.
- Partner with leadership during deal formation to gather technical requirements and inform solution design.
- Delegate complex engineering tasks and maintain consistent, proactive communication with the engineering leadership team.
What We're Looking For
- 8+ years of experience designing, building, and operating large-scale multi-petabyte storage production environments
- Familiarity with one or more storage solutions of the following vendors: Vast, Weka, DDN, NetApp, PureStorage, Dell, IBM, HPE
- Understanding of File, Block, and Object storage types
- Knowledge of Storage Network Access Protocols such as NFS, SMB, and POSIX-compliant protocols.
- Experience with NVMEoverFabricStorage Transport Protocols: NVME/TCP, NVME/IB, or NVME/RoCE
- Understanding of Storage performance via RDMA, GPUDirect Storage, parallel file systems
- Knowledge of Encryption, storage security, and multi-tenancy strategies
- Understanding of Storage data-reduction, compression, and encryption
- Experience with Backup and data protection
- 5+ years of experience in Infrastructure as Code (e.g. Terraform, Ansible).
Nice to Have
- Experience with Kubernetes, including CSI and COSI drivers and CNI’s
- Deep understanding of storage performance
- Strong understanding of public cloud features (e.g., SDN, block storage, distributed file systems, identity management)
- Experience deploying, operating, and maintaining Software Defined Storage
- Have implemented either open-source or commercial monitoring solutions of storage and storage-adjacent solutions
Technical Stack
Vast, Weka, DDN, NetApp, PureStorage, Dell, IBM, HPE, NFS, SMB, POSIX, NVME/TCP, NVME/IB, NVME/RoCE, RDMA, GPUDirect Storage, parallel file systems, Terraform, Ansible, Kubernetes, CSI, COSI, CNI, SDN, block storage, distributed file systems, identity management, Software Defined Storage, monitoring solutions
Team & Environment
Product Engineering at Lambda is responsible for building and scaling the cloud offering, including the Lambda website, cloud APIs and systems, and internal tooling for system deployment, management, and maintenance. The Infrastructure Engineering team integrates advanced storage, networking, and compute hardware to build high-performance clusters. The team has 500+ employees and reports to the engineering leadership team.
Benefits & Compensation
- Generous cash & equity compensation
- Health, dental, and vision coverage for you and your dependents
- Wellness and commuter stipends for select roles
- 401k Plan with 2% company match (USA employees)
- Flexible paid time off plan that we all actually use
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. Equity compensation is generous, and additional benefits include wellness and commuter stipends for select roles and a 401k Plan with 2% company match for USA employees.
Work Mode
This position requires presence in our San Francisco or San Jose office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
