Responsibilities
- Design core, backend software components
- Perform architecture and research work for AI workloads
- Interface with other teams to incorporate their innovations
- Analyze and improve efficiency, scalability, and stability of various system resources
- Conduct design and code reviews
- Create services, tools and developer documentation
- Create testing frameworks for robustness and fault-tolerance
- Participate in an on-call rotation to respond to critical incidents as needed
Requirements
- 5+ years experience writing high-performance, well-tested, production quality code
- Bachelor's or Master's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- Demonstrated experience in building large scale, fault tolerant, distributed systems like storage, search, and computation
- Expert level programmer in one or more of Golang, Java, Rust, or C/C++
- Demonstrated experience with relational (e.g., PostgreSQL) and non-relational (e.g., ClickHouse, Redis) databases
- Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
- Excellent understanding of low level operating systems concepts including multi-threading, memory management, networking and storage, performance and scale
Nice to Have
- Experience with Kubernetes/Nomad and containers is a plus
- Experience with virtual machines, microVMs, or sandboxing technologies (e.g., gVisor, Firecracker, QEMU) is a plus
- Experience with Ceph is a plus
- Experience with Elixir is a plus
Benefits
- Competitive compensation
- Startup equity
- Other competitive benefits