Genesis AI is seeking a Staff Software Engineer, Data, to build and maintain the large-scale data pipelines critical for training and evaluating robotics foundation models. You will own the core data infrastructure powering our work on general-purpose Physical AI.
What You'll Do
- Design, build, and maintain large-scale data pipelines (batch and streaming) for robotics foundation model training and evaluation at petabyte scale.
- Own core data infrastructure: data model, storage systems, ingestion pipelines, transformation frameworks, and orchestration layers.
- Standardize data models and unify processing pipelines across real-world teleoperation and synthetic simulation datasets.
- Collaborate with a team of driven individuals committed to building general-purpose Physical AI.
What We're Looking For
- Excellent software engineering skills (Python, Go, or similar).
- Extensive experience (8+ years) designing, building, and maintaining large-scale data pipelines.
- Deep understanding of distributed systems (Spark, Kafka, or similar).
- Extensive experience with data storage technologies (data lakes, warehouses, object stores like S3).
- Experience running and maintaining production-grade infrastructure (Kubernetes, Terraform).
Nice to Have
- Experience supporting AI systems, particularly embodied AI such as self-driving systems.
Technical Stack
- Languages: Python, Go
- Distributed Systems: Spark, Kafka
- Storage: S3
- Infrastructure: Kubernetes, Terraform
Work Mode
This is a global position open to candidates based in the Bay Area or Paris, or working remotely.
Genesis AI is an equal opportunity employer.


