Join Nuro as a Software Engineer, ML Data, where you will serve as a technical leader for our ML Data engine at the intersection of Autonomy, Machine Learning, and Infrastructure. You will design and build the systems that feed autonomy AI models, transforming massive amounts of data into high-value training signals.
What You'll Do
- Design and build scalable data ingestion and processing pipelines that turn data streams into targeted training datasets.
- Lead initiatives to improve data quality, detect anomalies, and manage out-of-distribution examples to ensure robust model training and deployment.
- Work across autonomy teams and data infra teams to build effective ML data pipelines and products for ML engineers.
- Develop infrastructure and visualization tools that allow ML researchers to easily introspect data, identify model failure modes, query for new data samples, and understand data distribution shifts.
- Collaborate closely with the data operations team to define quality standards, automate quality control (QC), and streamline the feedback loop between model performance and annotation guidelines.
- Lead the engineering effort to operationalize research-grade active learning methods, building systems that compute embeddings, run inference at scale, manage vector databases, and automatically sample the most informative data points for labeling.
What We're Looking For
- 7+ years of experience, with a proven track record of technical leadership in architecting and delivering complex, multi-system ML data engineering systems.
- B.S./M.S. in Computer Science, Artificial Intelligence, Electrical Engineering, Robotics, or equivalent practical experience.
- Understanding of end-to-end ML data pipelines and their interaction with model training and evaluation.
- Strong proficiency in C++ and Python, with petabyte-level data management experience.
- Experience taking data concepts (e.g., 'uncertainty sampling') and turning them into stable, 24/7 production services.
Nice to Have
- Prior experience at large companies with productionized AI systems, working on data engines for large-scale machine learning.
- Experience in workflow orchestration, introspection UI/UX for data understanding, and ML frameworks for foundation model training.
- Expertise in data-centric AI topics (active learning, pre-training) and their application in autonomous systems.
- Subject matter expertise and research in one or more of the following areas: Machine Learning, Deep Learning, or Robotics, along with some familiarity with the state of the art in ML for autonomous driving and data utilization.
Technical Stack
- C++
- Python
Team & Environment
You will be a member of the Autonomy team, partnering with Autonomy ML researchers and infrastructure teams.
Benefits & Compensation
- Compensation: $193,930 - $352,290 + equity
- Annual performance bonus
- Competitive benefits package
Nuro celebrates differences and is committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.


