Khan Academy is hiring a Senior Platform Engineer I, ML Data Systems for a 24-month fixed-term role. You will evolve our evaluation dataset tools to meet the growing platform needs of AI-based tutoring. This position bridges ML operations, data engineering, and data science, enabling our AI systems to learn from reliable, well-structured datasets that reflect the diversity of real learners.
What You'll Do
- Evolve and maintain pipelines for transforming raw trace data into ML-ready datasets.
- Clean, normalize, and enrich data while preserving semantic meaning and consistency.
- Prepare and format datasets for human labeling, and integrate results into ML datasets.
- Develop and maintain scalable ETL pipelines using Airflow, DBT, Go, and Python running on GCP.
- Implement automated tests and validation to detect data drift or labeling inconsistencies.
- Collaborate with AI engineers, platform developers, and product teams to define data strategies for continuously improving our AI-based tutoring quality.
- Contribute to shared tools and documentation for dataset management and AI evaluation.
- Inform our data governance strategies for proper data retention, PII controls, and isolation of sensitive data.
What We're Looking For
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, a related field, or equivalent professional experience.
- 5 years of Software Engineering experience, including significant time working with large ML datasets.
- Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect).
- Experience with data versioning tools (e.g., DVC, LakeFS) and cloud storage systems.
- Familiarity with machine learning workflows—from training data preparation to evaluation.
- Familiarity with the architecture and operation of large language models, and a nuanced understanding of their capabilities and limitations.
- Attention to detail and an obsession with data quality and reproducibility.
Nice to Have
- Experience with labeling platforms (e.g., Label Studio, Scale AI, Toloka) or human-in-the-loop systems.
- Understanding of ML evaluation techniques, including prompt-based and generative model metrics.
- Exposure to MLOps practices such as model registry, feature store, or continuous evaluation.
- Background in education technology or other human-centered AI applications.
Technical Stack
- Go, Python, SQL, GraphQL, JavaScript, React, React Native, Redux, Airflow, DBT, GCP
Team & Environment
You’ll work closely with other AI engineers, platform developers, and labeling teams.
Benefits & Compensation
- Competitive salaries
- Ample paid time off as needed
- 8 pre-scheduled Wellness Days in 2026
- Remote-first culture catering to your time zone
- Generous parental leave
- An exceptional team that trusts you and gives you the freedom to do your best
- The chance to work on a deeply meaningful mission and high-impact products
- Opportunities to connect through affinity, ally, and social groups
- 401(k) + 4% matching & comprehensive insurance (medical, dental, vision, life)
- Compensation: $137,871 - $172,339 USD / $186,306 - $232,883 CAN
Work Mode
This is a global, remote-first role open to candidates in the US and Canada.
Khan Academy is an equal opportunity employer.






