Mountain View, CA, USA or remote within Continental US, Hawaii, or Canada
Hybrid
USD 137,871 – 232,883 / year

Khan Academy is hiring a Senior Platform Engineer I, AI Evaluation (24 months fixed-term)

Responsibilities

Be fluent in the range of offline and online evaluation strategies, and know when to apply each technique over the development lifecycle
Have intuitions about how to specify eval pipelines succinctly using declarative syntax
Understand the role of stratified datasets and ground truth labeling
Appreciate the range of eval scoring schemes, from human raters to automated LLM-as-judge approaches

Requirements

Bachelor’s or Master’s degree in Computer Science, Data Engineering, related field, or equivalent professional experience.
5 years of software engineering experience, including significant time working on the evaluation of generative AI systems or other evaluations of ML model quality
Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
Familiarity with the architecture of large language models and their industry-standard APIs

Nice to Have

Experience with labeling platforms (e.g., Label Studio, Scale AI, Toloka) and human-in-the-loop concerns such as rubric development and inter-rater agreement
Exposure to MLOps practices such as model registry, feature store, or continuous evaluation
Background in education technology or other human-centered AI applications

Benefits

Competitive salaries
Ample paid time off as needed
8 pre-scheduled Wellness Days in 2026
Remote-first culture
Generous parental leave
An exceptional team that trusts you and gives you the freedom to do your best
The chance to put your talents towards a deeply meaningful mission and the opportunity to work on high-impact products that are already defining the future of education
Opportunities to connect through affinity, ally, and social groups
401(k) + 4% matching & comprehensive insurance, including medical, dental, vision, and life

Work Arrangement

Hybrid

Additional Information

As part of our hiring process, we use a secure identity verification service through CLEAR® (in partnership with Greenhouse) to confirm that each applicant is who they claim to be. CLEAR® provides a safe, consistent way to confirm identity, helping protect both applicants and the company from impersonation or fraud.
This is a 24-month fixed-term position.

About company

Khan Academy is a nonprofit with the mission to deliver a free, world-class education to anyone, anywhere. Our proven learning platform offers free, high-quality supplemental learning content and practice covering core academic subjects from Pre-K through 12th grade and early college, with a focus on math and science.


Job Details

Category infrastructure

Posted 5 months ago
