San Francisco, California, United States Hybrid Employment USD 380,000 - 445,000 Yearly

OpenAI is hiring a Research Engineer/Scientist

About the Role

OpenAI is hiring a Research Engineer/Scientist focused on RLHF and post-training for personalized, multimodal AI systems. You’ll join a team building the learning and evaluation foundations to make models more context-aware, adaptive, and useful over time.

What You'll Do

  • Develop RLHF and post-training methods for multimodal models.
  • Build reward models and preference-learning pipelines for adaptive, personalized model behavior.
  • Design datasets, rubrics, and evaluation frameworks that capture user preferences, contextual appropriateness, and long-term value in realistic tasks.
  • Run experiments on policy improvement using explicit feedback, implicit signals, and model-based grading.
  • Work on long-horizon evaluation problems, where model quality depends on whether behavior improves outcomes over time.
  • Collaborate closely with safety researchers to ensure adaptation and personalization remain aligned, interpretable, and bounded by clear constraints.
  • Prototype and iterate quickly on training recipes, reward formulations, data pipelines, and evaluation suites for product-relevant behaviors.
  • Help define how OpenAI measures success for personalized AI systems including trust, appropriateness, and long-term user benefit.

What We're Looking For

  • Strong background in machine learning research, with experience in RLHF, reward modeling, preference optimization, or post-training for large models.

Nice to Have

  • Experience in one or more of: reinforcement learning, ranking, recommender systems, personalization, memory, or human-in-the-loop evaluation.
  • Experience building datasets or eval pipelines grounded in human preferences, rubrics, or real-world product behavior.
  • Interest in multimodal AI and in how models can learn from richer interaction signals over time.
  • Desire to work on product-shaping research with high stakes for trust, alignment, and long-term user value.
  • Enjoy close collaboration with engineers, designers, and safety researchers to turn frontier research into real systems.

Team & Environment

You'll join an applied research team within the Consumer Devices group.

Work Mode

This is a hybrid role based in San Francisco, CA.

OpenAI is an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

Required Skills
machine learningRLHFreward modelingpreference optimizationpost-traininglarge language models
Need to work legally in Thailand?

Work permits without the paperwork nightmare

Thai immigration rules are strict and easy to get wrong. SVBL handles the bureaucracy — correct visa type, proper documentation, timely submissions. You focus on your work.

Right visa type for your situation
Document preparation & submission
Deadline tracking & renewals
Direct liaison with immigration
Talk to an expert
10+ years experience
About company
OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

Visit website
Job Details
Department Research and Development (R&D)
Category data
Posted 14 days ago