Full-time

Luma AI is hiring a Research Scientist / Engineer – Multimodal Capabilities

About the Role

Luma AI is looking for a Research Scientist / Engineer – Multimodal Capabilities to unlock advanced behaviors in our foundation models. You'll join the Multimodal Capabilities team to conduct strategic research on combining vision, audio, and language to solve fundamental questions.

What You'll Do

  • Collaborate with the Foundation Models team to identify capability gaps and research solutions.
  • Design datasets, experiments, and methodologies to systematically improve model capabilities across vision, audio, and language.
  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities.
  • Create prototypes and demonstrations that showcase new multimodal capabilities.

What We're Looking For

  • Strong programming skills in Python and PyTorch.
  • Experience with multimodal data processing pipelines and large-scale dataset curation.
  • Understanding of computer vision, audio processing, and/or natural language processing techniques.

Nice to Have

  • Expertise working with interleaved multimodal data.
  • Hands-on experience with Vision Language Models, Audio Language Models, or generative video models.

Technical Stack

  • Python, PyTorch

Team & Environment

You will be part of the Multimodal Capabilities team and collaborate closely with the Foundation Models team.

Benefits & Compensation

  • Salary: $200,000 - $300,000/yr + competitive equity in the form of stock options.
  • A comprehensive benefits plan.

Luma AI is an equal opportunity employer.

Required Skills
PythonPyTorchMachine LearningMultimodal AIComputer VisionNatural Language ProcessingDeep LearningResearchModel TrainingLarge-scale Systems
Earn more as a remote developer

Performance pay that rewards your skills

Iglu's revenue-sharing model means top performers earn significantly more than traditional salaries. Choose your projects, deliver great work, and see it reflected in your pay.

Revenue-sharing compensation
Project choice & autonomy
International client base
Career growth support
Check compensation
Top earners exceed market rate
About company
Luma AI

Luma AI is a technology company focused on developing advanced multimodal AI foundation models, working on innovative approaches to combining vision, audio, and language data.

Visit website
Job Details
Category data
Posted 7 months ago