Remote (Global), Full-time

Luma AI is hiring a Research Scientist / Engineer – Data

About the Role

Luma AI is looking for a Research Scientist / Engineer – Data to build the fundamental data layer that unlocks advanced capabilities in our foundation models. In this role, you’ll tackle open-ended challenges around how different modalities combine to create powerful and versatile AI systems.

What You'll Do

  • Identify capability gaps and research solutions
  • Design datasets and data-mixture ablations to systematically improve model capabilities across vision, audio, and language
  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities
  • Create prototypes and demonstrations that showcase new multimodal capabilities

What We're Looking For

  • Strong programming skills in Python and PyTorch
  • Experience with large-scale dataset creation and management
  • Experience with multimodal data processing pipelines
  • Understanding of computer vision, audio processing, and/or natural language processing techniques

Nice to Have

  • Expertise working with interleaved multimodal data
  • Hands-on experience with Vision Language Models, Audio Language Models, or generative video models

Technical Stack

  • Python
  • PyTorch

Required Skills
Python, PyTorch, Machine Learning, Computer Vision, 3D Reconstruction, Generative AI, Deep Learning, Data Processing, Large-Scale Systems, Research, Model Training, Neural Networks, Data Pipelines, Distributed Computing
About the Company
Luma AI

Luma AI is a technology company focused on developing advanced multimodal AI foundation models, working on innovative approaches to combining vision, audio, and language data.

Job Details
Category: Data
Posted: 7 months ago