Luma AI Logo

Luma AI

Research Scientist/ Engineer – Multimodal Capabilities

Posted 4 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
200K-300K
Mid level
In-Office or Remote
2 Locations
200K-300K
Mid level
The role involves researching multimodal capabilities, designing experiments, developing evaluation frameworks, and creating prototypes related to AI systems integrating vision, audio, and language.
The summary above was generated by AI
About the Role

The Multimodal Capabilities team at Luma focuses on unlocking advanced capabilities in our foundation models through strategic research into multimodal understanding and generation. This team tackles fundamental research questions around how different modalities can be combined to enable new behaviors and capabilities, working on the open-ended challenges of what makes multimodal AI systems truly powerful and versatile.

Responsibilities
  • Collaborate with the Foundation Models team to identify capability gaps and research solutions

  • Design datasets, experiments, and methodologies to systematically improve model capabilities across vision, audio, and language

  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities

  • Create prototypes and demonstrations that showcase new multimodal capabilities

Experience
  • Strong programming skills in Python and PyTorch

  • Experience with multimodal data processing pipelines and large-scale dataset curation

  • Understanding of computer vision, audio processing, and / or natural language processing techniques

  • (Preferred) Expertise working with interleaved multimodal data

  • (Preferred) Hands-on experience with Vision Language Models, Audio Language Models, or generative video models

Compensation

  • The pay range for this position in California is $200,000 - $300,000/yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan. 

Your application is reviewed by real people.

Top Skills

Python
PyTorch

Similar Jobs

2 Hours Ago
Remote or Hybrid
United States
17-20
Entry level
17-20
Entry level
Digital Media • eCommerce • Information Technology • Marketing Tech • Retail • Social Media • Analytics
The Visual Design Intern supports the design team by adapting templates for various media, maintaining asset organization, and assisting with design execution while developing essential skills under guidance.
Top Skills: Adobe Creative SuiteFigmaIllustratorIndesignPhotoshop
2 Hours Ago
Remote or Hybrid
2 Locations
205K-258K Annually
Senior level
205K-258K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Lead Sales Operations to optimize processes, tools, and policies; support B2B software growth and enhance sales productivity through data-driven strategies.
Top Skills: RevtechSales Analytics ToolsSalesforce
4 Hours Ago
Remote or Hybrid
IL, USA
100K-148K Annually
Senior level
100K-148K Annually
Senior level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
Supervises a software engineering team to deliver Managed Services software development solutions, ensuring customer satisfaction and process improvement.
Top Skills: AgileCiscoIbmLeanMicrosoftSdlcVMware

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account