Lila Sciences Logo

Lila Sciences

ML Research Scientist I/II, Multimodal Data Extraction

Posted 3 Days Ago
Be an Early Applicant
In-Office
Cambridge, MA
176K-304K Annually
Expert/Leader
In-Office
Cambridge, MA
176K-304K Annually
Expert/Leader
As an ML Research Scientist, you will develop AI systems for multimodal data extraction and advance scientific knowledge structuring across domains.
The summary above was generated by AI

Your Impact at Lila

As a ML Research Scientist - Multimodal Data Extraction, you will advance Lila’s vision of scientific superintelligence by developing foundation models that autonomously read, interpret, and structure scientific knowledge across text, images, and experimental data in the physical sciences. Your research will help unify the world’s scientific information into machine-understandable form, powering reasoning, prediction, and autonomous discovery across materials science and chemistry.

What You'll Be Building

  • Research and develop AI systems that extract and structure knowledge from diverse scientific sources.
  • Design and fine-tune large language, multi-modal and specialized models for factual, interpretable data extraction.
  • Build scalable pipelines for unstructured and heterogeneous scientific data, integrating text, tables, and visuals.
  • Collaborate with domain experts to align extracted data with real-world discovery workflows.
  • Publish research that advances the state of the art in multimodal understanding and AI-driven knowledge extraction.

What You’ll Need to Succeed

  • PhD (or equivalent research experience) in Computer Science, Chemistry, Materials Science, or related field.
  • Expertise in machine learningNLP, and vision–language modeling using PyTorch and Hugging Face Transformers.
  • Proven ability to train, fine-tune, and evaluate LLMs and multimodal models for scientific data extraction.
  • Strong understanding of data structures and representations used in the physical sciences.
  • Demonstrated research impact through publications, preprints, or open-source work (e.g., NeurIPS, ICLR, ICML, ACL, EMNLP, Scientific Journals).

Bonus Points For

  • Experience with multimodal fusion architectures and document-level understanding.
  • Knowledge of scientific document parsing (OCR, table extraction, figure-caption linking).
  • Familiarity with knowledge graph construction or reasoning systems for science.
  • Experience with noisy or heterogeneous real-world scientific data.
  • Collaborative mindset and passion for advancing AI in the physical sciences.

About Lila

Lila Sciences is the world’s first scientific superintelligence platform and autonomous lab for life, chemistry, and materials science.  We are pioneering a new age of boundless discovery by building the capabilities to apply AI to every aspect of the scientific method.  We are introducing scientific superintelligence to solve humankind's greatest challenges, enabling scientists to bring forth solutions in human health, climate, and sustainability at a pace and scale never experienced before. Learn more about this mission at  www.lila.ai

If this sounds like an environment you’d love to work in, even if you only have some of the experience listed below, we encourage you to apply.

Composition

We expect the base salary for this role to fall between $176,000–$304,000 USD per year, along with bonus potential and generous early equity. The final offer will reflect your unique background, expertise, and impact.

We’re All In

Lila Sciences is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.

A Note to Agencies

Lila Sciences does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Lila Sciences or its employees is strictly prohibited unless contacted directly by Lila Science’s internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Lila Sciences, and Lila Sciences will not owe any referral or other fees with respect thereto.

Top Skills

Hugging Face Transformers
Machine Learning
Nlp
PyTorch
Vision-Language Modeling

Similar Jobs

6 Hours Ago
In-Office
Boston, MA, USA
108K-203K Annually
Mid level
108K-203K Annually
Mid level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Manage and grow a portfolio of strategic food and beverage accounts by providing tailored solutions, advocating for clients, and collaborating with internal teams to enhance operations and performance.
Top Skills: PaymentsSaaSTechnology Solutions
6 Hours Ago
In-Office
Peabody, MA, USA
Senior level
Senior level
Aerospace
Lead the design of hypersonic flight test vehicles, coordinating teams and managing timelines to ensure compliance with engineering best practices and customer requirements.
Top Skills: Cad (Solidworks Or Nx)CfdFeaNastran
6 Hours Ago
In-Office
Peabody, MA, USA
Entry level
Entry level
Aerospace
Develop custom systems for hypersonic flight vehicles, manage technical requirements, and participate in collaborative engineering efforts and testing activities.
Top Skills: Aerospace And Military Avionics StandardsComputer Modeling And Simulation Packages

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account