Valo Health Logo

Valo Health

Data Scientist II, Scientific Data Curation

Posted 21 Days Ago
Hybrid
Lexington, MA
Mid level
Hybrid
Lexington, MA
Mid level
As a Data Scientist II, you will curate datasets for drug discovery, collaborate with stakeholders, and ensure data quality for the Opal Platform.
The summary above was generated by AI

About Us 

Valo Health is a technology company that is integrating human-centric data and AI-powered technology to accelerate the creation of life-changing drugs for more patients faster. Valo was created with the belief that the drug discovery and development process can and should be faster and less expensive, with a much higher probability of success. We are using models early to fail less often, executing clinical trials to add valuation to the company, and generating fit-for-purpose data to feed back into Valo’s Opal Computational Platform™ as we reinvent drug discovery and development from the ground up. Disease doesn’t wait, so neither can we. 

About the Role...

As a Data Scientist II, Scientific Data Curation, you will collaborate with stakeholders across data science, data engineering, lab informatics, medicinal chemistry, and assay development to establish high quality, well-structured datasets for use within the Opal Computational Platform. These datasets will include biochemical, cellular, ADME, PK, cytotoxicity, and other assays related to drug discovery. We are looking for someone who is passionate about data, curious to explore the complexities of semi-structured datasets, and takes an intuitively quantitative approach to data exploration and curation. This role will include exploratory analysis of data, building scalable data pipelines to clean and harmonize data, and generation of visualizations, dashboards, and written communications to empower scientists across Valo to utilize large scale data as part of the Opal Platform.  

What You Will Do...

  • Develop standard procedures for establishing small molecule assay datasets used for modeling, including data cleaning, labeling, harmonizing, reporting, and QC
  • Explore and evaluate new datasets to prioritize use at Valo 
  • Collaborate with relevant stakeholders to establish high quality, curated datasets for modeling 
  • Contribute to development and refinement of data standards and data models, and establish consistent internal terminologies and ontologies 

What You Bring... 

  • Degree in a scientific field with 4+ (BS), 2+ (MS), or 0+ (PhD) years of experience 
  • Demonstrated ability to communicate with cross-functional stakeholders, including technical and non-technical audiences 
  • Experience with small molecule assay data (biochemical, cellular, ADME, PK, cytotoxicity) 
  • Demonstrated ability to clean, curate, and harmonize semi-structured data with SQL and python, with an eye towards generalizability and code re-usability 
  • A strong understanding of common ontologies, controlled vocabularies, knowledge bases, and databases related to drug discovery data (BioAssay ontology, NCBI taxonomy, HGNC, Uniprot, ChEBI, ChEMBL) 
  • Experience balancing tradeoffs between perfect and sufficient. Able to establish acceptable error rates, estimate expected errors in data, and communicate those expectations to broad audiences 

Nice to Have...

  • Understanding of FAIR data principles 
  • Experience with ontology management systems 
  • Experience with using generative AI for data cleaning and harmonization 
  • Experience with machine learning and building small molecule predictive models  

More on Valo 

Valo Health, LLC (“Valo”) is a technology company built to transform the drug discovery and development process using human-centric data and artificial intelligence-driven computation. As a digitally native company, Valo aims to fully integrate human-centric data across the entire drug development life cycle into a single unified architecture, thereby accelerating the discovery and development of life-changing drugs while simultaneously reducing costs, time, and failure rates. The company’s Opal Computational Platform™ is an integrated set of capabilities designed to transform data into valuable insights that may accelerate discoveries and enable Valo to advance a robust pipeline of programs across cardiovascular metabolic renal, oncology, and neurodegenerative diseases. Founded by Flagship Pioneering, Valo is headquartered in Lexington, MA with tissue engineering research based in New York, NY. To learn more, visit www.valohealth.com. 

Top Skills

Python
SQL
HQ

Valo Health Boston, Massachusetts, USA Office

399 Boylston Street, Boston, MA, United States

Similar Jobs

9 Hours Ago
Lexington, MA, USA
Entry level
Entry level
Healthtech • Software • Pharmaceutical
As a Research Associate in peptide chemistry, you will support drug discovery projects through peptide synthesis, purification, and analysis while collaborating with a team.
Top Skills: ChromatographyElectronic Notebook SystemsMass SpectrometrySolid Phase Peptide Synthesis (Spps)Synthetic Organic Chemistry
9 Hours Ago
Remote
Hybrid
62 Locations
104K-233K Annually
Senior level
104K-233K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead AI and ML operations, develop innovative solutions, enhance business processes, guide teams through complexity, and manage client relationships.
Top Skills: AICloud PlatformsMl
9 Hours Ago
Remote
Hybrid
67 Locations
100K-232K Annually
Senior level
100K-232K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
The role involves managing AI and GenAI data science projects, coaching teams, and utilizing advanced analytics to drive client success.
Top Skills: AWSAzureCi/CdGitGCPKerasNltkNoSQLPandasPythonScikit-LearnSQL

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account