Matterworks

Data Manager

Reposted 22 Days Ago
Hybrid
Somerville, MA
Senior level
The Data Manager will own data strategy, management, and governance, ensuring data quality and integration across various scientific datasets whilst collaborating with multiple teams.
About Us

At Matterworks we are building AI tools to extract insights from the ever-growing corpora of biological data and to unlock opportunities in therapeutic discovery, development, and manufacturing. We are building large-scale deep learning models of biological data to predict the phenotype and behavior of biological systems.

Position Overview

Matterworks is seeking a Data Manager (Bioinformatics / Cheminformatics) to build our data management practice, owning the strategy, processes, and day-to-day execution that turn complex, messy chemical and biological datasets into high-quality, well-governed training corpora and product-ready data assets.

You’ll be the connective tissue between applied science, AI, product, and our data platform/engineering team, helping to answer three questions: which data is most valuable, how do we onboard it quickly, and how do we keep it consistently high quality over time?

This role will start as an individual contributor with end-to-end ownership, with a growth path to leading a function as we scale.

Key Responsibilities

  • Data Strategy & E2E Ownership: Partner with scientific, ML, and product stakeholders to define a data roadmap: which datasets move the needle, which should be refreshed, and what “good enough” looks like for each use case. Establish clear success metrics for onboarding speed, dataset quality, and downstream usability (e.g., fewer training/data failures, higher match rates, better coverage, higher-confidence labels).

  • Dataset Sourcing, Discovery, and Intake: Proactively scout and integrate public and client datasets, plus relevant literature and reference materials, to keep our corpora current and comprehensive. Design a repeatable dataset intake workflow including provenance, source tracking, and refresh cadence.

  • Data Curation, Quality and Governance: Define curation standards that make data consistent across sources and modalities, including compound identity management, biological/sample metadata standardization, and schema + conventions mappings. Build a scalable approach to integrating metabolomics now and expanding to additional omics without reinventing everything each time. Develop practical QC/QA frameworks that combine scientific judgment with repeatable checks.

  • Cross-Functional Collaboration: Work closely with leadership in engineering, AI, product, and scientific discovery to align initiatives with company-wide goals. Draw on your experience to keep initiatives moving smoothly. Translate ambiguous questions into crisp data requirements, priorities, and execution plans. Build trust across disciplines by being both scientifically rigorous and pragmatically execution-oriented.

About You

  • 6+ years of demonstrated experience owning scientific data work end-to-end (curation, standardization, QC, documentation, governance) in bioinformatics, cheminformatics, computational biology, scientific data engineering, or related roles.

  • Ability to navigate complex chemical and biological datasets, reconcile identifiers/metadata across sources, and make data consistently usable for end users.

  • Strong attention to detail with a keen ability to balance priorities and deliver incremental value while operating with minimal oversight.

  • Comfortable building structure from scratch: you can define processes, set standards, and iterate toward scalable practices in an early-stage environment.

  • Practical proficiency in Python and SQL for data investigation, transformation, QC, and automation.

  • Familiarity with modern data workflows (structured + semi-structured data, pipelines, reproducibility, documentation).

  • Experience with chemical structure representations and normalization (e.g., SMILES/InChI, canonicalization, salt/tautomer handling, stereochemistry considerations).

  • Demonstrated ability to communicate and collaborate with product, machine learning, applied science, and engineering teams while distilling complex business questions into valuable, reliable technical solutions.

  • A passion for contributing to an early-stage startup where autonomy, eagerness to learn, and enthusiasm for solving novel scientific challenges prevail over rigid processes and egos.

Working at Matterworks

Given the cross-disciplinary and innovative nature of our work, effective collaboration and communication are critical to our progress. We operate in a flexible hybrid model that accommodates both fully remote team members and those who work full-time from our Somerville, MA office. While some positions may require regular in-person presence for hands-on work or local collaboration, many roles can be performed remotely with team members distributed across various locations.

Compensation and Benefits

Matterworks offers a competitive base salary, stock options, and benefits (health & dental, vision, long- and short-term disability, life insurance, 401k with company match). Employees enjoy a flexible work & unlimited time away policy, commuter benefits and parking, regular team meals and outings, and company support for continued education/coursework and conference participation.

Matterworks, Inc. is an equal opportunity employer. All candidates for employment at Matterworks are considered without regard to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, sexual orientation, or any other category protected by law.

Top Skills

Python
SQL

HQ

Matterworks Somerville, Massachusetts, USA Office

444 Somerville Ave, Somerville, MA, United States, 02143
