MSD Animal Health Technology Labs
Associate Director, Data Engineer: DSCS Digital Data Strategy
Job Description
Location: West Point, PA; Rahway, NJ; Boston, MA
We are a global biopharmaceutical leader with a diverse portfolio of prescription medicines, oncology, vaccines and animal health products. We are driven by our purpose to develop and deliver innovative products that save and improve lives. With 69,000 employees operating in more than 140 countries, we offer state of the art laboratories, plants and offices that are designed to inspire our employees as we learn, develop and grow in our careers. We are proud of our over 125 years of service to humanity and continue to be one of the world’s biggest investors in Research & Development.
We are seeking an Associate Director, Data Engineer to join our Digital Insights team within the Development Sciences and Clinical Supply (DSCS) Digital Technologies organization. Digital is the multiplier that will allow DSCS to deliver better experiments faster, efficient filing and launch, more robust supply chains and higher-confidence decisions across the portfolio.
The DSCS Digital Technologies organization is responsible for the invention and application of new digital tools/workflows to support scientists across drug substance development, drug product development and analytical development. We aspire to embed digital technologies into the fabric of DSCS culture to drive transformational impact. The tools that we develop are as diverse as the teams developing them, and in this Associate Director, Data Engineer role, the successful candidate will serve as a domain owner for data engineering in the biologics space — designing, building, and governing data pipelines that capture, curate, and deliver experimental and process data, including chromatography, filtration, and purification, into digital initiatives spanning process characterization, data lineage, and multivariate analytics. This role and the work being done to build this data product will be associated with efforts to establish DSCS Digital Data Strategy.
This engineer will help establish end-to-end data strategy in the biologics space: cataloging all processes, analytics, and systems; enforcing ontology alignment across data sources; aligning proactively with colleagues as new automated workflows create new data streams; and ensuring seamless data handoffs to adjacent domains. The pipelines and data products built here will directly enable colleagues across Digital Insights to deploy modeling, optimization, and decision-support tools that de-risk and accelerate biologics drug substance manufacturing from bench through commercial scale and across multiple manufacturing sites.
This role sits within the Digital Insights team which is working to digitally enable a decision engine for DSCS, by innovating solutions through strong partnership in the domains of Data Science, Data Analysis, Informatics, Multi-Omics, Predictive Science, and Data Engineering.
Responsibilities:
- Serve as a domain owner in the biologics data engineering space, maintaining full awareness of all digital projects, data sources, systems, and data flows within the domain.
- Inform work being done to establish DSCS Digital Data Strategy.
- Design and implement robust, scalable data pipelines that ingest experimental and process data from biologics source systems, including process historians, chromatography systems, electronic lab notebooks, and analytical instruments.
- Deliver analysis-ready datasets to support digital initiatives, including process characterization models, data lineage tracking, multivariate analytics, and cross-site manufacturing connectivity.
- Define and enforce data standards, metadata schemas, and ontology mappings that make biologics data interoperable and readily consumable by modeling and optimization workflows.
- Align proactively with automation colleagues, anticipating when new or modified automated workflows create new data streams that require pipeline development and ontology mapping.
- Own and govern system of record standards for biologics, ensuring consistent configuration and data entry practices across experiments, molecules, and sites.
- Catalog all processes, analytical methods, instruments, and digital systems within the biologics domain, creating a comprehensive map of the data landscape.
- Develop and maintain data visualizations, dashboards, and reports that enable scientists to explore process data across runs, molecules, scales, and manufacturing sites.
- Influence digital data strategy for biologics by identifying opportunities to improve data capture practices at the source and reduce friction between experimentation and modeling.
- Mentor and guide supporting data engineers working across modalities, ensuring alignment with domain strategy and ontology governance.
- Maintain and version all pipeline code in GitHub, following team standards for code review, documentation, and deployment.
- Build strong partnerships with process development scientists, analytical scientists, and manufacturing teams to gather requirements and shape the digital data strategy for the domain.
- Demonstrate excellent interpersonal, communication, and collaboration skills.
- Embrace and model our core values of diversity and inclusion, including fostering a supportive culture where all can thrive.
- Collaborate effectively in a dynamic, integrated, and multidisciplinary team environment.
Education Minimum Requirement:
- Ph.D. in Chemical Engineering, Biochemical Engineering, Biochemistry, Computer Science, Data Science, Engineering, Chemistry, Biology, Pharmaceutical Sciences, or a closely-related field with at least 3 years of industrial/pharmaceutical or relevant experience.
- M.S. in Chemical Engineering, Biochemical Engineering, Biochemistry, Computer Science, Data Science, Engineering, Chemistry, Biology, Pharmaceutical Sciences, or a closely-related field with at least 5 years of industrial/pharmaceutical or relevant experience.
- B.S. in Chemical Engineering, Biochemical Engineering, Biochemistry, Computer Science, Data Science, Engineering, Chemistry, Biology, Pharmaceutical Sciences, or a closely-related field with at least 7 years of industrial/pharmaceutical or relevant experience.
Required Experience and Skills:
- Proficient in Python and/or R programming. Comfortable working in development environments such as Jupyter, Posit/RStudio, or VS Code.
- Solid SQL skills with hands-on experience writing and optimizing queries against relational databases and data warehouses.
- Experience with ETL/ELT processes and building data pipelines in a scientific or pharmaceutical context.
- Familiarity with cloud platforms (AWS, Azure, or GCP) for data storage, processing, and integration.
- Working knowledge of how process models, multivariate analyses, and statistical tools consume and depend on experimental data — sufficient to anticipate modeler needs and deliver appropriately structured datasets.
- Experience defining or enforcing data standards, metadata schemas, or ontology mappings in a scientific or pharmaceutical context.
- Familiarity with version control systems (Git/GitHub) and collaborative software development practices.
- Demonstrated ability to lead technical initiatives, mentor junior engineers, and influence data strategy across multiple stakeholders.
- Ability to deliver complex solutions under compressed timelines in a dynamic environment.
Preferred Experience and Skills:
- Prior hands-on experience in biologics process development — including chromatography, filtration, purification, or formulation — with a demonstrated transition into a data engineering, data science, or computational role.
- Experience with data pipeline and analytics platforms such as Databricks, including notebook-based development, workflow orchestration, and Delta Lake.
- Experience with data visualization tools (Streamlit, Shiny, PowerBI, Spotfire, or Tableau) for building scientist-facing dashboards and exploratory data applications.
- Familiarity with ontology frameworks or standardized data models (e.g., Allotrope Simple Model, ISA-88, OPC-UA) and experience mapping instrument data to structured schemas.
- Understanding of Design of Experiments (DoE) methodologies and process characterization study designs — sufficient to structure data for statistical analysis of critical process parameters (CPPs) and critical quality attributes (CQAs).
- Experience with data lineage concepts and building traceability across experimental systems, materials, and manufacturing steps.
- Knowledge of regulatory expectations relevant to biologics process development, process characterization, and process validation (e.g., ICH Q8-Q12, process validation lifecycle, comparability studies).
- Experience working alongside or supporting lab automation teams, including integrating data from newly automated laboratory workflows.
- Experience with cross-site data integration, including harmonizing data from multiple manufacturing facilities with different systems and conventions.
- Evidence of cross-functional collaborations spanning laboratory, manufacturing, modeling, and digital teams.
Required Skills:
Biochemistry, Biopharmaceutical Industry, Change Catalyst, Chemical Engineering, Chromatography, Computer Science, Data Science, Data Visualization, Design Changes, Digital Strategy, Multivariate Data Analysis, Pharmaceutical Sciences, Production Optimization, Strategic PlanningPreferred Skills:
Current Employees apply HERE
Current Contingent Workers apply HERE
US and Puerto Rico Residents Only:
Our company is committed to inclusion, ensuring that candidates can engage in a hiring process that exhibits their true capabilities. Please click here if you need an accommodation during the application or hiring process.
As an Equal Employment Opportunity Employer, we provide equal opportunities to all employees and applicants for employment and prohibit discrimination on the basis of race, color, age, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or other applicable legally protected characteristics. As a federal contractor, we comply with all affirmative action requirements for protected veterans and individuals with disabilities. For more information about personal rights under the U.S. Equal Opportunity Employment laws, visit:
EEOC Know Your Rights
EEOC GINA Supplement
We are proud to be a company that embraces the value of bringing together, talented, and committed people with diverse experiences, perspectives, skills and backgrounds. The fastest way to breakthrough innovation is when people with diverse ideas, broad experiences, backgrounds, and skills come together in an inclusive environment. We encourage our colleagues to respectfully challenge one another’s thinking and approach problems collectively.
Learn more about your rights, including under California, Colorado and other US State Acts
The salary range for this role is
$129,000.00 - $203,100.00This is the lowest to highest salary we in good faith believe we would pay for this role at the time of this posting. An employee’s position within the salary range will be based on several factors including, but not limited to relevant education, qualifications, certifications, experience, skills, geographic location, government requirements, and business or organizational needs.
The successful candidate will be eligible for annual bonus and long-term incentive, if applicable.
We offer a comprehensive package of benefits. Available benefits include medical, dental, vision healthcare and other insurance benefits (for employee and family), retirement benefits, including 401(k), paid holidays, vacation, and compassionate and sick days. More information about benefits is available at https://jobs.merck.com/us/en/compensation-and-benefits.
You can apply for this role through https://jobs.merck.com/us/en (or via the Workday Jobs Hub if you are a current employee). The application deadline for this position is stated on this posting.
San Francisco Residents Only: We will consider qualified applicants with arrest and conviction records for employment in compliance with the San Francisco Fair Chance Ordinance
Los Angeles Residents Only: We will consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with the requirements of applicable state and local laws, including the City of Los Angeles’ Fair Chance Initiative for Hiring Ordinance
Search Firm Representatives Please Read Carefully
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company. No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails.
Employee Status:
RegularRelocation:
DomesticVISA Sponsorship:
YesTravel Requirements:
10%Flexible Work Arrangements:
HybridShift:
Not IndicatedValid Driving License:
NoHazardous Material(s):
n/aJob Posting End Date:
07/9/2026*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.
Similar Jobs
What you need to know about the Boston Tech Scene
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories



