Eli Lilly and Company Logo

Eli Lilly and Company

Discovery Data Team - Engineer

Reposted 14 Days Ago
Remote
Hiring Remotely in US
155K-242K Annually
Senior level
Remote
Hiring Remotely in US
155K-242K Annually
Senior level
Lead the Discovery Data Team in designing scalable data infrastructures for molecule discovery, focusing on cloud-native solutions and data pipelines.
The summary above was generated by AI

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.

The Discovery Data Team (DDT) is accelerating molecule discovery through the integration of high-throughput lab data, next-generation sequencing (NGS), lab automation, and machine learning. We’re championing scalable, cloud-native infrastructure to power data pipelines and APIs that unify experimental and computational datasets across the molecule discovery lifecycle and modalities.

We’re seeking a Discovery Data Team Engineer to design and implement robust, scalable infrastructure for ingesting and processing scientific datasets—especially NGS and experimental workflows—from lab instruments, ELNs, and cloud storage systems. You’ll play a key role in leading the and generating the technical, engineering strategy and collaborating closely with scientific and Tech@Lilly team.  You will also lead the strategy to build data pipelines, APIs, and workflow orchestration platforms across AWS and modern data technologies. As the first engineer on the DDT, you’ll also work closely with bench scientists, computational scientists and bioinformatician, and Tech@Lilly on several data initiatives leading the technical strategy and influencing stakeholders and informing the leadership on the technical roadmaps.

Key Responsibilities:

  • Serve as a technical lead and data architect within the Discovery Data Team in Molecule Discovery
  • Thought partner to the DDT head on engineering and technical strategy for projects
  • Influence cross-functional partners and drive the technical design of new data products and pipelines
  • Lead a team of engineers to catalyze and execute on data initiatives in molecule discovery
  • Build and scale cloud-native infrastructure to support data ingestion, processing, and retrieval for molecule discovery and sequencing workflows.
  • Develop workflows using Nextflow for NGS data processing and integrate them into larger data pipeline systems.
  • Integrate and extract data from lab instruments and ELNs (e.g., Benchling, Signals) and route them into structured data lakes or databases.
  • Develop and maintain APIs using FastAPI to interface between data sources, pipelines, and downstream applications.
  • Design and implement data pipelines using Airflow, PostgreSQL, Spark, and columnar storage formats (e.g., Parquet, Redshift).
  • Deploy, monitor, and optimize infrastructure on AWS, including services like Lambda, Batch, S3, and EC2.
  • Build secure, scalable APIs for data sharing and querying between storage systems and data consumers.
  • Work cross-functionally with bioinformaticians, data scientists, and lab informatics teams to enable seamless scientific data workflows

Basic Requirements:

  • Bachelor's degree or higher degree in engineering, computer science or related sciences fields
  • 10+ years of work experience in leading engineering teams and working in cloud infrastructure or DevOps roles with strong focus on strategic leadership and data systems

Additional Skills/Preferences:

  • Familiarity with columnar data formats and scalable storage architectures (e.g., data lakes, Redshift, Parquet).
  • Excellent problem-solving skills and ability to troubleshoot complex issues.
  • Strong communication and collaboration skills.
  • Experience with Nextflow or similar workflow languages for NGS or scientific data processing.
  • Strong hands-on experience with AWS services, especially Lambda, Batch, S3, and container orchestration.
  • Proficiency with Python and frameworks like FastAPI for developing APIs.
  • Experience with scientific data systems and ELNs like Benchling and Signals.
  • Strong understanding of data pipeline orchestration (Airflow), distributed compute (Spark), and data modeling for scientific datasets.
  • Experienced in developing solutions using agile methodology (e.g. Scrum) and tools (e.g. JIRA)
  • Experience working with lab instrumentation data extraction and integration into cloud data stores.
  • Background in bioinformatics, molecular biology, or a related life sciences field.
  • Experience in regulated or GxP-compliant environments.
  • Knowledge of scientific computing environments and HPC systems.
  • Familiarity with workflow containerization (Docker, Singularity) and CI/CD pipelines.

Why Join Us?

  • Be part of a mission-driven, cutting-edge data team advancing scientific discovery through modern data and infrastructure tools.
  • Solve challenging technical problems with real-world impact at one of the biggest healthcare companies in the world
  • Competitive salary, stock options, and excellent benefits package

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.

Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.


Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women’s Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.

Actual compensation will depend on a candidate’s education, experience, skills, and geographic location.  The anticipated wage for this position is

$154,500 - $242,000

Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities).Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly’s compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.

#WeAreLilly

Top Skills

Airflow
AWS
Batch
Benchling
Docker
Ec2
Fastapi
Lambda
Nextflow
Parquet
Postgres
S3
Signals
Singularity
Spark

Similar Jobs

35 Minutes Ago
Remote or Hybrid
2 Locations
50K-50K Annually
Mid level
50K-50K Annually
Mid level
Fintech • Machine Learning • Payments • Software • Financial Services
As a Senior Advocacy Coordinator, you'll enhance customer experiences in Small Business Banking through first call resolution and relationship building.
Top Skills: Google SuiteMS Office
4 Hours Ago
Remote or Hybrid
US
70K-97K Annually
Mid level
70K-97K Annually
Mid level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
The GRC Consultant II supports information security risk management and compliance efforts, ensuring risks are assessed, mitigation efforts tracked, and stakeholder relationships developed.
Top Skills: GdprGovernance Risk And Compliance (Grc) SoftwareGrc Tools And PlatformsIso 27001NistPciPci DssSocSox
4 Hours Ago
Remote or Hybrid
US
55K-107K Annually
Senior level
55K-107K Annually
Senior level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
Lead strategic plans and business development for technology and staffing solutions, build relationships with clients, influence stakeholders, and drive revenue growth.
Top Skills: Cloud MigrationsDevops SolutionsInfrastructure As CodeSoftware DevelopmentTech Staffing ResourcesTechnology Solutions

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account