Axle Informatics Logo

Axle Informatics

Data Platform Engineer

Reposted 8 Days Ago
Easy Apply
Remote
Hiring Remotely in USA
125K-150K Annually
Mid level
Easy Apply
Remote
Hiring Remotely in USA
125K-150K Annually
Mid level
The Senior Data Architect will develop and maintain the core data infrastructure for health research, focusing on data pipelines, orchestration, and quality systems. Responsibilities include coding, data modeling, and supporting data ingestion and transformation processes.
The summary above was generated by AI

(ID: 2026-1524)

 

Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).


Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)

About the Mission
Join the team at the forefront of revolutionizing medical research in the United States. We are building and maintaining the foundational infrastructure of the National Clinical Cohort Collaborative (N3C)—the nation’s largest and most significant public repository of harmonized electronic health record (EHR) data.

What began as a critical response to the COVID-19 pandemic has evolved into a multi-disease, terabyte-scale data resource that enables researchers across the country to accelerate discovery and improve public health outcomes. The platform integrates EHRs, claims, registries, and other data sources in a secure, regulated environment to support thousands of scientists.

This role is an opportunity to contribute to the core data platform that makes this research possible.


The Role
We are seeking a mid-level Data Platform Engineer to help build and operate the core data infrastructure that powers large-scale, regulated healthcare and research datasets. This role is ideal for an engineer who has moved beyond “entry level,” understands how production systems behave, and wants to grow into owning complex pipelines, orchestration logic, and platform reliability.

You’ll work alongside senior engineers and informatics experts to design, implement, and maintain ingestion, transformation, orchestration, and data quality systems that are reliable, observable, and secure.


What You’ll Do

Build Production-Grade Data Systems

  • Write clean, modular, well-tested Python code for data pipelines and platform services.
  • Use decorators, context managers, and unit tests to ensure correctness and maintainability.
  • Contribute to shared libraries and reusable components across the platform.


Design and Maintain Data Models

  • Implement relational data models aligned with medallion architectures (bronze/silver/gold).
  • Support schema evolution and backward-compatible changes.
  • Work with modern table formats such as Apache Iceberg.

Data Orchestration & Ingestion

  • Build and maintain data workflows using Dagster (preferred) or Airflow.
  • Manage sensors, schedules, and complex job dependencies.
  • Implement ingestion pipelines using Airbyte or similar ELT tools.

Transformation & Data Quality

  • Implement idempotent transformation logic using SQLMesh/Tobiko (preferred) or dbt.
  • Add data quality checks and validation gates using frameworks like Great Expectations.
  • Partner with upstream and downstream users to diagnose and resolve data issues.


Containerization & CI/CD

  • Build, debug, and optimize Docker images for local and production environments.
  • Contribute to CI/CD pipelines supporting automated testing and deployment.
  • Follow modern Git workflows including branching strategies, pull requests, and code reviews.

Infrastructure, Cloud & Security

  • Read and modify infrastructure-as-code using Terraform.
  • Work with AWS primitives (S3, Lambda, Glue, Fargate), with a focus on portability and migration toward open-source, cloud-agnostic alternatives.
  • Apply least-privilege and identity-based access concepts (OIDC/IAM).
  • Operate comfortably within regulated environments (HIPAA, FedRAMP).

Documentation & Collaboration

  • Document data flows, system architecture, and operational procedures clearly.
  • Collaborate closely with senior engineers, informaticists, and project stakeholders.
  • Participate in design reviews and contribute ideas for improving platform reliability and scalability.

What You’ll Bring
Required

  • 2–4 years of experience in Data Engineering or Backend Software Engineering.
  • Strong proficiency in Python and SQL.
  • Solid understanding of relational theory and data modeling.
  • Experience working with orchestration tools (Dagster, Airflow, or similar).
  • Familiarity with containerization and Docker-based workflows.
  • Experience working with version control, CI/CD, and collaborative development practices.
  • Ability to write clear technical documentation.

Nice to Have

  • Experience with Iceberg, Airbyte, Great Expectations, SQLMesh, or dbt.
  • Prior work on regulated data platforms (healthcare, government, finance).
  • Interest in data platform architecture and long-term system evolution.

 


Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact: [email protected]

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.

#IND

Salary Range
$125,000$150,000 USD

Top Skills

Airflow
Aws,S3,Lambda,Glue,Fargate,Sqlmesh,Dbt,Great Expectations
Dagster
Docker
Python
SQL
Terraform

Similar Jobs

6 Days Ago
In-Office or Remote
12 Locations
195K-258K Annually
Senior level
195K-258K Annually
Senior level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Design, build, and operate the core data warehouse, ingestion, orchestration, and cataloging platform. Develop batch and streaming pipelines, ensure data quality, governance, observability, and provide ML data platform capabilities. Lead architecture, improve platform reliability and performance, and collaborate with product, engineering, data science, security, and compliance teams.
Top Skills: Apache Flink,Google Cloud Dataflow,Bigtable,Cassandra
13 Days Ago
Remote
United States
152K-205K Annually
Mid level
152K-205K Annually
Mid level
Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Build and operate Dropbox's petabyte-scale data platform: enable reliable ingestion, storage, and processing, lead data lake modernization, support AI/ML workflows, integrate with product teams, and participate in on-call operations.
Top Skills: AirflowBigQueryC#Data LakeDatabricksGoHiveJavaKafkaLakehousePythonRedshiftSnowflakeSparkSparksqlSuperset
9 Hours Ago
Remote
USA
150K-215K Annually
Senior level
150K-215K Annually
Senior level
Artificial Intelligence • Machine Learning • Software • Defense
As a Backend Engineer, you will design and implement core infrastructure for Vannevar's platform, ensuring standards for data processing, security compliance, and performance. You will collaborate with various teams to enhance shared infrastructure and lead technical initiatives.
Top Skills: AWSAzureDockerGCPKubernetes

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account