Axcelis Technologies

Data Infrastructure & ML Engineer (Hybrid Role)

Posted Yesterday
In-Office
Beverly, MA
122K-183K Annually
Senior level
Design and build scalable end-to-end ETL/ELT data pipelines, process log and semi-structured data with Python and dataframes, and architect scalable database schemas (partitioning/sharding). Ensure data traceability, validation, monitoring, and enable datasets for machine learning model training and deployment while collaborating with data scientists and engineering teams.
The summary above was generated by AI

JOB DESCRIPTION

Role Summary

We are seeking a Senior Data Infrastructure & Machine Learning Engineer to design and implement scalable data systems and pipelines that support advanced analytics and machine learning workflows.

This is a hybrid role where the primary focus is on data pipeline engineering and Python-based data processing, supported by strong database design and management expertise.

Role Focus (Approximate Split)

  • Data Pipeline Engineering & Data Flow (Critical): ~50%
  • Python & Machine Learning Data Processing: ~30%
  • Database Design & Management: ~20%

Key Responsibilities

1. Data Pipeline Engineering (Primary Responsibility)

  • Design and build end-to-end data pipelines (ETL/ELT) for ingesting, processing, and transforming data.
  • Handle multiple data sources including:
    • Tool-generated logs (e.g., AT log files)
    • JSON and semi-structured data
  • Ensure full data traceability, enabling backward tracking of all data points.
  • Implement validation, monitoring, and error handling to ensure data quality and reliability.

2. Database Design & Data Architecture

  • Design and manage scalable database schemas.
  • Support both single-node and distributed database environments.
  • Implement tablespaces, partitioning, and sharding strategies to ensure performance and scalability.
  • Optimize queries and maintain high performance for large-scale datasets.

3. Python-Based Data Processing & Analytics

  • Develop data processing workflows using Python.
  • Work extensively with dataframes for transformation and analysis.
  • Utilize libraries such as:
    • Pandas, NumPy for data manipulation
    • Plotly (or similar) for visualization and exploratory analysis
  • Automate data workflows and integrate them into pipelines.

4. Machine Learning Data Enablement

  • Prepare and transform datasets for machine learning models.
  • Collaborate with data scientists and engineers to support model training and deployment workflows.
  • Enable scalable data foundations for AI/ML integration into production systems.

Required Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, with 5+ years of experience.
  • Strong experience in database design and SQL-based systems.
  • Hands-on experience with distributed systems, partitioning, and sharding.
  • Proven experience building data pipelines (ETL/ELT).
  • Strong proficiency in Python for data processing.
  • Experience working with log-based and semi-structured data (e.g., JSON).
  • Understanding of data traceability, validation, and governance.

Preferred Qualifications

  • Experience with time-series or log analytics systems.
  • Exposure to real-time/streaming architectures (e.g., Kafka).
  • Experience with cloud platforms (Azure, AWS, or GCP).
  • Familiarity with machine learning workflows and lifecycle.
  • Domain experience in semiconductor or high-throughput systems (nice to have).

Key Competencies

  • Strong problem-solving and analytical skills.
  • Ability to design production-grade, scalable systems.
  • Focus on data integrity, performance, and reliability.
  • Effective collaboration across engineering and data teams.
  • Clear communication and documentation.

EQUAL OPPORTUNITY STATEMENT

It is the policy of Axcelis to provide equal opportunity in all areas of employment for all persons free from discrimination based on race, sex, religion, age, color, national origin, disability status, medical condition (including pregnancy), veteran status, sexual orientation, marital status, or any other characteristic protected by federal, state or local law. Axcelis will provide reasonable accommodation necessary to enable a disabled candidate or employee to perform the essential functions of the position, unless the accommodation would create an undue hardship for the Company.

U.S. BASE SALARY RANGE

$122,133.07 - $183,199.61

This base salary range reflects the typical compensation for this role across U.S. locations.

Our salary ranges are determined by role and level; individual pay is determined based on multiple factors, including job-related skills, experience, relevant education or training, work location, and internal equity. The range provides the opportunity for growth and progression as you develop within the role.

Base pay is one part of our U.S. total compensation package, which includes eligibility for the Axcelis Team Incentive bonus plan and a comprehensive benefits package (for regular employees working 20+ hours a week).
