Lead end-to-end data engineering on GCP: design scalable data platforms and architectures (lake/lakehouse/warehouse), build and optimize BigQuery/Spark pipelines, ensure data quality/security, enable ML workflows with Data Science, mentor engineers, and own delivery and performance.
We are seeking a Senior Data Engineer (Lead) to drive and own end-to-end data engineering initiatives. This role will lead all data engineering efforts, working closely with Data Science and Analytics teams to design scalable data platforms, enable advanced analytics, and support machine learning use cases.
The ideal candidate will bring deep expertise in cloud data engineering (GCP), strong data modeling capabilities, and proven experience in leading enterprise-grade data solutions.
Responsibilities:
- Lead and manage end-to-end Data Engineering delivery across projects and initiatives
- Act as the primary technical owner for data pipelines, architecture, and platform design
- Mentor and guide a team of data engineers, ensuring adherence to best practices and coding standards
- Design, build, and optimize scalable data pipelines on GCP
- Define and implement modern data architectures (data lake, lakehouse, warehouse)
- Ensure high performance, reliability, and data quality across pipelines
- Partner closely with Data Science teams to enable ML/AI workflows
- Translate business and modeling requirements into optimized data structures
- Support feature engineering, model training, and deployment pipelines
- Design logical and physical data models for analytics and ML use cases
- Implement dimensional modeling (Star/Snowflake schemas) and data vault where applicable
- Optimize datasets for performance, scalability, and usability
- Build and manage solutions using GCP services such as BigQuery, Cloud Composer (Airflow), Cloud Storage, and Dataproc
- Ensure security, governance, and cost optimization on GCP
Required Qualifications and Skills:
- 8+ years of experience in data engineering, including leadership experience
- Strong expertise in the GCP ecosystem and its services
- Proficiency in SQL, Python, and/or Scala
- Hands-on experience with ETL frameworks and distributed processing
- Solid experience in dimensional modeling, data warehousing concepts, and data structures for ML/analytics
- Experience with Apache Spark and with real-time and batch processing frameworks
- Experience working with cross-functional teams (Data Science, Analytics, Business)
- Proven ability to lead, mentor, and drive delivery
- Strong ownership mindset with leadership capabilities
- Excellent problem-solving and architectural thinking
- Ability to operate in a fast-paced, collaborative environment
Preferred Qualifications:
- Experience in ML data pipelines / feature stores
- Knowledge of data governance, lineage, and quality frameworks
- Exposure to the healthcare/payor domain
- Certifications in GCP (Professional Data Engineer)
