This is a remote position.
We are looking for a Data Engineer with strong experience in Databricks, PySpark, and modern Data Warehouse systems. The ideal candidate can design, build, and optimize scalable data pipelines and work closely with analytics, product, and engineering teams.
Requirements
- Strong hands-on skills in Databricks, PySpark, and SQL
- Experience with data warehouse concepts, ETL frameworks, and batch/streaming pipelines
- Solid understanding of Delta Lake and Lakehouse architecture
- Experience with at least one cloud platform (Azure preferred)
- Experience with workflow orchestration tools (Airflow, ADF, Prefect, etc.)