LifeStance Health Logo

LifeStance Health

Senior Data Engineer

Reposted 21 Days Ago
Remote
Hiring Remotely in USA
110K-135K Annually
Senior level
Remote
Hiring Remotely in USA
110K-135K Annually
Senior level
The Senior Data Engineer develops and optimizes AWS-based data solutions, focuses on data integrity and governance, and provides production support for data infrastructure.
The summary above was generated by AI

At LifeStance Health, we strive to help individuals, families, and communities with their mental health needs. Everywhere. Every day. It’s a lofty goal; we know. But we make it happen with the best team in mental healthcare.

Thank you for taking the time to explore a career with us. As the fastest growing mental health practice group in the country, now is the perfect time to join our team!

LifeStance Health Values

  • Belonging: We cultivate a space where everyone can show up as their authentic self.

  • Empathy: We seek out diverse perspectives and listen to learn without judgment.

  • Courage: We are all accountable for doing the right thing - even when it's hard - because we know it's worth it.

  • One Team: We realize our full potential when we work together towards our shared purpose.

Benefits
As a full-time employee of LifeStance Health, the following benefits are offered: medical, dental, vision, AD&D, short and long-term disability, and life insurance. Additional benefits include a 401k retirement savings with employer match, paid parental leave, paid time off, holiday pay and an Employee Assistance Program.

Role Overview:

The Senior Data Engineer will play a key role in building, maintaining, and optimizing cloud-based data solutions that support data-driven decision-making at LifeStance. This role requires expertise in AWS data services (Glue, Redshift, Lambda, S3, Kinesis), SQL, Python, Apache Spark, and PostgreSQL, with a focus on scalability, performance optimization, and real-time processing.

You will work closely with data analysts, business intelligence teams, and software engineers to ensure data integrity, governance, and efficiency, while also serving as a production support lead for troubleshooting and maintaining high availability of our data infrastructure.

Compensation: $110,000 - 135,000/annually, plus annual bonus potential

Challenges & Opportunities:

  • Scalable Data Infrastructure – Design and optimize AWS-based data pipelines for structured and unstructured data processing.

  • Real-time & Batch Processing – Develop robust ETL/ELT pipelines leveraging Apache Spark, AWS Glue, and Kinesis.

  • Production Support & Issue Resolution – Lead real-time monitoring, troubleshooting, and optimization of data pipelines.

  • Data Warehousing & Performance Optimization – Optimize PostgreSQL, Redshift, and S3-based data lakes for high performance.

  • Cost-efficient Data Engineering – Improve cost-efficiency and scalability of cloud data infrastructure.

  • Data Governance & Compliance – Implement security, compliance, and governance frameworks aligned with HIPAA, GDPR, and SOC2.

Responsibilities:

Production Support & Issue Resolution

  • Provide primary data engineer for real-time production support, ensuring minimal downtime of mission-critical data pipelines. Rotate the role and on call requirements.

  • Monitor, troubleshoot, and optimize data pipeline failures, query performance bottlenecks, and data discrepancies using AWS CloudWatch, Redshift, and PostgreSQL.

  • Automate Root Cause Analysis (RCA) reporting, reducing resolution time by 30%+ through proactive alerting and system monitoring.

  • Implement best practices in IAM roles, encryption, and regulatory compliance (HIPAA, GDPR) to ensure data security and governance.

Data Engineering & Platform Optimization

  • Design, develop, and maintain scalable ETL pipelines using AWS Glue, Lambda, Redshift, S3, and PostgreSQL.

  • Optimize query performance through partitioning, indexing, query tuning, and materialized views, achieving significant performance improvements.

  • Implement CI/CD pipelines for automated deployments using AWS CodePipeline, Terraform, and CloudFormation to improve system stability and deployment efficiency.

  • Support streaming data pipelines using Kafka, Kinesis, or Spark Streaming for real-time data ingestion and processing.

  • Assist in scaling, performance tuning, and automation to optimize data platform efficiency.

Data Analytics & Business Insights

  • Collaborate with business intelligence teams and analysts to develop high-quality data models that support analytics and decision-making.

  • Build interactive dashboards and reports using Power BI, Tableau, or Looker to enable real-time insights for stakeholders.

  • Automate data validation, cleansing, and quality checks, ensuring 99.9%+ data accuracy across critical business functions.

Skills & Experience:

  • 5+ years of experience in data engineering, cloud-based data platforms, and big data processing.

  • Bachelor’s degree in Computer Science, Data Engineering, or a related field.

  • Expertise in AWS services, including Glue, Lambda, Redshift, S3, CloudFormation, IAM, and CloudWatch.

  • Strong SQL & Python experience for data transformations, query optimization, and automation.

  • Deep knowledge of PostgreSQL, including performance tuning, indexing, query optimization, and schema design.

  • Experience with Apache Spark for big data processing and real-time analytics.

  • Expertise in ETL frameworks and data modeling (Star Schema, Snowflake Schema, OLAP/OLTP optimization).

  • Hands-on experience with Infrastructure as Code (IaC) tools like Terraform, CloudFormation.

  • CI/CD expertise with AWS CodePipeline, Jenkins, or GitHub Actions for data pipeline deployments.

  • Data security & governance expertise including IAM roles, encryption, and HIPAA/GDPR compliance.

  • Proven track record in troubleshooting production issues, conducting root cause analysis (RCA), and improving system performance.

  • Experience working in cross-functional teams with business analysts, data scientists, and DevOps engineers.

Preferred Qualifications:

  • Master’s degree in Computer Science, Data Engineering, or a related field preferred but not required.

  • Experience with streaming data frameworks (Kafka, Kinesis, Spark Streaming).

  • Familiarity with advanced analytics frameworks and ML pipelines (MLflow, SageMaker, Feature Stores).

  • Certifications: AWS Certified Data Analytics, Google Cloud Professional Data Engineer, or Databricks Certified Data Engineer Associate.

Additional Requirements:

  • Must be legally authorized to be employed in the United States.

  • LifeStance is an EEO/Affirmative Action Employer. We do not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability, or any other legally protected status.

  • Demonstrates awareness, inclusivity, sensitivity, humility, and experience working with individuals from diverse ethnic backgrounds, socioeconomic statuses, sexual orientations, gender identities, and other various aspects of culture.

LifeStance is an equal opportunity employer. We celebrate diversity and are fully committed to creating an inclusive work environment for all our employees. Learn more about Diversity, Equity and Inclusion at LifeStance.

Top Skills

Spark
Aws Codepipeline
Aws Glue
CloudFormation
Kinesis
Lambda
Postgres
Power BI
Python
Redshift
S3
SQL
Tableau
Terraform

Similar Jobs

An Hour Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
130K-165K Annually
Senior level
130K-165K Annually
Senior level
Artificial Intelligence • Insurance • Machine Learning • Software • Analytics
Lead design and implementation of scalable, HIPAA-compliant data pipelines and platforms for healthcare ML. Build ETL, orchestration, and tooling for processing EHR, claims, pharmacy, and bioinformatics data; collaborate with data scientists to produce modeling-ready datasets and ensure data quality, reliability, and operational excellence.
Top Skills: Python,Sql,Apache Spark (Pyspark),Databricks,Snowflake,Airflow,Dagster,Prefect,Terraform,Docker,Kubernetes,Aws,Dbt,Ci/Cd
Yesterday
In-Office or Remote
Raleigh, NC, USA
Senior level
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design and build enterprise data models, ETL/ELT pipelines and high-performance PySpark jobs on Azure/Databricks. Enable ML/AI with MLflow and model serving, secure restricted-data environments using Unity Catalog and access controls, deploy AI agents, and evaluate emerging data and AI trends.
Top Skills: Azure,Databricks,Databricks Workflows,Azure Data Factory,Python,Pyspark,Sql,Databricks Sql,Delta Lake,Unity Catalog,Mlflow,Model Serving,Github Copilot,Databricks Genai,Mlops
8 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
165K-175K Annually
Senior level
165K-175K Annually
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Data Engineer will design, build, and operate data pipelines for Zeta's AdTech platform, focusing on high-scale data processing and analytics-ready datasets.
Top Skills: AirflowAthenaAWSCassandraDagsterDeltaDynamoDBEmrFlinkGlueGoHudiIcebergJavaKafkaKinesisMySQLParquetPostgresPythonRedisRedshiftS3ScalaSparkSQLStep Functions

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account