Rula Logo

Rula

Staff Data Engineer - RCM (Remote)

Posted Yesterday
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
The Staff Data Engineer role involves developing and maintaining near real-time data pipelines using various technologies to enhance operational reporting and patient outcomes. Responsibilities include designing data architecture, ensuring data quality, and collaborating across teams.
The summary above was generated by AI

We believe that mental health is just as important as physical health. We recognize that mental health issues can be complex and multifaceted, and we are dedicated to treating the whole person, not just the symptoms.

We aim to create a world where mental health is no longer stigmatized or marginalized, but rather is embraced as an integral part of one's overall well-being. 

We believe that by providing quality care that is both evidence-based and compassionate, we can empower individuals to take charge of their mental health and achieve their full potential. We are passionate about making a positive impact on the lives of those struggling with mental health issues and we strive to be a force for positive change in the field of mental healthcare.

About the Role

At Rula, our mission is to make mental health care more accessible and effective for those who need it. As a Staff Data Engineer for Operational Reporting, you will oversee the design and implementation of a greenfield near real-time data platform, starting with micro-batching pipelines using Kafka to deliver critical operational reports and evolving into a scalable Apache Flink architecture for sub-second analytics. Your work will power real-time dashboards and insights that enable our providers, leadership, and operational teams to make data-driven decisions, ultimately improving patient outcomes.

You will join our collaborative data team, nested within the broader engineering organization, working closely with business analysts, product managers, and data experts to transform raw event streams into reliable, actionable reporting data. Your daily responsibilities—building fault-tolerant pipelines, ensuring data accuracy, and optimizing for low-latency delivery—will lay the foundation for Rula’s near real-time data capabilities. This role offers the opportunity to own a strategic transition from micro-batching to a Flink-based streaming architecture, driving innovation in how we harness data to support our mission. If you’re passionate about turning complex data into impactful insights that advance mental health care, this is your chance to make a meaningful difference.

Required Qualifications

  • Data Pipeline Development (8+ yrs). Experience designing and maintaining scalable ETL/ELT pipelines for operational reporting using Kafka, Glue, dbt, Dagster, and Airflow. Leveraging Python and SQL for data transformation and quality checks, and working with Flink and Spark Streaming to build low-latency, near real-time pipelines.

  • Cloud Infrastructure & Data Warehousing (8+ yrs overall, 4+ yrs in AWS). Proficiency building and optimizing data pipelines using AWS services such as S3, Redshift, Glue, IAM, Kinesis, and EMR. Experience across GCP (BigQuery, Dataflow) and Azure (Synapse, Data Factory). Optimizing data warehouses (Redshift, Snowflake, BigQuery) and managing Data Lakes (S3, Delta Lake) for scalable, low-latency analytics. Ensuring cost efficiency, scalability, and compliance (CPRA, HIPAA) while supporting a migration toward Flink-based near real-time architecture.

  • Data Quality & Governance (8+ Years). Experience implementing scalable data validation, quality checks (e.g., deduplication, consistency), and error-handling mechanisms tailored for operational reporting pipelines, ensuring high-fidelity data for real-time dashboards and analytics. Proficiency in designing and enforcing data governance practices, including metadata management, lineage tracking for auditable reporting, and compliance with regulations like CPRA or HIPAA in Data Lake environments (e.g., AWS S3, Delta Lake).

  • Performance Optimization (3+ Years). Experience optimizing data pipelines, queries, and large-scale datasets for efficiency and scalability in operational reporting systems, with a focus on achieving low-latency delivery. Proficiency in tuning high-throughput streaming systems, including optimizing resource usage and implementing best practices for partitioning, caching, and indexing.

  • Security & Compliance (3+ Years). Experience implementing data security measures, including encryption, role-based access control (RBAC), and data masking, to protect sensitive data in operational reporting pipelines and Data Lakes (e.g., AWS S3, Delta Lake). Strong understanding of compliance standards such as HIPAA and CPRA, with hands-on expertise in applying these standards to streaming systems like Apache Kafka and Apache Flink. Demonstrated ability to ensure auditability and security in data workflows, supporting reliable and compliant near real-time analytics during the transition from micro-batching to a Flink-based architecture.

  • Collaboration & Communication (5+ Years). Strong ability to work cross-functionally with business analysts, product managers, leadership, and other stakeholders to define and deliver operational reporting requirements. Exceptional communication skills to translate complex technical concepts into clear, actionable insights for non-technical audiences. Proven adaptability to thrive in a fast-paced startup environment, collaborating effectively to support the rapid development and evolution of a near real-time data platform while aligning with Rula’s mission to improve mental health care outcomes.

Preferred Qualifications

While having the preferred qualifications enhances your candidacy, having all of them is not mandatory. We encourage all interested applicants to apply, even those who may not meet every preferred requirement.

  • Hands-on experience with AWS tools like S3, Glue, EMR, SageMaker, and Lambda for building scalable ETL/ELT pipelines optimized for ML/LLM training, including feature engineering, data versioning, and handling large-scale unstructured data

  • Demonstrated ability to maintain data integrity and accuracy in streaming systems like Apache Kafka and Apache Flink, supporting reliable operational insights during the transition from micro-batching to a near real-time architecture.

  • Familiarity with infrastructure as code (IaC) tools like Terraform or CloudFormation for managing cloud resources.

  • Experience implementing and maintaining CI/CD pipelines for data workflows.

  • Demonstrated ability to enhance pipeline performance to support near real-time analytics while maintaining cost efficiency and reliability during the transition from micro-batching to a streaming architecture.

  • Strong ability to partner with data scientists and ML engineers to design efficient pipelines, using orchestration tools (e.g., Airflow, Dagster) for incremental loading and cost optimization, while monitoring performance metrics like latency and resource utilization in AWS environments.

We're serious about your well-being! As part of our team, full-time employees receive:

  • 100% remote work environment (US-based only): Working hours to support a healthy work-life balance, ensuring you can meet both professional and personal commitments

  • Attractive pay and benefits: Full transparency of pay ranges regardless of where you live in the United States

  • Comprehensive health benefits: Medical, dental, vision, life, disability, and FSA/HSA

  • 401(k) plan access: Start saving for your future

  • Generous time-off policies: Including 2 company-wide shutdown weeks each year for self-care (for most employees)

  • Paid parental leave: Available for all parents, including birthing, non-birthing, adopting, and fostering

  • Employee Assistance Program (EAP): Support for your mental and physical health

  • New hire home office stipend: Set up your workspace for success

  • Quarterly department stipend: Fund team-building activities or in-person gatherings

  • Wellness events and lunch & learns: Explore a variety of engaging topics

  • Community and employee resource groups: Participate in groups that celebrate employee identity and lived experiences, fostering a sense of community and belonging for all

Our team

We believe that diversity, equity, and inclusion are fundamental to our mission of making mental healthcare work for everyone.  We are dedicated to having a culture of inclusion that will support our employees in feeling safe, seen, heard, and valued.

Top Skills

Airflow
AWS
Azure
BigQuery
Dagster
Data Factory
Dataflow
Dbt
Delta Lake
Emr
Flink
GCP
Glue
Iam
Kafka
Kinesis
Python
Redshift
S3
Snowflake
Spark
SQL
Synapse

Similar Jobs at Rula

Yesterday
Remote
United States
Expert/Leader
Expert/Leader
Healthtech • Other • Social Impact • Software • Telehealth
As a Principal Software Engineer, you'll lead architectural strategies, mentor engineers, prototype services, and ensure alignment with business goals in a remote environment.
Top Skills: Backend TechnologiesCloud-Based TechnologiesService-Oriented Architecture
Yesterday
Remote
United States
Senior level
Senior level
Healthtech • Other • Social Impact • Software • Telehealth
The Sr. Analytics Engineer designs data models for reporting, collaborates across teams, and optimizes cloud-based data warehouses, particularly in healthcare settings.
Top Skills: DbtFlink SqlHexLookerRedshift
8 Days Ago
Remote
United States
Senior level
Senior level
Healthtech • Other • Social Impact • Software • Telehealth
Lead forecasting, budgeting, and strategic financial planning for a fast-growing company, collaborating with executives to enhance decision-making.
Top Skills: Budgeting ProcessesFinancial ModelingFinancial ProcessesForecastingSystemsTools

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account