Allata

Data Engineer (Databricks + Python + Azure)

Posted 15 Days Ago
Remote or Hybrid
Hiring Remotely in Missouri
Mid level
Design and develop scalable data pipelines using Databricks and Python in Azure, focusing on data optimization for analytics in healthcare.
Allata is a global consulting and technology services firm with offices in the US, India, and Argentina. We help organizations accelerate growth, drive innovation, and solve complex challenges by combining strategy, design, and advanced technology. Our expertise covers defining business vision, optimizing processes, and creating engaging digital experiences. We architect and modernize secure, scalable solutions using cloud platforms and top engineering practices.

Allata also empowers clients to unlock data value through analytics and visualization and leverages artificial intelligence to automate processes and enhance decision-making. Our agile, cross-functional teams work closely with clients, either integrating with their teams or providing independent guidance, to deliver measurable results and build lasting partnerships.

We are seeking a skilled Data Engineer to join our team and contribute to data-driven initiatives within the healthcare industry. This role focuses on designing, building, and optimizing scalable data solutions that support analytics, reporting, and advanced data use cases in regulated environments.
 

Role & Responsibilities:

  • Design, develop, and maintain scalable data pipelines using Databricks (PySpark) and Python.
  • Build and optimize ETL/ELT processes within Azure cloud environments.
  • Implement data models following modern Data Lakehouse principles (e.g., Medallion architecture).
  • Ensure data quality, consistency, and performance across ingestion, staging, and curated layers.
  • Collaborate with data architects, analysts, and business stakeholders to translate healthcare data requirements into technical solutions.
  • Develop reusable data transformation logic and modular processing components.
  • Support deployment processes following CI/CD and DevOps best practices.
  • Monitor and optimize data workflows for performance, scalability, and reliability.
  • Contribute to data governance, security, and compliance practices relevant to healthcare environments.

Hard Skills - Must have:

  • Current knowledge of and experience using modern data tools (Databricks, Fivetran, Data Fabric, and others); core experience with data architecture, data integrations, data warehousing, and ETL/ELT processes.
  • Applied experience developing and deploying custom .whl packages and/or in-session notebook scripts for custom execution across parallel executor and worker nodes.
  • Applied experience in SQL, stored procedures, and PySpark, based on area of data platform specialization.
  • Strong knowledge of cloud and hybrid relational database systems, such as MS SQL Server, PostgreSQL, Oracle, Azure SQL, AWS RDS, Aurora, or a comparable engine.
  • Strong experience with batch and streaming data processing techniques and file compaction strategies.

Hard Skills - Nice to have/It's a plus:

  • Strong hands-on experience with Databricks in Azure environments.
  • Advanced proficiency in Python and PySpark for distributed data processing.
  • Experience building and optimizing data pipelines in Azure (Azure Data Factory, Azure SQL, Data Lake Storage, etc.).
  • Solid understanding of data warehousing, data lakehouse concepts, and ETL/ELT frameworks.
  • Experience working with relational databases such as SQL Server, PostgreSQL, Oracle, or similar.
  • Knowledge of batch and streaming data processing patterns.
  • Experience working with large, complex datasets in cloud-based distributed environments.

Soft Skills / Business Specific Skills:

  • Strong analytical and problem-solving skills.
  • Ability to work effectively in cross-functional and distributed teams.
  • Clear communication skills, with the ability to explain technical concepts to non-technical stakeholders.
  • Proactive mindset with a strong sense of ownership.
  • Commitment to delivering high-quality, reliable data solutions.

At Allata, we value differences.

Allata is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Allata makes employment decisions without regard to race, color, creed, religion, age, ancestry, national origin, veteran status, sex, sexual orientation, gender, gender identity, gender expression, marital status, disability or any other legally protected category.

This policy applies to all terms and conditions of employment, including but not limited to, recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.

Top Skills

AWS RDS
Azure
Azure Data Factory
Azure SQL
Data Lake Storage
Databricks
MS SQL Server
Oracle
Postgres
PySpark
Python
SQL


