We create technology with heart for the health of every person in the world.
About Buoy Health
Buoy builds a digital health tool that helps people – from the moment they get sick – start their health care on the right foot. Started by a team of doctors and computer scientists working at the Harvard Innovation Laboratory in Boston MA, Buoy was developed in direct response to the downward spiral we’ve all faced when we attempt to self-diagnose our symptoms online. Buoy leverages artificial intelligence – powered by advanced machine learning and proprietary granular data - to resemble an exchange you would have with your favorite doctor – to provide consumers with a real-time, accurate analysis of their symptoms and help them easily and quickly embark on the right path to getting better. Buoy is based in Boston and was founded in 2014.
About the role
The Data Engineer is responsible for deploying and maintaining data warehousing environments, managing ETL/ELT pipelines and job orchestration frameworks, and ensuring data quality. The charter of the data engineering and analytics team is to promote a data driven culture throughout the company, and to make high quality data broadly accessible and easy to work with for analysis, data science, and machine learning. The data engineer can expect to work closely with business analysts, data scientists, machine learning engineers, and developers to build the first generation of our data warehouse, ETL pipelines, and data models. We are currently building our data analytics ecosystem from the ground up, and our company and datasets are growing rapidly - so the data engineer will also have the opportunity to inform the design, implementation, and best practices or this system.
-- Build cloud-based data warehousing environments, data processing pipelines, and data models that support a variety of business needs
-- Support a variety of data processing pipelines, integrate new data sources into our data warehouse, and create jobs to load, transform, and QA vital datasets
-- Work with data scientists, analysts, and developers in the product development process to ensure that newly designed data models meet analytics requirements and follow best practices
-- Share your expertise on scalable data processing with analysts and data scientists to further our goal of being a truly data driven organization
-- 2-3 years of experience as a data engineer using data warehousing technologies like Snowflake, Amazon Redshift, S3, Athena, EMR, and Hadoop/Hive/Spark
-- Proficient in SQL including one or more relational databases like MySQL, MariaDB, Oracle, Postgres, or similar
-- 2-3 years experience with ETL and job scheduling or orchestration using tools like Airflow
-- 2-3 years programming experience and familiarity with Python, AWS, and Git
-- Excellent communication and ability to work on a growing team
Bonus points if you have
-- Experience with web-scale data or working with healthcare data in a HIPAA-compliant environment
-- Experience with Looker for business intelligence, and snowflake for data warehousing
-- Experience with AB Testing
-- Experience with machine learning
-- Stock Options
-- Unlimited PTO
-- Medical, Dental, Vision
-- 401k with matching
-- Dogs in the office!
-- Half day Fridays
**This role is located in the United States. Unfortunately, we are unable to support international applicants at this time.
Read Full Job Description