Sanctuary Computer

Senior Data Engineer

In-Office or Remote
Hiring Remotely in New York, NY
150K-200K Annually
Senior level
The Senior Data Engineer will design and maintain data pipelines, ensuring scalability and reliability while integrating diverse data sources and optimizing workflows.
The summary above was generated by AI

We are recruiting a Sr. Data Engineer for a client in the AI / health & wellness space.

Original Job Post link
https://www.notion.so/garden3d/Senior-Data-Engineer-2e7131fea2c7800094e4d9340a5df499?source=copy_link

About garden3d

We are a worker-owned creative collective, innovating on everything from brands and IRL communities to IoT devices and cross-platform apps. We share profit, open source everything, spin out new businesses, and invest in exciting ideas through financial and/or in-kind contributions.

Our client roster includes Google, Stripe, Figma, Hinge, Black Socialists in America, ACLU, Pratt, Parsons, Mozilla, The Nobel Prize, MIT, Gnosis, Etsy & Gagosian. We’re the software team behind innovative products like The Light Phone & Mill, and a global, decentralized community space collective called Index Space.

We think of garden3d as a collective for creative people, prioritizing a happy, talented, and diverse studio culture. We work on projects that bring value to our world, and we balance deep care for the work we do with a genuine curiosity about life outside of our jobs.

About the client

Our client is an early-stage AI startup based in NYC (but open to remote team members). The founders have experience building and scaling successful ventures, including a 9-figure exit.

Who we’re looking for:

We’re looking for a Senior Data Engineer with deep expertise in designing and owning data pipelines, workflow orchestration, and complex data integrations. You’ll play a key role in evolving our data ingestion architecture, from an existing in-house, code-defined workflow system backed by queues, to a more scalable and observable orchestration layer using Prefect.

In this role, you’ll lead the development and optimization of pipelines handling both structured and unstructured data from a wide range of sources, including web crawls and scrapers. You’ll be expected to make architectural decisions, ensure reliability and scalability, and establish best practices for workflow design, monitoring, and performance as our data platform grows.

In this role, you’ll work across a variety of initiatives to find cost-effective, high-quality, pragmatic solutions to complex problems. Responsibilities will include:

  • Monitoring and maintaining data pipelines, troubleshooting new errors, and addressing format drift
  • Extracting and enriching additional data elements from diverse sources
  • Reprocessing and validating large datasets in batch workflows
  • Designing and integrating new data sources into existing pipelines
  • Aligning and integrating extracted data with the core application data model to ensure consistency and usability
  • Participating in code reviews, providing constructive feedback to teammates and ensuring adherence to best practices
  • Contributing to project success by keeping a close eye on team velocity, project scope, budget, and timeline
  • Negotiating with clients to align project scope with budget and timeline, if needed

Who you are

The person we’re looking for is happy, relaxed and easy to get along with. They’re flexible on anything except concessions that would lower their usually outstanding work quality. They work “smart” by carefully managing their workflow and intelligently staggering features that have dependencies. They prefer deep work but are OK coming up to the surface now and then for top-level / strategic conversations.

We believe people with backgrounds or interests in design, art, music, food or fashion tend to have a well-rounded sense of design & quality, so a variety of hobbies or side projects is a big nice-to-have!

Must Have Competencies:

  • Senior-level Python expertise
  • Experience with data/workflow orchestration tools (e.g., Prefect, Airflow, Dagster)
  • A thorough understanding of ETL & data transformation for ingestion into industry-standard LLMs (OpenAI, Claude, etc.)
  • Familiarity with Large Language Models (LLMs)
  • Skilled in interfacing with APIs (OpenAI, Google Gemini/Vertex, etc.) using wrapper libraries such as Instructor, LiteLLM, etc.
  • Practical experience in prompt engineering
  • Ability to work with structured outputs and potentially tool calling
  • 5+ years of general experience in backend (Ruby on Rails, Elixir Phoenix, Python Django, or Node Express) and/or native app development (React Native, Flutter, Android, AOSP, Kotlin/Java)

Nice to Have Competencies:

We’re always pitching for new and exciting technology niches. Some of the areas below are relevant to us!

  • Experience with Google Cloud Platform (GCP), particularly Cloud Run and Cloud Tasks
  • Knowledge of search technologies, including embeddings and vector databases for semantic search, as well as keyword-based search (BM25)
  • Familiarity with PySpark for batch data processing
  • Experience working with LLMs, Vector Databases, and other generalist AI-enabled application patterns
  • Client-facing experience: working directly with customers to gather requirements and provide technical solutions
  • Product management experience: defining product roadmaps and collaborating closely with stakeholders
  • Engineering management experience: leading teams, setting technical direction, and mentoring developers
  • Recent experience working in a startup environment
  • NYC-based preferred for collaboration, but not a strict requirement.

Compensation

The pay scale ranges from $125 to $175 p/hr, or $150-200k/year, based on experience.

In addition to cash compensation, equity may be offered for candidates with the right level of experience, commitment, and long-term alignment.

How we interview:

Our interview process starts with a call where you get to meet a few members of our team. From there, we’ll ask appropriate candidates to take part in a technical exercise that helps illustrate skill level and comfort.

Direct application link here:
https://garden3d.notion.site/1f1131fea2c78095922ec7e09bd96101
(Tell us a bit about your interest in the role and share your information by filling out the questions.)

Quick tip! Adding a Loom recording to your profile in our form to showcase your skillset can really make your application stand out!
