DoubleVerify Logo

DoubleVerify

Staff Software Engineer - Data - Rockerbox

Reposted 22 Days Ago
Remote
Hiring Remotely in United States
128K-230K Annually
Senior level
Remote
Hiring Remotely in United States
128K-230K Annually
Senior level
As a Staff Data Engineer, you'll architect and build data pipelines, champion small data strategies, mentor teams, and ensure operational excellence in data workflows.
The summary above was generated by AI
Who We Are

We are looking for a Staff Software Engineer to shape the future of our data platform with a focus on small data at scale. While many companies over-index on heavyweight distributed systems, we believe in the power of efficient, local-first, columnar engines like DuckDB to process and analyze data quickly, reliably, and cost-effectively.

As a Staff Software Engineer, you will set the technical direction for how our teams ingest, transform, and serve data, bridging the gap between lightweight embedded tools and cloud-scale systems. You’ll be hands-on in building pipelines, while also mentoring engineers and setting best practices across the organization.

 What You’ll Do
  • Architect and Build Data Pipelines
    • Design and implement data processing workflows using DuckDB, Polars, and Arrow/Parquet.
    • Balance small-data local pipelines with cloud data warehouse backends (Snowflake etc).
  • Champion the Small Data Mindset
    • Advocate for efficient, vectorized, local-first approaches where appropriate.
    • Drive best practices for designing reproducible and testable data workflows.
  • Collaborate Cross-Functionally
    • Partner with data science, professional services, and product engineering teams to define semantic data layers.
    • Provide technical leadership in how data is versioned, validated, and surfaced for downstream use.
  • Operational Excellence
    • Establish standards for CI/CD, observability, and reliability in data pipelines.
    • Automate workflows and optimize data layout for performance and cost efficiency.
  • Mentor & Lead
    • Serve as a thought leader in the organization, guiding engineers on when to use lightweight tools vs. distributed platforms.
    • Mentor senior and mid-level data engineers to accelerate their growth.

Who You Are

  • Core Technical Skills

    • Deep expertise in SQL (window functions, CTEs, optimization).
    • Strong Python skills with data libraries.
    • Proficiency with DuckDB (extensions, parquet/iceberg integration, embedding in pipelines).
    • Hands-on with columnar formats (Parquet, Arrow, ORC) and schema evolution.
    • Expertise in Kubernetes and Helm
  • Infrastructure & Tools

    • Cloud storage experience (AWS S3, GCS).
    • Experience with semantic layer frameworks (CubeJS).
    • CI/CD tooling (GitHub Actions, Terraform, Docker/Kubernetes).
  • Leadership
    • Track record of leading architecture decisions and mentoring teams.
    • Ability to set standards for maintainability and developer experience.
Nice to Haves
  • Experience with serverless and embedded analytics (DuckDB WASM, in production).
  • Exposure to data versioning (Delta Lake, Iceberg, Hudi).
  • Knowledge of ML/LLM data prep workflows and vector database integrations.
  • Previous experience building hybrid stacks (local development + cloud warehouse production).
What Success Looks Like
  • Data pipelines that are fast, simple, and reproducible—running in seconds or minutes, not hours.
  • A team that defaults to the right level of tooling for the problem (small-data-first, scale-up only when necessary).
  • Clear semantic data definitions that power analytics, experimentation, and AI/ML initiatives.
  • Reduced infrastructure cost and complexity without sacrificing reliability.

The successful candidate’s starting salary will be determined based on a number of non-discriminating factors, including qualifications for the role, level, skills, experience, location, and balancing internal equity relative to peers at DV.
The estimated salary range for this role based on the qualifications set forth in the job description is between [$128,000 - $230,000]. This role will also be eligible for bonus/commission (as applicable), equity, and benefits.
The range above is for the expectations as laid out in the job description; however, we are often open to a wide variety of profiles, and recognize that the person we hire may be more or less experienced than this job description as posted.
Why You’ll Love Rockerbox: At Rockerbox, you’ll find a fast-paced, results-driven environment where your work has a direct impact on our growth and the success of our clients. Our iterative development process means you’ll see your contributions come to life quickly. You’ll join a supportive, light-hearted team that values collaboration and innovation. We are committed to professional growth and will actively support your development in both technical and business domains.

Not-so-fun fact: Research shows that while men apply to jobs when they meet an average of 60% of job criteria, women and other marginalized groups tend to only apply when they check every box. So if you think you have what it takes but you’re not sure that you check every box, apply anyway!


Top Skills

Arrow
Aws S3
Cubejs
Docker
Duckdb
Gcs
Github Actions
Helm
Kubernetes
Parquet
Polars
Python
Snowflake
SQL
Terraform

Similar Jobs

16 Days Ago
Remote
United States
Senior level
Senior level
Fintech • Insurance • Payments • Financial Services
Responsible for building infrastructure for data ingestion and availability, focusing on backend data processing and occasional frontend tooling. Drive architectural design and maintain high-quality processes.
Top Skills: Aws CdkDynamoDBGoNoSQLReactRedshiftTypescript
An Hour Ago
Easy Apply
Remote
United States
Easy Apply
60K-71K Annually
Mid level
60K-71K Annually
Mid level
Insurance
The Agency Onboarding Associate manages onboarding for new agency partners, ensuring they meet production goals in their first 90 days.
Top Skills: CRMGoogleMicrosoft
An Hour Ago
Remote or Hybrid
US
84K-117K Annually
Junior
84K-117K Annually
Junior
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
As a Jr Marketing Data Scientist, you'll support marketing analytics, develop predictive models, apply statistical methods, and automate insights using Python and machine learning techniques.
Top Skills: Generative Ai ToolsMatplotlibNumpyPandasPythonScikit-LearnSQL

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account