Sayari Logo

Sayari

Data Engineer (Remote, US)

Posted Yesterday
Remote
Hiring Remotely in United States
90K-120K Annually
Junior
Remote
Hiring Remotely in United States
90K-120K Annually
Junior
The Data Engineer will build and maintain scalable data pipelines, improve ETL processes, collaborate with AI/ML teams, and contribute to engineering culture.
The summary above was generated by AI
About Sayari: 

Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.7B+ primary-source records from 250+ jurisdictions forming the ground truth of global commerce. A Judgment Ontology, encoding over a decade of investigative tradecraft, and Superconductor, an agentic orchestration platform, deliver AI that reasons like an expert analyst, shows its work, and traces every finding to its source. Trusted by U.S. Customs and Border Protection, HM Revenue & Customs, and Fortune 500 enterprises, Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks. Headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.

POSITION DESCRIPTION

As a Data Engineer at Sayari, you will be the engine behind the world’s most comprehensive commercial world model. You will join a high-autonomy team responsible for building and scaling the complex orchestration systems that transform billions of primary-source records into actionable intelligence. This is a role for a "builder" who respects the complexity of large-scale ETL and graph databases and is "PhD-curious" about the future of AI-native data products and modern orchestration.


JOB RESPONSIBILITIES
  • Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow to support our core data acquisition and entity resolution engines.
  • Collaborate cross-functionally with AI/ML and Product teams to implement new features and AI-native products.
  • Proactively identify and resolve bottlenecks in our complex ETL processes, bringing a fresh perspective to refine and optimize our existing codebase.
  • Contribute to a robust engineering culture through rigorous code reviews, unit testing, and clear communication of design decisions.
  • Own the end-to-end delivery of roadmap tasks within two-week sprints, ensuring work meets high standards for quality, documentation, and performance.
  • Participate in roadmap planning and story refinement, eventually taking ownership of major epics that drive our long-term product defensibility.
SKILLS & EXPERIENCE

Required

  • Professional proficiency in Python and experience contributing to shared codebases using Git (branching, PRs, code reviews).
  • Demonstrated experience working with relational databases (PostgreSQL/BigQuery) and an interest in or familiarity with graph databases.
  • Familiarity with distributed computing (Spark) or a strong desire to master it.
  • Strong collaborative skills and the ability to work effectively in an Agile, sprint-based environment.
  • A "self-directed" orientation: ability to move tasks from "assigned" to "complete" with high autonomy and clear communication.

Preferred

  • Experience with Django, Scala, or Scrapy.
  • Hands-on experience with workflow orchestration tools like Airflow.
  • Experience or strong interest in LLM tuning, deployment, and AI engineering best practices.
  • Experience working with international or non-English datasets.
  • Prior experience working with high-scale, complex data pipelines.

The target base salary for this position is $90,000-$120,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.


Benefits: 
  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.
Pay Range
$90,000$120,000 USD

Similar Jobs

20 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
150K-190K Annually
Senior level
150K-190K Annually
Senior level
Marketing Tech • Real Estate • Software • PropTech • SEO
In this role, you'll build and scale high-throughput data pipelines, ensure data quality, and leverage AI for improved data operations. You'll also lead initiatives in a fast-paced environment focused on real estate data engineering.
Top Skills: AirflowAWSIcebergKafkaKubernetesNode.jsPostgresPydanticPysparkPythonSpark StreamingSqsTypescript
8 Days Ago
Remote
United States
156K-180K Annually
Senior level
156K-180K Annually
Senior level
Big Data • Real Estate • Software
The Senior Data Engineer will optimize data ingestion pipelines, manage PostgreSQL databases, design ETL processes, and ensure system reliability in a remote-first, collaborative environment.
Top Skills: AWSAws AuroraCi/CdCloudFormationPostgresRubyRuby On RailsTerraform
16 Days Ago
In-Office or Remote
135K-155K Annually
Senior level
135K-155K Annually
Senior level
Artificial Intelligence • Machine Learning • Database
The Data Engineer will design, build, and deploy scalable data solutions, optimize data flows, and implement ETL processes, leveraging various cloud platforms and technologies.
Top Skills: Azure DevopsCloudFormationDatabricksJavaJavaScriptLookerPower BIPythonSnowflakeSQLTableauTerraformTypescript

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account