At Socure, we’re on a mission—to verify 100% of good identities in real time and eliminate identity fraud from the internet.
Using predictive analytics and advanced machine learning trained on billions of signals to power RiskOS™, Socure has created the most accurate identity verification and fraud prevention platform in the world. Trusted by thousands of leading organizations—from top banks and fintechs to government agencies—we solve real, high-impact problems at scale. Come join us!
Socure is seeking a talented and motivated Data Engineer to join our Identity Graph team. In this role, you’ll help design, build, and optimize the core data services that power Socure’s identity verification and fraud detection products. Your work will directly impact the performance, scalability, and reliability of the platform that underpins our industry-leading identity solutions.
You’ll collaborate cross-functionally to deliver high-quality, secure, and scalable data pipelines that support machine learning, analytics, and real-time inference. This is a great opportunity for an engineer who enjoys solving complex problems and wants to work at the intersection of big data, identity, and fraud prevention.
What You’ll DoData Platform EngineeringDesign and build scalable, secure data pipelines for both batch and real-time processing.
Support data systems that power machine learning workflows, model inference, and analytics use cases.
Write clean, production-ready code in Java, Scala, or Python.
Work with tools like Apache Spark, Kafka, Flink, Airflow, and AWS-native services including EMR.
Apply graph data modeling principles using technologies like Neo4j or Amazon Neptune.
Optimize data architecture for performance, cost-efficiency, and ease of maintenance.
Partner with Data Science, Product, and Security teams to translate business needs into data-driven solutions.
Participate in system design and architecture reviews.
Contribute to code reviews and promote best practices in software engineering and data infrastructure.
Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related technical field.
3–5+ years of experience building and supporting complex data systems and applications in cloud environments.
Strong programming skills in Java, Scala, or Python.
Deep knowledge of distributed data processing frameworks such as Spark, Kafka, or Flink.
Experience working with cloud services (preferably AWS) and containerized environments (Docker, Kubernetes).
Solid understanding of software design patterns, data structures, and DevOps/CI-CD best practices.
Hands-on experience with Airflow or other orchestration tools for managing data pipelines.
Familiarity with building ML data pipelines using platforms such as Databricks or SageMaker.
Experience in developing and utilizing scalable, high-performance APIs.
Bonus points for experience with graph databases and graph algorithms.
Socure is an equal opportunity employer and values diversity of all kinds at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Follow Us!
YouTube | LinkedIn | X (Twitter) | Facebook
Top Skills
Similar Jobs
What you need to know about the Boston Tech Scene
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories