At Medal, we’re redefining how people capture and share gameplay experiences. Every day, our platform ingests gameplay video that is raw, unfiltered, and packed with insights. We're looking for a seasoned Data Engineer to take full ownership of our Clips/ML data infrastructure, building the next generation of scalable, real-time pipelines that power everything from user-facing discovery to machine learning research. You'll lead the architecture, operations, and performance of data systems that sit at the heart of our product, influencing everything from content indexing to model training.
If you're passionate about building petabyte-scale video pipelines, love working on low-latency systems, and are excited to help define the future of real-time gaming insights, we want to hear from you.
You WillArchitect and operate petabyte-scale ingestion pipelines
Design automated QA guard-rails (schema validation, anomaly detection, deduplication)
Build high-performance ETL and feature-extraction jobs to process and index hundreds of millions of clips into columnar/video-native formats
Own the end-to-end data ingestion stack (desktop, web & mobile clients)
Establish real-time monitoring, lineage, and “five-nines” SLAs, driving continuous improvement across storage, compute, and network layers
Partner with research and product to curate high-signal data slices, data-health metrics, and accelerate model experimentation
Champion security, privacy, and governance: implement robust RBAC, audit trails, and compliant retention policies for sensitive gameplay footage and user inputs
Mentor and uplevel engineers (including internal Medal platform talent), fostering a culture of craftsmanship, documentation, and ruthless focus on data excellence
5+ years of experience in data engineering, backend systems, or related roles. Experience with video data or ML infrastructure is a plus.
Deep knowledge of ETL/ELT pipelines, distributed systems, and streaming data architectures (e.g., Kafka, Spark, Flink, etc.)
Strong proficiency with Python, Java, Go, or similar languages used in data-intensive environments
Experience with cloud infrastructure (e.g., AWS, GCP), provisioning tools (e.g. terraform, pulumi) and modern data stack tools (e.g., dbt, Airflow, Parquet, Arrow)
Track record of designing systems with extreme scale and performance requirements
Experience consolidating data from diverse sources into unified data models
Deep understanding of data QA methodologies, anomaly detection, and automated testing in production systems
Passion for mentorship and team development; able to upskill engineers and advocate for engineering excellence
A bias toward ownership, urgency, and a desire to build systems that just work, even at scale
Work on cutting-edge tech and help shape the future of gaming
Passionate team that values ownership and innovation
Competitive salary, equity options, comprehensive health insurance, 401k
Top Skills
Similar Jobs
What you need to know about the Boston Tech Scene
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories



