Protege

Data Annotation Associate

Posted 2 Days Ago
Remote
Hiring Remotely in USA
Entry level
The Data Annotation Associate prepares healthcare documents for AI training by de-identifying PDFs, ensuring accuracy, consistency, and adherence to privacy guidelines.
The summary above was generated by AI

Company Overview

We are building Protege to solve the biggest unmet need in AI — getting access to the right training data. The process today is time intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data.

Solving AI’s data problem is a generational opportunity. We’re backed by world-class investors and already powering partnerships with some of the most ambitious teams in AI. The company that succeeds will be one of the largest in AI — and in tech.

We’re a lean, fast-moving, high-trust team of builders who are obsessed with velocity and impact. Our culture is built for people who thrive on ambiguity, own outcomes, and want to shape the future of data and AI.

Role Overview

The Data Annotation & Redaction Associate (PHI) supports Protege’s core data operations by helping prepare sensitive healthcare documents for AI training workflows. This is a fully remote W2 role, open to candidates based anywhere in the United States, and it involves handling Protected Health Information (PHI) under strict security and confidentiality requirements.

This is a 2-month position with the possibility of extending into a permanent hire based on business needs and performance.

Your first major project will be de-identifying thousands of PDFs by redacting HIPAA identifiers (e.g., names, locations, ages, dates, contact information, record numbers) according to a clear playbook and review process. Accuracy, consistency, and speed matter.

Full-time (40 hrs/week) is preferred, but hours are flexible and we will consider part-time applicants.


What You’ll Do
  • De-identify high volumes of healthcare PDFs by accurately redacting PHI identifiers (names, locations, dates, ages, IDs, and other identifiers) in accordance with established guidelines (hhs.gov)

  • Follow a redaction/annotation playbook closely, including how to handle edge cases and when to escalate questions

  • Complete light QA on your own work (spot checks, verify redactions applied correctly, ensure no PHI remains visible/searchable)

  • Track daily throughput and communicate status clearly (what’s done, what’s blocked, what needs review)

  • Maintain organized file handling and versioning so work is easy to audit and review

  • Operate within strict security policies for PHI handling (confidentiality, access controls, and device hygiene)

About You
  • You have excellent attention to detail and can do focused, repetitive work without accuracy drift

  • You’re dependable and consistent—show up, hit your daily targets, and follow process

  • You learn rules quickly and apply them consistently

  • You are comfortable asking questions when something is unclear rather than guessing

  • You work well independently in a fully remote environment

  • You treat those around you with kindness

Required
  • Authorized to work in the U.S. and able to work as a W2 employee based anywhere in the United States (required for PHI access)

  • Comfort handling sensitive information and following strict privacy/security rules

  • Experience with detail-oriented work (administrative operations, document review, medical records handling, QA, compliance support, or data labeling/annotation)

  • Comfort working with PDFs and basic productivity tools (Google Workspace / Microsoft Office)

  • Strong written communication and reliable follow-through

  • Ability to maintain speed and accuracy across large volumes of similar documents

Bonus
  • Prior experience in HIPAA-regulated environments or working with healthcare documents (hhs.gov)

  • Experience redacting or reviewing documents (legal, healthcare, insurance, or compliance contexts)

  • Experience in data annotation/labeling workflows

  • Comfort tracking work in spreadsheets and following simple metrics (throughput, error rate)

Top Skills

Google Workspace
MS Office


