Design and execute test strategies for AI/ML models and agent-driven workflows. Validate model accuracy, reliability, and data integrity; build automated test frameworks; integrate testing into CI/CD; identify AI risks like bias and hallucinations; collaborate cross-functionally to improve reliability and resolve production issues.
At TREND Health Partners, a tech-enabled payment integrity company, our mission is to facilitate collaboration between payers and providers for mutual benefit and waste reduction, ultimately improving access to healthcare. We achieve this by aligning the common goals of payers and providers and fostering collaboration through a shared technology platform and seamless workflows.
Joining TREND Health Partners means becoming a part of a dynamic growing organization that promotes a collaborative and innovative work environment. Our comprehensive compensation package includes competitive salaries, highly valued health insurance, a 401(k) plan with employer match, paid parental leave, and more.
The AI Test Engineer II's primary responsibility is validating the quality, reliability, and safety of AI powered systems across TREND's platform.
The AI Test Engineer II's primary responsibility is validating the quality, reliability, and safety of AI powered systems across TREND's platform.
Role and Responsibilities
• AI Testing & Validation
o Design and execute test strategies for AI/ML models, including functional, regression, and
scenario-based testing
o Validate accuracy, consistency, and reliability of AI-generated outputs (e.g., summaries,
classifications, recommendations)
o Develop test cases for edge cases, model drift, and unexpected behavior scenarios
o Evaluate model performance against defined success criteria and business requirements
• Agent & Workflow Testing
o Test agent-driven workflows (multi-step AI processes, automation pipelines, orchestration
logic)
o Validate interactions between AI agents, APIs, and downstream systems
o Identify failure points in end-to-end AI workflows and recommend improvements
• Data Quality & Integrity
o Validate training, validation, and test datasets for completeness, accuracy, and bias risks
o Perform data validation testing across pipelines to ensure integrity of inputs and outputs
o Partner with Data Engineering to detect anomalies, inconsistencies, and gaps
• Automation & Tooling
o Build and maintain automated test frameworks for AI-enabled systems
o Develop scripts and tools to support repeatable testing of AI outputs at scale
o Integrate testing into CI/CD pipelines to enable continuous validation
• AI Risk & Guardrails
o Help define and validate guardrails for AI systems (accuracy thresholds, explainability, fallback
logic)
o Identify risks related to hallucinations, bias, and model drift
o Partner with Engineering and AI teams to improve reliability and safety
• Collaboration & Continuous Improvement
o Work cross-functionally with Engineering, QA, AI, Product, and Operations teams
o Participate in sprint planning, backlog refinement, and release validation
o Contribute to testing standards, best practices, and documentation
o Support root cause analysis for AI-related defects and production issues
o Design and execute test strategies for AI/ML models, including functional, regression, and
scenario-based testing
o Validate accuracy, consistency, and reliability of AI-generated outputs (e.g., summaries,
classifications, recommendations)
o Develop test cases for edge cases, model drift, and unexpected behavior scenarios
o Evaluate model performance against defined success criteria and business requirements
• Agent & Workflow Testing
o Test agent-driven workflows (multi-step AI processes, automation pipelines, orchestration
logic)
o Validate interactions between AI agents, APIs, and downstream systems
o Identify failure points in end-to-end AI workflows and recommend improvements
• Data Quality & Integrity
o Validate training, validation, and test datasets for completeness, accuracy, and bias risks
o Perform data validation testing across pipelines to ensure integrity of inputs and outputs
o Partner with Data Engineering to detect anomalies, inconsistencies, and gaps
• Automation & Tooling
o Build and maintain automated test frameworks for AI-enabled systems
o Develop scripts and tools to support repeatable testing of AI outputs at scale
o Integrate testing into CI/CD pipelines to enable continuous validation
• AI Risk & Guardrails
o Help define and validate guardrails for AI systems (accuracy thresholds, explainability, fallback
logic)
o Identify risks related to hallucinations, bias, and model drift
o Partner with Engineering and AI teams to improve reliability and safety
• Collaboration & Continuous Improvement
o Work cross-functionally with Engineering, QA, AI, Product, and Operations teams
o Participate in sprint planning, backlog refinement, and release validation
o Contribute to testing standards, best practices, and documentation
o Support root cause analysis for AI-related defects and production issues
Qualifications
• Bachelor’s degree in Computer Science, Data Science, Engineering, or related field (or equivalent
experience)
• Experience in QA, testing, or a related engineering role
• Experience testing APIs, data pipelines, or distributed systems
• Strong understanding of software testing principles and test automation
• Experience working in Agile development environment.
• Familiarity with AI/ML concepts (model evaluation, training vs inference, prompt behavior, etc.)
• Experience with test automation tools
• Strong SQL/data validation skills
• Experience working with cloud-based systems (AWS or Databricks preferred)
• Ability to analyze system behavior and identify root causes
experience)
• Experience in QA, testing, or a related engineering role
• Experience testing APIs, data pipelines, or distributed systems
• Strong understanding of software testing principles and test automation
• Experience working in Agile development environment.
• Familiarity with AI/ML concepts (model evaluation, training vs inference, prompt behavior, etc.)
• Experience with test automation tools
• Strong SQL/data validation skills
• Experience working with cloud-based systems (AWS or Databricks preferred)
• Ability to analyze system behavior and identify root causes
Preferred Skills
• Experience testing AI/ML models or data-driven systems
• Experience validating AI-generated outputs or prompt-based systems
• Familiarity with LLMs, agent workflows, or AI-assisted coding tools
• Healthcare or regulated data environment experience (HIPAA, compliance)
• Experience working with CI/CD and DevOps pipelines.
• Experience validating AI-generated outputs or prompt-based systems
• Familiarity with LLMs, agent workflows, or AI-assisted coding tools
• Healthcare or regulated data environment experience (HIPAA, compliance)
• Experience working with CI/CD and DevOps pipelines.
Mental And Physical Demands
• This position will be exposed mainly to an indoor office environment and will be expected to work in or
around computers and printers.
• The nature of the work is sedentary, and the employee will be sitting most of the time.
• Essential physical functions of the job include typing and the repetitive motion to utilize computer
software and hardware continuously throughout the day.
• Essential mental functions of this position include concentrating on analytical tasks, reading
information, and verbal/written communication to others continuously throughout the day.
around computers and printers.
• The nature of the work is sedentary, and the employee will be sitting most of the time.
• Essential physical functions of the job include typing and the repetitive motion to utilize computer
software and hardware continuously throughout the day.
• Essential mental functions of this position include concentrating on analytical tasks, reading
information, and verbal/written communication to others continuously throughout the day.
Related Duties As Assigned
• This job description documents the general nature and level of work but is not intended to be a
comprehensive list of activities, duties, or responsibilities required for this position.
comprehensive list of activities, duties, or responsibilities required for this position.
Consequently, employee’s may be asked to perform other duties as required.
• Employees may also be asked to complete certain compliance requirements set forth by our Business
Partners in the performance of their jobs including but not limited to requests for background and drug
screenings and disclosures of personal health information or personally identifiable information.
Exemptions as provided under the ADA and TITLE VII of the Civil Rights Act will be observed and
followed.
• Reasonable accommodations may be made to enable individuals with disabilities to perform the
functions outlined above.
• Employees may also be asked to complete certain compliance requirements set forth by our Business
Partners in the performance of their jobs including but not limited to requests for background and drug
screenings and disclosures of personal health information or personally identifiable information.
Exemptions as provided under the ADA and TITLE VII of the Civil Rights Act will be observed and
followed.
• Reasonable accommodations may be made to enable individuals with disabilities to perform the
functions outlined above.
Similar Jobs
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Lead large-scale facility modernization and expansion programs ($50M–$300M) end-to-end. Define requirements and estimates, select construction delivery methods, apply Earned Value Management for performance tracking, manage P&L and capital, communicate program health to executives and government customers, and govern vendors and risk to ensure on-time, on-budget delivery.
Top Skills:
Earned Value Management (Evm)Integrated Master Schedule (Ims)Ms ProjectPrimavera P6
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Develops and maintains Integrated Master Plans (IMP) and Integrated Master Schedules (IMS); monitors schedule performance using EVM metrics; performs schedule risk assessments; identifies and resolves critical path and logic conflicts; prepares status and performance reports; supports Integrated Baseline Reviews and Performance Measurement Baseline maintenance; coordinates scheduling across finance, engineering, materials, manufacturing, and quality.
Top Skills:
Deltek CobraEarned Value Management (Evm)GanttExcelMS OfficeMicrosoft ProjectPertSchedule Risk SoftwareSsi Tool SetWbs Pro
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
Develop and maintain Integrated Master Plans (IMP) and Integrated Master Schedules (IMS); integrate schedules across functions; monitor schedule performance using earned value metrics; perform schedule risk assessments; identify and resolve critical path and network logic conflicts; prepare status reports and support Integrated Baseline Reviews and Performance Measurement Baseline maintenance.
Top Skills:
Deltek CobraEarned Value Management SystemsGanttExcelMS OfficeMs ProjectPertSchedule Risk SoftwareSsi Tool SetWbs Pro
What you need to know about the Boston Tech Scene
Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

