Photon

QA Engineer (Automation) - Dallas, TX

Posted Yesterday
In-Office or Remote
Hiring Remotely in United States
38K-133K Annually
Expert/Leader
Design and build automation frameworks and evaluation pipelines for agentic AI products. Develop non-deterministic tests, golden datasets, tool-use validation, prompt regression tests, latency/token monitoring, and hallucination detection. Collaborate with AI engineers to convert requirements into measurable automated test cases.
The summary above was generated by AI

We are seeking a QA Automation Engineer who is ready to move beyond traditional "Pass/Fail" testing. In this role, you will design and build automation frameworks specifically for Agentic AI products. You will focus on evaluating the performance of autonomous agents, ensuring they follow logical reasoning paths, call the correct tools, and provide accurate, safe outputs.

Your mission is to build the "evaluations" (Evals) that define what high-quality AI behavior looks like, moving the needle from unpredictable experiments to production-grade software.

Key Responsibilities

  • Non-Deterministic Testing: Develop automation strategies for probabilistic outputs, using model-based evaluation to "test the tester."
  • Building "Eval" Pipelines: Create and maintain "Golden Datasets" to benchmark agent performance across different versions of prompts and models.
  • Tool-Use Validation: Build automated tests to verify that agents call the correct functions/APIs with the right parameters in complex multi-step workflows.
  • Regression Testing for Prompts: Monitor how subtle changes in prompt engineering or model updates (e.g., moving from GPT-4 to Claude 3.5) affect the product’s reliability.
  • Latency & Token Monitoring: Integrate performance testing into the CI/CD pipeline to track agent reasoning time and cost-efficiency.
  • Hallucination Detection: Develop automated checks to identify and report AI hallucinations, bias, or "jailbreak" attempts.
  • Collaboration: Work closely with AI Engineers to translate "vague" business requirements into measurable, automated test cases.
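
As a rough illustration of the tool-use validation responsibility above, the sketch below compares the sequence of tool calls an agent made against an expected sequence. The `ToolCall` type, the `validate_tool_calls` helper, and the example tool names are hypothetical and are not taken from any specific agent framework; a real project would adapt this to however its agent records traces.

```python
from dataclasses import dataclass, field


@dataclass
class ToolCall:
    """One recorded tool invocation (hypothetical trace format)."""
    name: str
    args: dict = field(default_factory=dict)


def validate_tool_calls(actual, expected):
    """Return a list of mismatch descriptions; an empty list means the trace matches.

    Only the arguments listed in the expected call are checked, so the agent
    may pass extra optional arguments without failing the check.
    """
    problems = []
    if len(actual) != len(expected):
        problems.append(f"expected {len(expected)} calls, got {len(actual)}")
    # Compare step by step over the overlapping prefix of both traces.
    for step, (got, want) in enumerate(zip(actual, expected)):
        if got.name != want.name:
            problems.append(f"step {step}: expected tool {want.name!r}, got {got.name!r}")
            continue
        for key, value in want.args.items():
            if key not in got.args:
                problems.append(f"step {step}: missing argument {key!r}")
            elif got.args[key] != value:
                problems.append(
                    f"step {step}: argument {key!r} expected {value!r}, got {got.args[key]!r}"
                )
    return problems
```

In a Pytest suite, a test could simply assert that `validate_tool_calls(trace, expected)` returns an empty list, so that any mismatch descriptions appear in the failure report.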

Required Skills & Qualifications

  • Experience: 10+ years in QA Automation, with a recent focus on AI/ML or LLM-based applications.
  • Python Proficiency: Expert-level Python skills (the industry standard for AI testing) and experience with testing frameworks like Pytest.
  • AI Testing Tools: Familiarity with AI evaluation frameworks such as LangSmith, DeepEval, RAGAS, or Promptfoo.
  • API & Backend Testing: Deep experience with Playwright, Selenium, or Cypress for UI testing, combined with a heavy focus on API-level testing and database validation.
  • Statistical Mindset: Understanding that AI testing often requires "scoring" (e.g., 85% accuracy) rather than a simple binary pass/fail.
  • Data Skills: Ability to work with SQL and JSON to validate data retrieved by agents during RAG (Retrieval-Augmented Generation) processes.
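
To make the "scoring" idea above concrete, here is a minimal sketch of an evaluation that reports an accuracy score over a golden dataset and compares it to a threshold instead of failing on the first wrong answer. The function names, the dataset shape (a list of dicts with `input` and `expected` keys), and the exact-match grader are all assumptions made for illustration; the 0.85 default mirrors the 85% example in the posting.

```python
def normalized_match(answer, expected):
    """Very simple grader: case- and whitespace-insensitive exact match."""
    return answer.strip().lower() == expected.strip().lower()


def score_against_golden(agent, golden, grader=normalized_match):
    """Return the fraction of golden cases whose answer the grader accepts.

    `agent` is any callable that maps an input string to an answer string.
    """
    if not golden:
        raise ValueError("golden dataset is empty")
    passed = sum(
        1 for case in golden if grader(agent(case["input"]), case["expected"])
    )
    return passed / len(golden)


def meets_threshold(score, threshold=0.85):
    """Gate a release on an aggregate score rather than on every single case."""
    return score >= threshold
```

Because agent outputs are probabilistic, a team might run the same golden dataset several times and track the score distribution across prompt or model versions, rather than relying on a single run.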

Preferred Qualifications

  • Experience testing Multi-Agent Systems (where one agent tests another).
  • Knowledge of Prompt Engineering and how it influences software behavior.
  • Background in Investment Banking or Fintech (if applicable) to understand high-stakes data accuracy.

Compensation, Benefits and Duration

Minimum Compensation: USD 38,000
Maximum Compensation: USD 133,000
Compensation is based on the actual experience and qualifications of the candidate. The above is a reasonable, good-faith estimate for the role.
Medical, vision, and dental benefits, 401k retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full-time employees.
This position is not available for independent contractors.
No applications will be considered if received more than 120 days after the date of this post.


