Photon

QA Lead (Automation + Performance) - Dallas, TX

Posted Yesterday
In-Office or Remote
Hiring Remotely in United States
38K-133K Annually
Expert/Leader
Lead QA Automation for agentic AI products by designing eval pipelines, golden datasets, and automated tests for tool-use, hallucination detection, latency/token monitoring, and regression across models and prompts. Integrate performance testing into CI/CD and collaborate with AI engineers to convert requirements into measurable automated evaluations.
The summary above was generated by AI

We are seeking a QA Automation Lead who is ready to move beyond traditional "Pass/Fail" testing. In this role, you will design and build automation frameworks specifically for Agentic AI products. You will focus on evaluating the performance of autonomous agents, ensuring they follow logical reasoning paths, call the correct tools, and provide accurate, safe outputs.

Your mission is to build the "evaluations" (Evals) that define what high-quality AI behavior looks like, moving the product from unpredictable experiments to production-grade software.

Key Responsibilities

  • Non-Deterministic Testing: Develop automation strategies for probabilistic outputs, using model-based evaluation to "test the tester."
  • Building "Eval" Pipelines: Create and maintain "Golden Datasets" to benchmark agent performance across different versions of prompts and models.
  • Tool-Use Validation: Build automated tests to verify that agents call the correct functions/APIs with the right parameters in complex multi-step workflows.
  • Regression Testing for Prompts: Monitor how subtle changes in prompt engineering or model updates (e.g., moving from GPT-4 to Claude 3.5) affect the product’s reliability.
  • Latency & Token Monitoring: Integrate performance testing into the CI/CD pipeline to track agent reasoning time and cost-efficiency.
  • Hallucination Detection: Develop automated checks to identify and report AI hallucinations, bias, or "jailbreak" attempts.
  • Collaboration: Work closely with AI Engineers to translate "vague" business requirements into measurable, automated test cases.

Required Skills & Qualifications

  • Experience: 10+ years in QA Automation, with a recent focus on AI/ML or LLM-based applications.
  • Python Proficiency: Expert-level Python skills (the industry standard for AI testing) and experience with testing frameworks like Pytest.
  • AI Testing Tools: Familiarity with AI evaluation frameworks such as LangSmith, DeepEval, RAGAS, or Promptfoo.
  • API & Backend Testing: Deep experience with Playwright, Selenium, or Cypress for UI testing, with a heavy focus on API-level testing and database validation.
  • Statistical Mindset: Understanding that AI testing often requires "scoring" (e.g., 85% accuracy) rather than a simple binary pass/fail.
  • Data Skills: Ability to work with SQL and JSON to validate data retrieved by agents during RAG (Retrieval-Augmented Generation) processes.

Preferred Qualifications

  • Experience testing Multi-Agent Systems (where one agent tests another).
  • Knowledge of Prompt Engineering and how it influences software behavior.
  • Background in Investment Banking or Fintech (if applicable) to understand high-stakes data accuracy.

Compensation, Benefits and Duration

Minimum Compensation: USD 38,000
Maximum Compensation: USD 133,000
Compensation is based on the candidate's actual experience and qualifications. The above is a reasonable, good-faith estimate for the role.
Medical, vision, and dental benefits, a 401k retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full-time employees.
This position is not available for independent contractors.
No applications will be considered if received more than 120 days after the date of this post.
