- Design, build, and maintain automated test suites using Playwright for web and API surfaces, including AI-generated content flows.
- Lead QA strategy for voice automation pipelines built on ElevenLabs, developing test cases for synthesis quality, latency, and failure modes.
- Validate Claude (Anthropic) integrations: prompt-response accuracy, edge-case handling, safety behaviors, and output consistency across builds.
- Build and maintain Node.js-based test tooling, harnesses, and custom reporters for CI/CD pipelines.
- Deploy and monitor test infrastructure on Google Cloud Platform and triage its failures, using Cloud Run, GCS, and Pub/Sub for scalable test execution.
- Define and track quality metrics: test coverage, flakiness rates, mean time to detect (MTTD), and regression velocity.
- Collaborate with engineers during design reviews to surface testability gaps and advocate for observable, fault-tolerant system design.
- Mentor junior QA engineers and establish team-wide standards for test authoring, review, and maintenance.