Blog

Engineering insights, testing best practices, and product updates from the Gaffer team.

Playwright MCP with Claude Code: Setup & CI Guide

June 22, 2026

Set up Playwright MCP with Claude Code, run it in CI, and store the results across runs. A complete guide to the browser-automation agentic loop.

By Alex Gandy

Playwright Pricing in 2026: Free vs Paid Compared

June 22, 2026

Is Playwright free? A 2026 Playwright pricing breakdown of Microsoft Playwright Testing, Currents, and test-result hosting, compared by billing model.

By Alex Gandy

Best Test Automation Reporting Tools in 2026 (Compared)

June 16, 2026

A fair comparison of 8 test automation reporting tools in 2026: Gaffer, Allure, ReportPortal, BrowserStack, ExtentReports, Testmo, Katalon TestOps, and TestRail.

By Alex Gandy

Playwright Reports: HTML, JSON, JUnit & Sharing in CI

June 11, 2026

Generate Playwright HTML, JSON, JUnit, and Blob reports in CI, and host them so your team can open any run without access to the machine that produced it.

By Alex Gandy

Test Reporting: What to Include and How to Automate It

June 8, 2026

What a test report should contain, the types of test reports, and how small dev teams automate test reporting in CI instead of writing reports by hand.

By Alex Gandy

Playwright Test Agents: Planner, Generator & Healer

June 8, 2026

How to use Playwright's planner, generator, and healer agents (v1.56+), run them in CI, and track whether agent-written tests actually stay stable over time.

By Alex Gandy

Cypress Reports: How to Generate, Configure, and Share Them

May 20, 2026

Generate Cypress test reports with mochawesome, Allure, JUnit, or CTRF reporters, wire them into CI, and host them so your team can see results.

By Alex Gandy

Test Failures: Types, Root Causes, and How to Fix Them

May 20, 2026

When 200 tests fail in CI, there are rarely 200 problems. A guide to test failure types, what causes each, and how to cluster by root cause.

By Alex Gandy

Why `gaffer affected-tests` Returned Empty for Every E2E-Touching Edit I Made

May 13, 2026

I tried to fix flaky Playwright tests by using Gaffer the way a coding agent would: find flakies, narrow with `affected-tests`, fix, verify. The loop broke at step two. Here's what was wrong and what shipped.

By Alex Gandy

Playwright MCP + Claude Code: A Complete Test Loop

May 10, 2026

Wire Playwright MCP into Claude Code, then close the loop with Gaffer's MCP so the agent reads CI results, spots flaky failures, and iterates.

By Alex Gandy

GitHub Agentic Workflows: Automated Test Reviews with MCP

February 21, 2026

GitHub Agentic Workflows let AI agents run as CI. Here's how we connected Gaffer's MCP server for automated weekly test health reviews.

By Alex Gandy

Health Score Alerts: Know Before Your Test Suite Degrades

February 14, 2026

Per-build notifications tell you a test failed. Health score alerts tell you your suite is degrading. Here's how they work.

By Alex Gandy

How to Manage Flaky E2E Tests at Scale

February 14, 2026

Most teams retry and move on. A systematic approach to flaky test management: flip rate detection, CI cost prioritization, and a fix-quarantine-delete framework.

By Alex Gandy

OpenTelemetry for Test Metrics: Export to Any Stack

February 14, 2026

Your observability stack monitors production but not tests. Gaffer's OpenTelemetry export puts test metrics into Datadog, Grafana, and any OTLP endpoint.

By Alex Gandy

Test Intelligence: The Missing Context for AI Coding Tools

February 14, 2026

AI coding tools know your code but not your tests. Test intelligence via MCP bridges the gap with flaky detection, failure clustering, and coverage data.

By Alex Gandy

Give Your AI Coding Tools Access to Your Test Results

February 11, 2026

Claude Code, Cursor, and Windsurf can query your test history, find flaky tests, and diagnose failures — if you give them structured data. Here's how.

By Alex Gandy

Your AI Agent's Missing Layer: Test Intelligence

February 8, 2026

Agents write code and run tests. But when tests fail, they're flying blind. Here's how test intelligence closes the loop.

By Alex Gandy

How Much Are Flaky Tests Costing You?

January 25, 2026

Flaky tests are an invisible tax on engineering velocity. Industry data from Atlassian, Google, and Microsoft shows 13–16% of test failures are flaky. Here's how to calculate the real cost.

By Alex Gandy

We Built Coverage Analytics. Then We Used Them On Ourselves.

January 22, 2026

Dogfooding Gaffer's MCP coverage tools on our own codebase.

By Alex Gandy

JUnit XML Format Explained: Schema, Examples, CI Integration

January 19, 2026

JUnit XML is the de facto test report format used by Playwright, Jest, Vitest, pytest, Mocha, and Go. Schema, working examples, and CI integration patterns.

By Alex Gandy

What Are Test Artifacts in Software Testing?

January 19, 2026

Test artifacts are the files generated when tests run: reports, logs, screenshots, coverage data. Here's what they are and how to manage them at scale.

By Alex Gandy

Dogfooding Gaffer's MCP Server to Fix Slow Playwright Tests

January 1, 2026

Using Gaffer's own analytics to find and fix slow E2E tests. Here's what I found and how I fixed it.

By Alex Gandy