arielshemesh1999@gmail.com · Israel

MCP Playwright

Claude drives a real browser — click, fill, snapshot, run E2E — through the accessibility tree, not pixels.

What it is

Microsoft's playwright-mcp is an MCP server that lets an LLM steer a real Chromium / Firefox / WebKit browser. It works from structured snapshots of the page's accessibility tree rather than screenshots, so it is fast, avoids the ambiguity of pixel-based input, and doesn't need a vision model.

What you get (top 5 tools)

  • browser_navigate — open a URL.
  • browser_click — click an element by accessibility ref.
  • browser_snapshot — capture the accessibility tree of the current page.
  • browser_type — type into an input.
  • browser_fill_form — fill many fields in one call.
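
For a sense of what these tools look like on the wire, here is a minimal sketch of the JSON-RPC `tools/call` requests an MCP client might send. The `tool_call` helper is hypothetical, the argument names (`url`, `element`, `ref`) and the ref value `e12` are assumptions to check against the tool reference, and `https://example.com` is a placeholder.

```shell
# Hypothetical helper: print one JSON-RPC 2.0 "tools/call" request.
#   $1 = request id, $2 = tool name, $3 = tool arguments as a JSON object
tool_call() {
  printf '{"jsonrpc":"2.0","id":%s,"method":"tools/call","params":{"name":"%s","arguments":%s}}\n' "$1" "$2" "$3"
}

# Open a page (placeholder URL).
tool_call 1 browser_navigate '{"url":"https://example.com"}'

# Capture the accessibility tree; the response lists elements with refs.
tool_call 2 browser_snapshot '{}'

# Click an element using a ref taken from that snapshot.
# "e12" is a made-up ref, and the argument names are assumed.
tool_call 3 browser_click '{"element":"Sign in link","ref":"e12"}'
```

In practice Claude issues these calls for you. The point of the sketch is that a click targets a ref from the most recent snapshot rather than screen coordinates.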

Install

One line via the Claude Code CLI:

bash
claude mcp add playwright npx @playwright/mcp@latest

Flags worth knowing:

  • --headless — run without a visible browser window.
  • --caps=vision — opt in to pixel-based clicks (when a11y refs aren't enough).
  • --allowed-origins — restrict which domains the agent can hit.
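
As a rough sketch of how these flags might be combined at registration time, the snippet below assembles the command and prints it as a dry run. The variable names are illustrative, `https://example.com` is a placeholder origin, and the `--` separator assumes the Claude Code CLI convention of passing everything after it through to the server command.

```shell
# Flags for the Playwright MCP server itself (taken from the list above).
# https://example.com is a placeholder origin.
SERVER_FLAGS="--headless --allowed-origins https://example.com"

# Everything after `--` is handed to npx and the server,
# not parsed by the Claude CLI.
REGISTER_CMD="claude mcp add playwright -- npx @playwright/mcp@latest $SERVER_FLAGS"

# Dry run: print the command for review before running it in a terminal.
echo "$REGISTER_CMD"
```

If a server named playwright is already registered, it would likely need to be removed first (`claude mcp remove playwright`) before adding it again with new flags.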

Try it

prompt
Open the homepage of a site, snapshot the page, click the primary nav item, and verify the dropdown opens.

github.com/microsoft/playwright-mcp — full tool reference and config flags.