AI coding agents like Roo and GitHub Copilot are incredible at writing code, but they’re completely blind to what your application actually looks like. While working on a recent project, I needed my AI agent to help debug UI issues and verify that components rendered correctly. The agent could read my React code and suggest fixes, but it couldn’t see that the submit button was hidden behind a modal, or that my CSS changes broke the mobile layout. That’s when I discovered the Playwright MCP server—a way to give your AI agent actual eyes to see and interact with your web application.
The Problem: AI Agents Live in a Text-Only World
Traditional AI coding agents work exclusively with text:
- They read your source code files
- They write code changes
- They run terminal commands and see text output
- They analyze error messages and stack traces
But they can’t answer questions like:
- “Does this button actually appear on the page?”
- “Is the modal positioned correctly?”
- “Did my CSS changes work as expected?”
- “Can a user actually click this element, or is something covering it?”
This creates a significant blind spot. You end up in conversations like this:
You: “The login button isn’t working”
Agent: “Let me check the code… the onClick handler looks correct”
You: “But it’s not clickable on the page”
Agent: “Try adding cursor: pointer in the CSS?”
You: sigh
The agent is guessing based on code, when the real issue might be a z-index problem, an overlapping element, or a CSS specificity conflict that’s only visible in the rendered page.
The Playwright MCP server specifically gives AI agents the ability to:
- Navigate to web pages
- Take screenshots
- Click elements
- Fill forms
- Inspect the DOM
- Run JavaScript in the browser context
It’s like giving your AI agent a pair of eyes and hands to interact with your web application.
Setting Up Playwright MCP
The setup is surprisingly straightforward. The Playwright MCP server runs as a separate process that your AI agent communicates with.
Configuration in Roo (VS Code)
If you’re using Roo in VS Code, you configure MCP servers in your settings file. Here’s my actual configuration:
Location: %APPDATA%\Code - Insiders\User\globalStorage\rooveterinaryinc.roo-cline\settings\mcp_settings.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
{
"mcpServers": {
"playwright": {
"command": "npx",
"args": [
"@playwright/mcp@latest",
"--browser=msedge",
"--extension"
],
"alwaysAllow": [
"browser_click",
"browser_navigate",
"browser_snapshot",
"browser_screenshot"
]
}
}
}
Key points:
- The
commandisnpxto auto-download latest version --browser=msedgeuses Microsoft Edge (you can usechromium,firefox, orwebkit)--extensionenables authenticated session accessalwaysAllowlists tools the agent can use without asking permission each time
Why --extension Matters
The --extension flag is a game-changer for real-world development. Without it, Playwright starts a fresh browser session with no cookies, no authentication, no nothing. But with --extension, your AI agent can:
- Access authenticated pages - If you’re logged into Azure DevOps, the agent can navigate authenticated routes
- Use existing browser state - Cookies, local storage, service workers all persist
- Test real user scenarios - Not just public pages, but actual authenticated workflows
- Skip login flows - No need to automate authentication in every test
This is especially valuable when working with private feeds, internal tools, or any application behind authentication.
For example, here i asked roo to get the colors from my open tab
it then starts the Playwright MCP and open the browser for me to pick the tab i want
it then process the information and give me the results
For more details on the extension mode, see the Playwright MCP extension documentation.
The full list includes many more—check the Playwright MCP documentation for details.
Conclusion
Giving your AI coding agent eyes through Playwright MCP bridges the gap between code and reality. It’s like having a pair programmer who can actually see your application, not just imagine it from the code.
The combination of code analysis and visual inspection creates a powerful debugging workflow:
- AI reads code - Understands logic
- AI inspects visuals - Verifies behavior
- AI suggests fixes - Based on both code and visual context
If you’re already using AI coding agents, adding Playwright MCP is a no-brainer. The setup takes 5 minutes, and the productivity boost is immediate.
Have you tried Playwright MCP or other visual testing tools with AI agents? What’s your experience? Share in the comments below!