Playwright vs. Selenium — Trade-offs at Expert Depth • Krishan Chawla

Introduction#

“Playwright vs. Selenium” is usually framed as a popularity contest — newer, faster tool vs. the incumbent everyone already knows. That framing hides the actual decision, which is architectural: how each tool talks to the browser, what that buys you, and what it costs you in environments Selenium was never designed for and Playwright hasn’t fully solved either.

This isn’t a “just use Playwright” post. Both tools are still correct choices depending on what you’re testing, what you already have in CI, and how much control you need over the browser layer.

At a glance, before the detail:

Category	Playwright	Selenium
Protocol	CDP / WebDriver BiDi	W3C WebDriver
Synchronization	Auto-waiting, built in	Explicit/implicit waits you manage
Cross-browser	Chromium, Firefox, WebKit (bundled builds)	Virtually every real installed browser
Mobile	Emulation only, no native apps	Real device/native coverage via Appium
Parallelism	Native (workers, contexts, sharding)	Framework-dependent (Grid, TestNG, Docker)
Network interception	Native (`page.route()`)	Requires CDP or a proxy like BrowserMob
API testing	Built in (`APIRequestContext`)	Needs a separate client (REST Assured, etc.)
Debugging artifacts	Trace, video, screenshots built in	Needs third-party tooling
Ecosystem maturity	Growing fast	Extremely mature, 15+ years

The rest of this post is why each row is the way it is, and where the table oversimplifies.

Architecture: the actual difference#

Selenium — WebDriver, out-of-process#

Selenium talks to the browser through the W3C WebDriver protocol: your test issues an HTTP request to a driver process (chromedriver, geckodriver, msedgedriver), which translates it into native browser automation calls.

Test code → WebDriver client → HTTP (JSON Wire / W3C) → Driver binary → Browser

plaintext

Every action — click, type, navigate — is a network round trip through an external process. This is why Selenium is slower and why “flaky” has historically been associated with it: there’s no shared execution context between your test and the browser, so Selenium has no native way to know when the DOM has actually settled after an action. That’s what WebDriverWait and ExpectedConditions exist to paper over.

Playwright — CDP/BiDi, in-process control#

Playwright talks to Chromium and WebKit browsers directly over the Chrome DevTools Protocol (and increasingly WebDriver BiDi for Firefox), using a persistent WebSocket connection from a Node/Python/Java/.NET process it manages itself — no separate driver binary to version-match.

Test code → Playwright driver process → WebSocket (CDP/BiDi) → Browser

plaintext

Because Playwright owns the browser process lifecycle and has a live protocol connection, it can subscribe to actual browser events — network idle, DOM mutations, frame navigation — instead of polling. This is the real source of Playwright’s “less flaky” reputation: auto-waiting is a byproduct of architecture, not a feature someone bolted on.

Auto-waiting vs. explicit waiting#

This is the single biggest day-to-day productivity difference.

Selenium requires you to explicitly wait for actionability:

WebElement button = new WebDriverWait(driver, Duration.ofSeconds(10))
    .until(ExpectedConditions.elementToBeClickable(By.id("submit")));
button.click();

java

Skip this and you get intermittent ElementClickInterceptedException or StaleElementReferenceException — not because your test is wrong, but because Selenium clicked before the element was interactable (covered by an overlay, animating, or detached and re-rendered by a framework).

Playwright bakes actionability checks into every action — visible, stable (not animating), receives events (not obscured), enabled — before it acts:

await page.getByRole('button', { name: 'Submit' }).click();

typescript

If the element isn’t actionable within the timeout, Playwright throws a specific error telling you which check failed, not a generic “element not found.” That diagnostic quality alone cuts triage time significantly on a large suite.

Where this cuts the other way: auto-waiting can mask genuine race conditions in your app. A button that becomes clickable a beat before its handler is actually wired up will pass Playwright’s actionability check and still fail intermittently — the failure just moves from “test infra” to “app timing bug,” which is more honest but not free.

Locator strategy#

Selenium’s By locators (id, cssSelector, xpath) return a reference to whatever matched at that instant. If the DOM re-renders (React/Vue re-mount), that reference goes stale and you eat a StaleElementReferenceException on the next interaction.

Playwright’s Locator is lazy — it re-resolves the query on every action, so it survives re-renders by design:

const row = page.getByRole('row', { name: 'jane@example.com' });
await row.getByRole('button', { name: 'Delete' }).click();

typescript

Playwright also pushes role/text-based locators (getByRole, getByLabel, getByTestId) as the default idiom, which happens to align with accessible markup — a genuine secondary benefit, not just API sugar.

Shadow DOM is the other place this matters in practice. Playwright pierces open shadow roots automatically — page.locator('my-component >> text=Save') just works. Selenium requires you to explicitly traverse into shadowRoot (and behavior varies by driver/browser version), which is a recurring source of custom helper methods in mature Selenium frameworks.

Network interception, mocking, and API testing#

Selenium has no native network layer. Selenium 4 exposes some Chrome DevTools Protocol access, but intercepting and rewriting requests still typically means bolting on a proxy like BrowserMob, or dropping to raw CDP calls that aren’t part of the stable WebDriver API. API testing is a separate concern entirely — you bring in REST Assured (Java), requests (Python), or similar, wired up alongside your UI framework rather than inside it.

Playwright treats both as first-class:

// Mock a flaky third-party dependency (payment gateway, OTP provider) directly in the test
await page.route('**/api/payment/authorize', (route) =>
  route.fulfill({ status: 200, body: JSON.stringify({ status: 'approved' }) })
);

// Drive API calls in the same test, no separate HTTP client
const api = await request.newContext();
const response = await api.post('/api/orders', { data: { sku: 'ABC-123', qty: 1 } });
expect(response.status()).toBe(201);

typescript

That means you can seed state via API, then assert through the UI, in one test file with one runner — no context-switching between an HTTP client library and your browser automation stack. For flows gated behind a third-party dependency (payment, OTP, SSO), page.route() mocking is often the only practical way to test the failure paths deterministically.

Authentication state reuse#

Logging in through the UI once per test is one of the biggest sources of wasted CI time in both tools, but Playwright makes the fix a one-liner. Authenticate once, persist the session, and reuse it across every test that doesn’t specifically need to test login itself:

// global-setup.ts — runs once
const page = await browser.newPage();
await page.goto('/login');
await page.getByLabel('Email').fill('user@example.com');
await page.getByLabel('Password').fill('••••••••');
await page.getByRole('button', { name: 'Sign in' }).click();
await page.context().storageState({ path: 'auth.json' });

typescript

// playwright.config.ts
use: { storageState: 'auth.json' }

typescript

Every subsequent test starts already authenticated — no repeated login flow eating seconds per test across a suite that might run thousands of times a month. Selenium has no equivalent primitive; teams typically replicate this by injecting cookies or local storage manually after driver init, which works but isn’t built in and isn’t as robust across auth schemes (e.g. anything involving httpOnly or SameSite cookie nuances).

Parallelism and execution model#

Selenium Grid (v4) distributes across nodes, but each session is a full browser instance with its own driver process — heavy to spin up, and coordination overhead (hub/router, session queuing) grows with grid size. It’s proven at scale, but the ops burden is real: someone owns the Grid infrastructure.

Playwright ships parallelism as a first-class runner feature (playwright.config.ts workers), and browser contexts are cheap — isolated, cookie/storage-separated sessions within a single browser process launch. You get Selenium-Grid-like isolation without spinning up a new browser per test:

export default defineConfig({
  workers: process.env.CI ? 4 : undefined,
  fullyParallel: true,
});

typescript

For a suite of a few hundred E2E tests, Playwright’s context-based parallelism is materially cheaper in CI compute than an equivalent Selenium Grid setup — fewer browser process cold-starts per test.

The same context model simplifies multi-tab and popup flows — OAuth redirects, “open in new tab” links, payment provider popups — which are a genuine pain point in Selenium:

// Selenium: manual window-handle juggling
const originalWindow = await driver.getWindowHandle();
const windows = await driver.getAllWindowHandles();
for (const handle of windows) {
  if (handle !== originalWindow) await driver.switchTo().window(handle);
}

typescript

// Playwright: the new page is just an object
const [popup] = await Promise.all([
  page.waitForEvent('popup'),
  page.getByRole('link', { name: 'Pay with PayPal' }).click(),
]);
await popup.getByLabel('Email').fill('user@example.com');

typescript

Cross-browser and real-device coverage#

This is where Selenium still wins outright.

Selenium drives real installed browsers via vendor-maintained drivers — actual Chrome, actual Firefox, actual Safari, actual legacy Edge if you’re unlucky enough to need it. It’s the only realistic path to real mobile browser testing through Appium (which reuses the WebDriver protocol) or cloud grids like BrowserStack/Sauce Labs for genuine device/OS/browser matrices.
Playwright ships its own bundled builds of Chromium, Firefox, and WebKit. “WebKit” here is not Safari — it’s a Playwright-patched WebKit build, which is a very good proxy for Safari behavior but has occasionally diverged from real Safari edge cases (especially around media APIs, permissions prompts, and some CSS rendering). If your product has a Safari-specific bug class (and most do, eventually), you cannot fully rule it out with Playwright’s WebKit alone.
Playwright has no first-party mobile device automation — page.emulate gives you viewport/UA emulation of a desktop browser pretending to be mobile, not a real mobile browser engine.
Selenium is also the only option for legacy corporate environments still running Internet Explorer or older Edge builds via vendor drivers. Playwright never supported them and never will — it targets evergreen browsers only.

Verdict: if your test matrix must include “real Safari on real macOS” or “real Chrome on a real Android device,” you still need Selenium/Appium or a cloud grid in the loop, full stop — Playwright is not a substitute there.

Debuggability#

This isn’t close. Playwright’s tooling — Trace Viewer (full DOM snapshots, network, console, and a timeline scrubber per test run), the Inspector, --debug step mode, and codegen — gives you a post-mortem of a CI failure that looks like a time-travel debugger. Selenium has nothing built-in comparable; you’re left instrumenting screenshots-on-failure and video capture yourself (or paying for a cloud grid that bolts this on).

npx playwright test --trace on
npx playwright show-trace trace.zip

bash

For a team fighting flaky-test triage time, Trace Viewer alone can be the deciding factor, independent of everything else in this comparison.

Screenshots, video, and reporting#

Playwright bundles page.screenshot() and locator.screenshot() with built-in masking and clipping, plus recordVideo on the browser context — no extra dependency. Getting the same artifacts out of Selenium means wiring up FFmpeg, a Grid video-recording sidecar, or a paid cloud grid that adds it for you.

Reporting follows the same pattern. Playwright’s HTML reporter ships with traces, videos, and screenshots attached out of the box; you can still layer Allure on top if that’s your org’s standard. Selenium has no built-in report format — ExtentReports, Allure, and ReportPortal are the de facto standard, but they’re all separate dependencies you own the versioning and configuration for. Neither gap is fatal, but it’s real setup and maintenance surface Playwright starts you without.

Language and ecosystem maturity#

Selenium: bindings in Java, Python, C#, Ruby, JavaScript, Kotlin — genuinely first-class in each, backed by 15+ years of Stack Overflow answers, Page Object frameworks (Serenity, Selenide), and enterprise QA tooling that assumes WebDriver underneath (most commercial test-management and visual-regression tools integrate with Selenium by default).
Playwright: first-class in TypeScript/JavaScript, Python, .NET, and Java — but the JS/TS experience is where the tool is designed and battle-tested first; other bindings sometimes lag on API parity for newly released features. If your org is Java-first with a large existing Selenium/TestNG investment, migrating fully is a bigger lift than the marketing suggests.

When to actually choose which#

Choose Playwright when:

You’re starting a new E2E suite and the target is modern web apps (React/Vue/Angular) in Chromium/Firefox/WebKit-class browsers
CI cost and flakiness triage time are real pain points
Your team is JS/TS-native or Python-native
You want network interception/mocking and API testing as native features (page.route(), APIRequestContext), not a Selenium-plus-BrowserMob-plus-REST-Assured bolt-on
Login flows dominate your suite’s runtime and storageState reuse would meaningfully cut CI time

Choose Selenium when:

You need real Safari, real mobile browsers, or a specific legacy browser (including IE/old Edge) in the matrix
You already have a mature Selenium Grid + Page Object investment and the migration cost outweighs the flakiness savings
Your org’s commercial test-management/visual-testing tooling is WebDriver-native
You’re testing through Appium for native + hybrid mobile apps, where WebDriver is the shared protocol

Conclusion#

The honest framing: Playwright’s architecture eliminates an entire category of flakiness that Selenium engineers have spent a decade building workarounds for, and its debugging tooling is a genuine step change. But Selenium’s WebDriver protocol is still the only standard that gets you real cross-browser and real-device coverage, and that’s not a gap Playwright’s bundled-browser model closes. Pick based on what’s actually in your browser matrix, not which tool has the better changelog.

Key Takeaways#

Selenium drives real browsers out-of-process via WebDriver; Playwright drives its own bundled browser builds in-process via CDP/BiDi — this single difference explains almost every other trade-off
Playwright’s auto-waiting is an architectural byproduct, not a convenience feature, and materially reduces flaky-test triage time
Selenium remains the only realistic choice for real Safari and real mobile-device coverage; Playwright’s WebKit is a very good but imperfect Safari proxy
Playwright’s Trace Viewer has no real Selenium equivalent and is often the deciding factor on its own
Native API testing (APIRequestContext) and storageState auth reuse let Playwright seed state and skip repeated logins without a separate HTTP client — a real CI-time win at scale
Don’t migrate a mature Selenium Grid investment to Playwright purely for speed — check your browser/device matrix requirements first