A powerful cross-browser web scraping tool using Playwright for complete browser rendering. Supports Chromium, Firefox, and WebKit browser engines. Perfect for dynamic pages, single-page applications (SPAs), infinite scroll pages, and cross-browser testing scenarios.
Playwright Web Scraping is a cross-browser web scraping tool based on Playwright, designed for handling complex websites that require full browser rendering and JavaScript execution. Unlike scrapers that only support Chromium, it supports Chromium, Firefox, and WebKit engines, allowing you to test and scrape data across different browsers. With CoreClaw, you can scrape dynamic pages, SPA applications, and infinite scroll pages without writing code, enabling scenarios such as cross-browser data collection, compatibility testing, and dynamic content extraction.
| 🔗 Page URL | 📄 Page Title |
| 📏 Crawling Depth | 🔢 HTTP Status Code |
| 🔗 Number of Links Found | 📝 Page Content |
| 🌐 Dynamically Generated Content | 🎯 Custom Extracted Data |
| 🌐 Browser Type | ⏱️ Loading Timestamp |
CoreClaw Playwright Web Scraping handles multi-browser startup, page loading, JavaScript execution, link discovery, and data extraction in the background. In just a few minutes, you can extract data through these steps:
| Parameter | Type | Default | Description |
|---|---|---|---|
| startUrls | array | - | Required. List of start URLs |
| browserType | string | "chromium" | Browser type:chromium, firefox, webkit |
| linkSelector | string | "a[href]" | CSS selector for discovering links |
| maxDepth | integer | 1 | Maximum crawling depth (0 means only crawl start pages) |
| maxPages | integer | 50 | Maximum number of pages to crawl |
| concurrency | integer | 3 | Number of concurrent browser tabs (recommended 3-5) |
| pageTimeout | integer | 30 | Page load timeout in seconds |
| waitUntil | string | "domcontentloaded" | Page navigation completion:domcontentloaded, load, networkidle |
| pageFunction | string | - | Custom page function (JavaScript code) |
| infiniteScroll | boolean | false | Enable infinite scroll |
| scrollMaxTimes | integer | 5 | Maximum scroll times for infinite scroll |
| scrollDelay | integer | 2000 | Scroll delay in milliseconds |
| closeCookieModals | boolean | true | Automatically close Cookie banners |
| urlPattern | string | - | Glob pattern for URL filtering (e.g.,**/article/**) |
| regexPattern | string | - | Regular expression for URL filtering |
| debugLog | boolean | false | Enable debug logging |
Example 1: Basic Scraping
Example 2: Cross-Browser Testing
Example 3: WebKit Mobile Scraping
Example 4: Infinite Scroll Pages
Example 5: Custom Data Extraction
Example 6: URL Filtering
**/article/**^https://example\.com/article/\d+$For your convenience, output results are displayed in tables and tabs. You can download results in JSON format.
Each scraped page will output the following data:
Basic Fields
Link Information
Time Information
Browser Information
Custom Data
Other Information
Example Data:
Use Playwright when you need cross-browser testing or support for Firefox/WebKit. Use Puppeteer when you only need Chromium.
In most scenarios, Chromium is sufficient. Choose other browsers for special requirements.
In most scenarios, use domcontentloaded or load. Use networkidle only when you need to ensure all resources are loaded.
The tool has built-in automatic Cookie banner closing functionality.
Explore more popular scrapers from our marketplace
by CoreClaw
It queries the Google search engine by keyword and returns a structured SERP summary, including the final search parameters, organic results, related queries, and people-also-ask data.
by Odin Kael
A powerful Google Sheets data import export tool designed for data synchronization, backup, and integration between Google Sheets and external systems. Supports three operation modes, two authentication methods, batch processing, data deduplication, and automatic backup.
by Odin Kael
A high-speed static page scraper based on Cheerio, designed specifically for static HTML pages. Uses Cheerio for HTML parsing, delivering speeds 10-50 times faster than full browser rendering.
by Odin Kael
A powerful web scraping tool using Puppeteer for complete browser rendering. Supports full browser rendering, automatic Cookie banner closing, URL filtering, and more.