A powerful website screenshot tool designed specifically for the CoreClaw platform. It captures web page screenshots and returns them as Base64-encoded data embedded in JSON output.
The Website Screenshot Tool is an automated web page capture solution designed to provide high-quality screenshot services for CoreClaw platform users. With CoreClaw, you can capture web page screenshots without writing code, empowering website monitoring, design asset collection, SEO auditing, content archiving, and brand monitoring.
| 📸 Webpage Screenshots | 📄 Base64 Encoded Data |
| 🖼️ Thumbnail Optimization | 🔤 OCR Extracted Text |
| 🏷️ Page Title Information | 🔗 Final URL |
| 📏 Image Dimensions | 📊 File Size (KB) |
| 🎨 Output Format Information | ⚠️ Error Information |
CoreClaw Website Screenshot Tool handles browser startup, page loading, screenshot generation, and data encoding in the background. In just a few minutes, you can capture data through these steps:
| Parameter | Type | Default | Description |
|---|---|---|---|
| urls | array | - | Required. List of URLs to capture. Each item can be a string or { "url": "..." } object |
| format | string | "jpeg" | Output format:jpeg, png, webp |
| quality | integer | 80 | JPEG/WebP quality (1-100). Higher values mean better quality but larger files |
| outputMode | string | "thumbnail" | Output mode:thumbnail, base64, metadata, ocr, chunks |
| thumbnailWidth | integer | 400 | Thumbnail width in pixels |
| maxBase64Size | integer | 500 | Maximum embed size in KB. 0 = unlimited |
| fullPage | boolean | true | Capture the entire scrollable page |
| viewportWidth | integer | 1280 | Browser viewport width in pixels |
| timeout | integer | 60 | Page load timeout in seconds |
| blockAds | boolean | true | Block ads and tracking scripts |
| scrollToBottom | boolean | false | Scroll to bottom before capture (for lazy-loaded content) |
| waitForSelector | string | "" | CSS selector to wait for before capture |
| selectorsToHide | array | [] | CSS selectors to hide (e.g., cookie banners) |
| delay | integer | 0 | Additional delay after page load in milliseconds |
Example 1: Basic Screenshot (Recommended)
Example 2: Full-Page Screenshot
Example 3: Batch Processing
Example 4: OCR Text Extraction
Example 5: Hide Unwanted Elements
Each screenshot request will output the following data:
Basic Information
Image Information
Image Data
<img> tags)Other Information
JSON Example:
JPEG - Recommended Format
PNG
WebP
| Mode | Description | Use Case |
|---|---|---|
| thumbnail | Recommended. Compressed thumbnail | Most cases, batch processing |
| base64 | Full Base64 embed | When full quality is needed |
| metadata | Metadata only, no image | Quick URL status check |
| ocr | Extract text content | Content analysis, SEO auditing |
| chunks | Split large images | Extra-long full-page screenshots |
Recommendation: Use thumbnail mode for regular use, ocr mode for text analysis, and metadata mode for quick verification.
Full-Page Screenshot (fullPage: true)
Viewport-Only Screenshot (fullPage: false)
If the screenshot appears blank, try these methods:
1. Use Element Wait
Wait for the main content element to appear before capturing.
2. Set Delay Time
Add an extra 2-second wait after page load.
3. Check Website Structure
Explore more popular scrapers from our marketplace
by CoreClaw
It queries the Google search engine by keyword and returns a structured SERP summary, including the final search parameters, organic results, related queries, and people-also-ask data.
by Odin Kael
Dedup Datasets Worker is a powerful tool for merging and deduplicating datasets from multiple JSON/JSONL files. Fully optimized for the CafeScraper platform with enhanced features and robust error handling.
by Odin Kael
A powerful Google Sheets data import export tool designed for data synchronization, backup, and integration between Google Sheets and external systems. Supports three operation modes, two authentication methods, batch processing, data deduplication, and automatic backup.
by Odin Kael
A high-speed static page scraper based on Cheerio, designed specifically for static HTML pages. Uses Cheerio for HTML parsing, delivering speeds 10-50 times faster than full browser rendering.