

Extract comprehensive TikTok data with a single click profiles with detailed metrics (followers, engagement, verification status), video analytics (views, likes, comments, hashtags), and hashtag trending data. Built with anti-detection technology for reliable scraping.
It is a TikTok data collection worker that follows the CoreClaw Worker specification.
It collects TikTok data through a remote browser connection and supports four collection modes:
authorvideosearchtagThe worker is orchestrated by main.py. Each request is normalized, dispatched to the corresponding scraping module by collection_type, and the standardized results are sent to the platform via CoreSDK.Result.push_data().
| Mode | Description | Example Input |
|---|---|---|
author | Author profile scraping | bellapoarch, @bellapoarch, https://www.tiktok.com/@bellapoarch |
video | Video detail scraping | https://www.tiktok.com/@user/video/1234567890 |
search | User search scraping | apple |
tag | Tag page scraping | fyp |
The worker reads platform input via CoreSDK.Parameter.get_input_json_dict().
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
collection_type | string | Yes | author | Collection mode. One of:author, video, search, tag |
targets | array | Yes | - | List of targets to collect. Depending on the mode, this can be a username, URL, search keyword, or tag word |
max_results | integer | No | 10 | Maximum number of rows returned for search and tag modes |
retry_times | integer | No | 3 | Maximum retry count for retryable network-related errors |
retry_delay_seconds | number | No | 1 | Delay in seconds between retries |
page_timeout_ms | integer | No | 180000 | Page timeout in milliseconds |
wait_after_load_ms | integer | No | 3000 | Additional wait time after page load, in milliseconds |
User search
Tag scraping
Author scraping
Video scraping
This worker currently uses a shared superset output schema across all four modes. The main output fields include:
input_typeinput_valueurlprofile_urltitledescentity_idvideo_idauthor_idusernamenicknametagcreate_timedurationplay_countdigg_countcomment_countshare_countcollect_countverifiedsignatureavatar_urloriginal_avatar_urlprivate_accountfollowing_countfriends_countfans_countheart_countvideo_countmusic_namemusic_authormusic_idmusic_play_urlcover_urlwidthheightstatuserrordata_jsonmax_results is enforced centrally inside main.py:
search: returns at most max_results rowstag: returns at most max_results rowsauthor: no row truncationvideo: no row truncationIf max_results is not provided, the default value is 10.
This worker relies on a remote browser connection. The following environment variables are supported:
| Env Variable | Required | Default | Description |
|---|---|---|---|
BROWSER_WS | No | - | When set, this WebSocket endpoint is used directly to connect to the browser |
ChromeWs | No | chrome-ws-inner.coreclaw.com | Browser host address |
PROXY_AUTH | No | - | Browser connection authentication credential |
PROXY_DOMAIN | No | - | Currently used mainly for logging |
The browser endpoint is resolved in the following order:
BROWSER_WS is set, it is used directly.BROWSER_WS is not set but PROXY_AUTH is set, the endpoint becomes ws://{PROXY_AUTH}@{ChromeWs}.ws://{ChromeWs}.normalize_request_items() converts the platform input into an internal task object:
The worker processes tasks sequentially and does not run collection jobs in parallel.
run() iterates over the normalized task list and calls process_item() for each entry.
The worker only retries recognizable network-related errors. Common retryable error markers include:
network is unreachablefailed to establish a new connectionmax retries exceededname or service not knowntemporary failure in name resolutionconnection refusedconnection timed outread timed outtimeout 30000ms exceededtimeout 180000ms exceededtarget page, context or browser has been closedbrowser closedwebsocketsocket hang upsearch mode automatically converts a keyword into a search URL.tag mode automatically converts a tag word into a tag page URL.rep_search.py includes captcha handling logic.targets.Explore more popular scrapers from our marketplace
by CoreClaw
Extract public TikTok post data via profile URLs, including engagement, viral trends and audio info. One-click CSV/JSON export, zero code required.
by CoreClaw
Extract public TikTok creator profile data using search URLs, including bio, follower counts, content performance and engagement metrics, without platform API limitations. Supports data export, API calls and third-party integrations.
by CoreClaw
Extract public TikTok video comment data in batches by entering video URLs, including comment content, user information, like counts, reply lists, etc., outputting in CSV or JSON format. Supports sentiment analysis and user insights with zero-code operation and one-click structured data export.
by CoreClaw
By entering URLs, batch extract public TikTok creator profile data, including bio, follower count, content performance, engagement metrics, and more, outputting in CSV or JSON format. Support user analysis and marketing decisions with zero-code operation and one-click export of structured data.