
Russian international media
With this tool, you can extract the following news data from the RT website (https://www.rt.com/):
If you need to extract data beyond the scope listed above, please contact us.
The script runs within the platform and pushes data via SDK. The platform supports exporting results in two structured data formats: JSON and CSV:
Yes. Each scraping task retrieves the latest publicly available news data from the RT website in real-time, without relying on any caching mechanism, ensuring full data timeliness.
Below is a sample display of the scraping results.
| Column Name | Description | Data Type |
|---|---|---|
| NewsUrl | Full link to the news article | Url |
| Author | Author name | Text |
| NewsLanguage | News language (e.g., en) | Text |
| NewsTitle | News title | Text |
| NewsImageUrl | News image link | Url |
| NewsPublished | Publication time (ISO 8601 format) | Text |
| NewsSourceType | News type (e.g., news / russia) | Text |
| NewsTextContent | Plain text content of the news (paragraph-separated) | Text |
| Parameter | Type | Required | Description | Default Value |
|---|---|---|---|---|
| website | Text | Yes | Target homepage URL (other RT mirror sites can be passed) | https://www.rt.com/ |
| pages | Text | No | Page range: 1-3 (pages 1 to 3), 3 (first 3 pages), 1,3,5 (specify) | "1" |
| limit | Integer | No | Scraping limit, only scrape the first N news items (0 or empty = unlimited) | 0 |
| workers | Integer | No | Concurrent scraping threads for news detail pages | 8 |