CoreClaw Ethical Data Collection Statement
Version: v2.0 | Last Updated: March 25, 2026
CoreClaw (hereinafter referred to as "we" or "the Platform") is committed to building a responsible ecosystem that connects developers with data seekers. We firmly believe that the value of open data should be realized under the premises of respecting privacy, complying with the law, and maintaining a healthy internet ecosystem.
This Ethical Data Collection Statement outlines our moral stance and compliance commitments regarding data scraping activities, and applies to all developers and users who run scripts through the CoreClaw platform.
1. Our Commitments and Principles
CoreClaw adheres to the highest ethical standards in conducting its business. We follow the five core principles regarding market and social research in international guidelines:
- Legality: All data collection activities conducted through the platform must comply with applicable laws and regulations.
- Transparency: Scraping behavior should remain transparent to the collected websites.
- Respect for Privacy: Personal information must be strictly protected.
- Fairness: Collection activities must not impose an undue burden on the target website.
- Accountability: We are committed to maintaining a compliant environment for the platform and require users to bear ultimate responsibility for their actions.
2. Core Ethical Guidelines for Data Collection
2.1 Respect for Website Rules
We require all users to comply with the rules of target websites:
- Compliance: Before scraping any website, users must check and respect the explicit instructions in the target website's robots.txt file. This is fundamental etiquette in web scraping and an important criterion for distinguishing ethical collection from abusive behavior.
- Terms of Service: Users have the responsibility to understand the target website's terms of service. Users should carefully evaluate the terms of service of the target website and independently bear the risk of breach of contract and legal consequences that may result from violating the terms.
2.2 Responsible Technical Practices
To prevent damage to target websites, user collection requests must be controlled within a reasonable frequency to avoid imposing excessive load on target servers. Specific requirements include:
- Setting reasonable request intervals (typically at least 1 second)
- Limiting the number of concurrent requests
- Complying with the access frequency limits of the target website
- Automatically adjusting frequency when the target website responds slowly
2.3 Privacy Protection and Data Minimization
In accordance with the requirements of privacy laws such as GDPR and CCPA:
- Strictly prohibit the collection of non-public personal information and highly sensitive information. Unless there is a clear legal basis (such as obtaining consent from the data subject), personally identifiable information must not be collected.
- Data Minimization Principle: Only collect data necessary to achieve a specific purpose, and do not over-collect. Before collection, the scope of required data should be clearly defined to avoid obtaining irrelevant sensitive information.
- Respect Non-public Content: Strictly prohibit the collection of content that requires logging in, bypassing paywalls, or otherwise circumventing access controls to obtain.
2.4 Respect for Intellectual Property
● Copyright Protection: Do not collect copyrighted content on a large scale (such as full text of articles, images, videos, etc.) unless it constitutes fair use or explicit authorization has been obtained. It is strictly forbidden to use collected data for any activities that infringe upon the intellectual property rights of third parties.
● Database Rights: Respect the database rights of target websites. Do not collect or copy the content of databases on a large scale unless it constitutes fair use or explicit authorization has been obtained.
2.5 Transparency and Fairness
Collected data should only be used for the legitimate purposes clearly identified before collection. Unauthorized use of data for other purposes is prohibited, especially for purposes that may harm the data subject or the data owner.
3. Platform Roles and User Responsibilities
3.1 Role of CoreClaw
CoreClaw, as a technical infrastructure provider, provides connectivity and execution environments for developers and data seekers. We do not actively control the scripts and their collection activities run by users on the platform, but we will take necessary measures in accordance with the law after receiving legitimate complaints from rights holders. We have a responsibility to:
- Clearly communicate expectations for ethical collection through this statement.
- Perform security scans on uploaded scripts to guard against malicious code.
- Take appropriate measures against behaviors that clearly violate this statement, such as restricting access, deleting scripts, or banning accounts.
- Cooperate with law enforcement agencies to investigate illegal activities.
- Regularly review the content of this statement and update it according to changes in laws and regulations, technological developments, and industry best practices.
3.2 Core Responsibilities of Users
Developers and data seekers using the CoreClaw platform must:
- Ensure Compliance Independently: You bear full legal responsibility for the collected data, collection methods, and data use.
- Obtain Necessary Rights: Ensure that your collection behavior does not infringe upon the rights of any third party, including but not limited to copyrights, database rights, and privacy rights.
- Retain Compliance Records: Logs and records of collection activities should be retained, including collection time, source, scope, and purpose, so as to prove compliance when necessary. The record retention period shall comply with the requirements of relevant laws and regulations.
- Respond Promptly to Rights Requests: If a data subject or website owner makes a legitimate request (such as stopping collection or deleting data), it must be handled properly and promptly.
- Data Deletion Obligation: If a data subject exercises the "right to be forgotten" or "right to erasure," the user should immediately delete relevant personal information. CoreClaw will cooperate in deleting relevant data stored on the platform after receiving a legitimate request.
- Indemnification Liability: If CoreClaw suffers any third-party claims, administrative penalties, or reputational damage due to the user's violation of this statement, the user shall bear full legal responsibility and compensate CoreClaw for all losses suffered as a result.
4. Special Reminder: Cross-border Legal Risks
Data collection activities may be subject to the laws of multiple jurisdictions. Please note:
- Different countries have different legal regulations for data collection.
- The legal requirements of the region where the target website is located should be understood before collection.
- Cross-border data transmission must comply with relevant privacy protection regulations (such as GDPR).
- It is recommended to consult professional legal advisors to ensure compliance.
5. Handling of Violations
CoreClaw takes violations of this statement seriously. If we find or receive a valid report indicating that a user is engaged in unethical data collection activities, we have the right to take the following measures without prior notice and without assuming any responsibility:
- Suspend or terminate access permissions of the violating account.
- Delete violating scripts and related content.
- Report suspected illegal activities to law enforcement agencies.
- Take other remedial measures permitted by law.
6. Reporting and Consultation
We encourage everyone to jointly maintain the cleanliness and compliance of the network. If you find any abuse, suspicious activities, or behaviors that may violate this statement, please contact us via:
Reporting Email: support@coreclaw.com
If you have questions about the compliance of a specific collection project, it is recommended to consult professional legal advisors.
7. Relationship of This Statement to Other Documents
This statement is a supplement to the "CoreClaw Terms of Service" and the "CoreClaw Acceptable Use Policy", further clarifying the platform's ethical stance on data collection activities. In case of conflict, the "CoreClaw Acceptable Use Policy" shall prevail.
8. Statement Revision
CoreClaw will regularly review the content of this statement and update it according to changes in laws and regulations, technological developments, and industry best practices. The updated statement will be published on the website and users will be notified through appropriate means.