Nothing Special   »   [go: up one dir, main page]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NoML Proposal #31

Open
ColinHayhurst opened this issue May 24, 2024 · 0 comments
Open

NoML Proposal #31

ColinHayhurst opened this issue May 24, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@ColinHayhurst
Copy link

Since there are many companies scraping/crawling webpages in order to collect data, and often without identifying themselves, a way which addresses more than just search engine crawler bots, and using robots.txt is needed. The proposal here is to add a new ‘noml’ value to the already-existing meta and X-Robots tag.

This can be simply expressed for HTML pages using:

and for non-HTML using:
X-Robots-Tag: noml

Full details of the NoML proposal are given in this Open Letter, which so far as 5 signatories that offer search engines and/or proxies and/or AI search.

Obviously this is directly relevant to at least section 4.5.1: Comparison with search engines, so might be added as a bullet point in the orange box as follows:

  • NoML proposes an opt-out mechanism to supplement the existing robots.txt mechanism and thus address the new challenges faced by creators in the age of AI systems. The open letter is co-signed by individiuals and several relevant companies.
@ColinHayhurst ColinHayhurst added the enhancement New feature or request label May 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant