Hacker News
GaggiX
3 months ago
on:
Nepenthes is a tarpit to catch AI web crawlers
As always, I find it hilarious that some people believe that these companies will train their flagship model on uncurated data, and that text generated by a Markov chain will not be filtered out.
JTyQZSnP3cQGa8B
3 months ago
Then why the DDoS on random websites?
GaggiX
3 months ago
I guess that depends on how the web spider is configured. I doubt the curation is done in real time while scraping.