Anubis is awesome! Stopping (AI)crawlbots

zoey@lemmy.librebun.com · edit-2 2 months ago

Anubis is awesome! Stopping (AI)crawlbots

e0qdk@reddthat.com · 2 months ago

I don’t like Anubis because it requires me to enable JS – making me less secure. reddthat started using go-away recently as an alternative that doesn’t require JS when we were getting hammered by scrapers.

BakedCatboy@lemmy.ml · 2 months ago

Fwiw Anubis is adding a nojs meta refresh challenge that if it doesn’t have issues will soon be the new default challenge

dan@upvote.au · 2 months ago

Won’t the bots just switch to using that instead of the heavier JS challenge?

Sekoia@lemmy.blahaj.zone · 2 months ago

They can, but it’s not trivial. The challenge uses a bunch of modern browser features that these scrapers don’t use, regarding metadata and compression and a few other things. Things that are annoying to implement and not worth the effort. Check the recent discussion on lobste.rs if you’re interested in the exact details.

baod_rate@programming.dev · 2 months ago

Check the recent discussion on lobste.rs if you’re interested in the exact details.

For those coming from the future: https://lobste.rs/s/aa7ske/anubis_now_supports_non_js_challenges

yetAnotherUser@discuss.tchncs.de · 2 months ago

Plus even if they were to implement those features, the challenges would still get increasingly harder the more bot-like a scraper behaves.

You can’t prevent scraping entirely but you can certainly prevent scraping that behaves like a DOS attack.

Jumuta@sh.itjust.works · 2 months ago

iirc there’s instructions on completing the anubis challenge manually

Anubis is awesome! Stopping (AI)crawlbots

Anubis is awesome! Stopping (AI)crawlbots

Incoherent rant.

Behold, Anubis.

“Weighs the soul of incoming HTTP requests to stop AI crawlers”