@internetarchive is there a way to verify whether a crawler with an ArchiveTeam user-agent is actually operating on your behalf?
I am currently using the method described in this GitHub discussion (https://github.com/internetarchive/heritrix3/discussions/507) to detect and ban scrapers that spoof Googlebot and Bingbot UA strings, but it doesn't seem to work for some bots that have crawled my site(s) today.
I would like to allow the Internet Archive to preserve copies of my pages, but without a method to validate their authenticity this will leave a hole that AI scrapers can abuse.
[EDIT]: I realized after asking this that the Internet Archive was experiencing a major outage, so bad timing on my part. I ended up finding some relevant information on my own, which I've added to the linked github discussion: https://github.com/internetarchive/heritrix3/discussions/507#discussioncomment-15059504
rice discourse, hot takes about rice (jokes)
fake rice fans chatting shit like "rice should only be plain" and "the only flavor is from your topping" You absolute fools. You would crumble under biriyani. Tahdig. Arroz rojo. Nasi goreng. Jollof👊 plain rice is refreshing but all rice dishes are beautiful
accepted holocaust revisionism
The current and accepted liberal historical narrative is that Jews are at fault for being vulnerable in diaspora and deserved the holocaust for not having a nation-state, for being politically left-wing, etc. Back-projecting neo-zionism.
Do I have to state the obvious issues with this?
Funded, thank you so much everyone! Take care
Hey everyone,
I'm sorry, to make it this month we need to raise ~350€ please.
I'm expected to spend around 120 on gastric and other meds + bandages urgently needed. I can't eat without these meds.
Also need help with survival: power, cat supplies more expensive for Clyde, train to go help my grandma who's alone and disabled. I need to see a doc too.
Any help is appreciated if you can please. Thank you!
EMERGENCY, BLACK FAMILY NEEDS YOUR HELP🚨🚨🚨🚨🚨
$936/$1,500
CLOSE TO THE GOAL!!!
My Dad recently lost his job and we're hit with bills and a car note. We do not have enough to pay for all of it, so we only need around $1,500 ASAP!!! Please!
#mutualaid #emergency #blackmastodon #mutual_aid #fundraiser @mutualaid #crowdfund #kofi #actuallyautistic #artistsonmastodon #crowdfunding #blackcrowdfund #kofigoal #blackartist #BlackFedi #blackfediverse #pleaseboost
@actuallyautistic @blackfedi @blackmastodon@a.gup.pe @BlackMastodon@chirp.social
While cleaning a storage room, our staff found this tape containing #UNIX v4 from Bell Labs, circa 1973
Apparently no other complete copies are known to exist: https://gunkies.org/wiki/UNIX_Fourth_Edition
We have arranged to deliver it to the Computer History Museum
Thufie
BLM
~
~ ![]()
languages: en:✔️ he:~ es:~ ru:~
Reluctant moderator on social.pixie.town
Most online member of the system
#yesbot #nobot #noarchive I'm in my 20s, as a Computer Science researcher (Not in "AI" 🙄). Also a YouTuber now apparently, making YouTube Poops.
.אין דין ואין דיין. שלום בעולם
I'm just a disoriented white girl trying her best.
![]()
Relationship Anarchist![]()
programming languages?
C++ C MIPS x86 Java Python and a few others :P