Blocklist scraping by fash 

So this has been an ongoing issue, would love it if people found the earlier threads about it for more context cause I don't have the spoons right now

Originally written by "mint", hosted on the kiwifarms git is a tool that continuously scrapes publicized instance blocklists to allow searching who has you blocked (resulting in emails like uwu we did nothing wrong how dare you block our instance)

Through correlation, turns out the main IP being used by fba.ryona.agency is `54.37.233.246`. Blocking that at the firewall level prevents them from getting any new data.

Other instances exist too though, being hosted on
`23.24.204.110`, `45.86.70.49`, `88.65.6.124`, `187.190.192.31`

the drow.be / bka.li / teleyal.blog / mooneyed.de "kromonos" user has their own version, that feeds an API that gives your instance a highscore for blocking their shit, scrapes from `185.244.192.119`, with user agents presenting as random instances

These, and other scrapish ip's are also listed in git.pixie.town/f0x/nixos/src/b

Blocklist scraping by fash 

@f0x@social.pixie.town disregard the question, looks like the nginx worked. I also located a new one so figured I'd share:

209.141.56.3 - - "GET /.well-known/nodeinfo HTTP/2.0" 200 213 "-" "FediList agent (
https://fedilist.com/)" "-"

Follow

re: Blocklist scraping by fash 

@leni oh yeah, FediList (used to?) scrape over tor, so that's caught by a user-agent block instead git.pixie.town/f0x/nixos/src/b

re: Blocklist scraping by fash 

@f0x@social.pixie.town looks like they still are, I just caught it again with a different IP :ablobrollingeyes: thanks again for sharing all this!

Sign in to participate in the conversation
Pixietown

Small server part of the pixie.town infrastructure. Registration is closed.