I’ve spent a few hours over the past week doing some preliminary hacking on a global search for Mastodon. With <2.5M toots/day from <12k instances, this is not a Twitter-scale problem.

I’m thinking that near real-time discovery (e.g. searching and finding people talking about an earthquake) is more important than being able to search all of history, so I’m starting off with an index that goes back 7 days and aiming to have 90% of toots in the index within 10 minutes of their creation.

@doe I think I can make some guesses as to why, but I would love to hear more from you if you’re open to sharing.

@angilly well it's just that, search doesn't work like that on here on purpose. people don't want all their posts scraped without specifically consenting to it

@doe @angilly And to expand on this: full-text indexing of the fediverse is a harassment vector, and that's a big part of *why* people do not want it, and why it deliberately does not exist.
