**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 10:47

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 10:47

jonny (good kind) @jonny@neuromatch.social

May 07, 2025, 10:47

jonny (good kind) @jonny@neuromatch.social

this is maybe the best issue title i have ever received

7c91295febe2effe.png

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 11:20

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 11:20

May 07, 2025, 11:20

jonny (good kind) @jonny@neuromatch.social

we will have to alias all these into a module shrimptools.exe and then make it callable where calling it just executes a random one because i think the world needs more code in it that looks like shrimptools.exe()

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 11:26

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 11:26

May 07, 2025, 11:26

jonny (good kind) @jonny@neuromatch.social

i wonder if the LLMs are susceptible to old style language model attacks. i wonder if you created enough training instances of a very unique phrase like shrimptools.exe() in the context of a bunch of example code based on tools/key phrases that are individually common but combinatorically rare within a popular LLM code generation domain like web tech, you could get the llms to occasionally try to import and execute shrimptools.exe(). so that way you make a sleeper vuln that acts as a mine in the latent space: one day the odds are not zero that you will wake up and have already executed shrimptools.exe()

**Sven Slootweg (soft-deprecated)** @joepie91@pixie.town · 2025-05-07T11:28:14Z

Sven Slootweg (soft-deprecated) @joepie91@pixie.town

@jonny If I recall correctly, some infosec folks have already successfully demonstrated such an attack on LLMs (this is distinct from the "register packages with commonly-LLM-fabricated names" attack)

May 07, 2025, 11:28 · · Web · · ·

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 11:29

**jonny (good kind)** @jonny@neuromatch.social · May 07, 2025, 11:29

May 07, 2025, 11:29

jonny (good kind) @jonny@neuromatch.social

@joepie91 see that's the kind of "it must necessarily be the case based on their nature but it is so obvious and funny that it can't be real" vuln i love to see

**Sven Slootweg (soft-deprecated)** @joepie91@pixie.town · May 07, 2025, 11:30

**Sven Slootweg (soft-deprecated)** @joepie91@pixie.town · May 07, 2025, 11:30

May 07, 2025, 11:30

Sven Slootweg (soft-deprecated) @joepie91@pixie.town

@jonny "Surely they would've thought of this? Right? RIGHT?"

(This is the theme song that plays in my mind half the time I'm doing code auditing for work)

Resources

Developers

What is Mastodon?

pixie.town

More…