I feel that the best explanation for neural text degeneration is that language is a communication medium. When I talk, I'm transmitting bytes to you; the protocol we're using to transmit those bytes is English. Normal human language therefore contains entropy that you just can't get rid of (unless you're omniscient).
When you try to generate text with an LLM by maximizing likelihood (greedy or beam search), you're picking the most probable token at every step, which drives the per-token entropy toward zero and therefore produces a sentence that carries essentially no information. Hence the repetition and nonsensical output.
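To make the entropy point concrete, here's a toy sketch (the distribution is made up, not from any real model): greedy decoding effectively replaces the model's next-token distribution with a one-hot argmax, so each emitted token carries zero bits, whereas sampling transmits the distribution's full entropy.

```python
# Toy illustration: per-step entropy under sampling vs. greedy (argmax) decoding.
# The next-token distribution below is hypothetical, not from a real LM.
import numpy as np

def entropy_bits(p):
    """Shannon entropy in bits of a discrete distribution."""
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

# Hypothetical next-token distribution the model predicts at some step.
p_model = np.array([0.45, 0.25, 0.15, 0.10, 0.05])

# Sampling transmits information: each draw carries H(p) bits on average.
print(f"entropy if we sample:      {entropy_bits(p_model):.2f} bits")

# Greedy decoding always emits the argmax, i.e. it draws from a degenerate
# one-hot distribution -- zero bits per token, no information transmitted.
p_greedy = np.zeros_like(p_model)
p_greedy[np.argmax(p_model)] = 1.0
print(f"entropy if we take argmax: {entropy_bits(p_greedy):.2f} bits")
```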