slop rule

als@lemmy.blahaj.zone · 1 month ago

slop rule

1rre@discuss.tchncs.de · 1 month ago

Why do you think I said "thinking"/planning instead of just calling it thinking…

The “thinking” stage is actually just planning, so that it can list out the facts and then try to find inconsistencies, patterns, solutions, etc. I think planning is a perfectly reasonable thing to call it, as it matches the distinction between planning and execution in other algorithms like navigation.

AliasAKA@lemmy.world · 1 month ago

“Thinking” is just an arbitrary process to generate additional prompt tokens. By now they’ve realized, from their training data, that people suck at writing prompts, and it’s clear their models lack causal or state models of anything. They’re simply good at word substitution into a context that is similar enough to the prompt they’re given. So a solution to sucky prompt writing, and a way to sell people on its capacity (think full self-driving: it’s never been full self-driving, but it’s marketed that way to make people think it is super capable), is to simply have the model itself look up better templates within its training data that tend to result in better-looking and better-sounding answers.

The thinking is not thinking. It’s fancier probabilistic lookup.

TonyTonyChopper@mander.xyz · 1 month ago

nope