How would you design a test that only a human can pass, but a bot cannot?

Is This Lemmy Open?@lemmy.dbzer0.com · 2 years ago

cwagner@discuss.tchncs.de · 2 years ago

GPT-3.5 (what ChatGPT ran on at the beginning) failed at non-trivial math ;) It couldn’t even figure out how many characters were in a word.

SirGolan@lemmy.sdf.org · 2 years ago

Yeah, it definitely still does! The interesting thing is that it seems to be very good at estimating: in my experience, the final answer it gives is usually pretty close to correct. Of course, close doesn’t really count in math problems.

cwagner@discuss.tchncs.de · 2 years ago

Just tested it: at least for counting the characters in a word and splitting words, GPT-4 does it flawlessly.
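
For anyone who wants to repeat that kind of check, the ground truth is trivial to compute in code. Below is a minimal sketch in Python, not anything used in the thread: the helper names `char_count`, `split_chars`, and `grade` are hypothetical, and "characters" is assumed to mean Unicode code points. The grader accepts only an exact count, since close doesn't count here.

```python
def char_count(word: str) -> int:
    """Return the number of characters (Unicode code points) in the word."""
    return len(word)


def split_chars(word: str) -> list[str]:
    """Split a word into a list of its individual characters."""
    return list(word)


def grade(word: str, claimed_count: int) -> bool:
    """Accept a claimed character count only if it is exactly right."""
    return claimed_count == char_count(word)


if __name__ == "__main__":
    word = "strawberry"  # example word, chosen arbitrarily
    print(char_count(word))
    print(split_chars(word))
    print(grade(word, 9))  # an estimate that is off by one is rejected
```

Paste the model's answer into `grade` as `claimed_count` to see whether it was exactly right rather than just close.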