How would you design a test that only a human can pass, but a bot cannot?

Is This Lemmy Open?@lemmy.dbzer0.com · 1 year ago

How would you design a test that only a human can pass, but a bot cannot?

SirGolan@lemmy.sdf.org · 1 year ago

I was going to say you could give it a math problem that uses big numbers but tried one on GPT4 and it succeeded. GPT3 though will absolutely fail at nontrivial math every time.

cwagner@discuss.tchncs.de · 1 year ago

GPT 3.5 (what chatGPT was at the beginning) failed at non-trivial math ;) It couldn’t figure out how many characters even were in a word.

SirGolan@lemmy.sdf.org · 1 year ago

Yeah. It still definitely does! The interesting thing is that it seems to be very good at estimating and the final answer it gives is usually pretty close to correct in my experience. Of course close doesn’t really count in math problems.

cwagner@discuss.tchncs.de · 1 year ago

Just tested it, at least with number of characters in a word and splitting words, GPT4 does it flawlessly.