misk@sopuli.xyz to Technology@lemmy.worldEnglish · 2 months agoApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.comexternal-linkmessage-square104fedilinkarrow-up1492arrow-down117cross-posted to: apple_enthusiast@lemmy.world
arrow-up1475arrow-down1external-linkApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.commisk@sopuli.xyz to Technology@lemmy.worldEnglish · 2 months agomessage-square104fedilinkcross-posted to: apple_enthusiast@lemmy.world
minus-squareHeyListenWatchOut@lemmy.worldlinkfedilinkEnglisharrow-up10·2 months ago …a spellchecker on steroids. Ask literally any of the LLM chat bots out there still using any headless GPT instances from 2023 how many Rs there are in “strawberry,” and enjoy. 🍓
minus-squareSemperverus@lemmy.worldlinkfedilinkEnglisharrow-up10arrow-down1·2 months agoThis problem is due to the fact that the AI isnt using english words internally, it’s tokenizing. There are no Rs in {35006}.
minus-squareSterile_Technique@lemmy.worldlinkfedilinkEnglisharrow-up4·2 months agoThat was both hilarious and painful. And I don’t mean to always hate on it - the tech is useful in some contexts, I just can’t stand that we call it ‘intelligence’.
minus-squarePieisawesome@lemmy.worldlinkfedilinkEnglisharrow-up3·2 months agoLLMs don’t see words, they see tokens. They were always just guessing
Ask literally any of the LLM chat bots out there still using any headless GPT instances from 2023 how many Rs there are in “strawberry,” and enjoy. 🍓
This problem is due to the fact that the AI isnt using english words internally, it’s tokenizing. There are no Rs in {35006}.
That was both hilarious and painful.
And I don’t mean to always hate on it - the tech is useful in some contexts, I just can’t stand that we call it ‘intelligence’.
LLMs don’t see words, they see tokens. They were always just guessing