OK, say you have a moderately complex math problem you need to solve. You give the problem to 6 LLMs, all paid versions. All 6 get the same numbers. Would you trust the answer?

  • supersquirrel@sopuli.xyz
    23 hours ago

    Why would I bother?

    Calculators exist, logic exists, so no… LLMs are a laughably bad fit for directly doing math. They are bullshit engines: they cannot “store” a value without exposing it to their hallucinating tendencies, which is the worst property a calculator could possibly have.

    • Farmdude@lemmy.worldOP
      23 hours ago

      The point was that all six models got the same answer from different accounts. I was testing it: over a hundred runs each, same numbers.

      • supersquirrel@sopuli.xyz
        20 hours ago

        Right, so because LLMs are atrocious at precisely carrying out logic operations, the solution was likely to just throw a normal calculator inside the AI, make the AI use the calculator, and then turn around and handwave that the entire thing is AI.

        So… you could just skip the bullshit and use a calculator. The AI just repackages the same answer with more boilerplate bullshit.

        Wolfram Alpha is the non-bullshit version of this.

        https://www.wolframalpha.com/
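
For anyone wondering what "a normal calculator inside the AI" could look like, here is a minimal sketch in Python. It is only a guess at the general pattern, not how any particular vendor wires it up: the function name `calculate`, the helper `_evaluate`, and the supported operator set are all made up for illustration. The idea is that the model would hand the expression to ordinary deterministic code instead of producing the digits itself.

```python
import ast
import operator

# Hypothetical tool: the names and the supported operator set below are
# assumptions for illustration, not any vendor's actual tool API.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def calculate(expression):
    """Deterministically evaluate a plain arithmetic expression string."""
    tree = ast.parse(expression, mode="eval")
    return _evaluate(tree.body)


def _evaluate(node):
    # Numeric literals are returned as-is (bool and str constants are rejected).
    if isinstance(node, ast.Constant) and type(node.value) in (int, float):
        return node.value
    # Binary operations such as a + b or a ** b.
    if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
        left = _evaluate(node.left)
        right = _evaluate(node.right)
        return _BINARY_OPS[type(node.op)](left, right)
    # Unary plus and minus.
    if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
        return _UNARY_OPS[type(node.op)](_evaluate(node.operand))
    # Anything else (names, function calls, attribute access) is refused.
    raise ValueError("unsupported expression")


print(calculate("2 + 3 * 4"))
```

If something like this sits behind the model, then identical numbers across many runs and accounts would be expected, since the arithmetic itself never passes through the text generator. Whether the six models in the test actually worked this way is not something the thread establishes.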