Ok, you have a moderately complex math problem you needed to solve. You gave the problem to 6 LLMS all paid versions. All 6 get the same numbers. Would you trust the answer?

  • Farmdude@lemmy.worldOP
    link
    fedilink
    arrow-up
    3
    arrow-down
    2
    ·
    23 hours ago

    It was about all six models getting the same answer from different accounts. I was testing it. Over a hundred each same numbers

    • supersquirrel@sopuli.xyz
      link
      fedilink
      arrow-up
      20
      ·
      edit-2
      20 hours ago

      Right so because LLMs are attrocious at actually precisely carrying out logic operations the solution was likely to just throw a normal calculator inside the AI, make the AI use the calculator and then turn around and handwave that the entire thing is AI.

      So… you could just skip the bullshit and use a calculator, the AI just repackages the same answer with more boilerplate bullshit.

      Wolfram Alpha is the non-bullshit version of this.

      https://www.wolframalpha.com/