• jacksilver@lemmy.world
    2 months ago

    LLMs do suck at math. If you look into it, the o1 models actually escape the LLM output and write a Python function to calculate the result. I've been able to break their math functions by asking for functions that use math not in the standard Python library.
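
    A rough sketch of that kind of hand-off, to make the idea concrete. This is not how o1 actually does it; `evaluate` and the operator whitelists are hypothetical names made up for illustration. The assumption is that the model emits an arithmetic expression as text and the host computes it with nothing but Python's standard `math` module:

```python
import ast
import math
import operator

# Hypothetical whitelist of operators the "scratch paper" accepts.
_BIN_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
    ast.Mod: operator.mod,
}
_UNARY_OPS = {ast.USub: operator.neg, ast.UAdd: operator.pos}


def evaluate(expression: str):
    """Evaluate an arithmetic expression using only stdlib math functions."""

    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        # Plain numbers (bool is excluded even though it subclasses int).
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BIN_OPS:
            return _BIN_OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
            return _UNARY_OPS[type(node.op)](walk(node.operand))
        # Function calls are looked up in the standard math module only.
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            name = node.func.id
            func = None if name.startswith("_") else getattr(math, name, None)
            if callable(func) and not node.keywords:
                return func(*[walk(arg) for arg in node.args])
            raise ValueError(f"unsupported function: {name}")
        raise ValueError("unsupported expression")

    return walk(ast.parse(expression, mode="eval"))


print(evaluate("12345 * 6789"))  # 83810205
print(evaluate("sqrt(16) + 2**3"))
```

    Asking this sketch for something that is not in `math`, for example `evaluate("zeta(2)")`, raises a `ValueError`, which is the same kind of failure as the one described above.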

    I know someone also wrote a Wolfram integration to help LLMs solve math problems.

    • 0ops@lemm.ee
      2 months ago

      Wow, that's really clever actually. Basically using the library as digital scratch paper.