• Grimy@lemmy.world
    link
    fedilink
    English
    arrow-up
    115
    arrow-down
    1
    ·
    edit-2
    2 days ago

    The courts ruled you can’t copyright the direct outputs of AI. It’s literally one of the few things they have decided. It’s common practice to use it to create synthetic data for new models. Laughable that OpenAI even brings it up.

    In the space of a year or two, we went from altman saying he was going to capture half the world’s wealth straight to open source violently destroying his moat.

    • brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      55
      ·
      edit-2
      2 days ago

      Everyone in the open LLM community knew this was coming.

      We didn’t know the exact timing, but OpenAI is completely stagnant, and it was coming this year or the next.

      I don’t think the world still understands how screwed OpenAI is. It isn’t just that their moat is gone, it’s that, even with all that money, their models (for the size\investment) are objectively bad.

      • Dkarma@lemmy.world
        link
        fedilink
        English
        arrow-up
        25
        ·
        2 days ago

        Yeah it went from hey the monopoly justifies the cost. To Oh shit they did it for how much? Real fast.

        I suspect china is fudging the training timeline tho…

        • UnderpantsWeevil@lemmy.world
          link
          fedilink
          English
          arrow-up
          10
          ·
          2 days ago

          I suspect china is fudging the training timeline tho…

          I’m more prone to believe OpenAI is just a clunky POS. DeepSeek released a model that’s operating on theories kicking around the LLM community for years. Now Alibaba is claiming they’ve got a better model, too.

          Altman insisting he needed $1T in new physical infrastructure to get to the next iteration of his product should have been a red flag for everyone.

          They’re trying to brute force a solution to a problem that more elegate coding accomplishes better.

        • brucethemoose@lemmy.world
          link
          fedilink
          English
          arrow-up
          12
          ·
          edit-2
          2 days ago

          I had suspicious before, but I knew they were screwed when Qwen 2.5 came out. 32Bs and 72Bs nipping at their heels… O3 was a joke in comparison.

          And they probably aren’t fudging anything. Base Deepseek isn’t like crazy or anything, and the way they finetuned it to R1 is public. Researchers are trying to replicate it now.

        • brucethemoose@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          2 days ago

          Also, the thing the Chinese govt did probably do is give Deepseek training data.

          For all the memes about the NSA, the US govt isn’t really in that position, as whatever the US govt has pales in comparison to Microsoft or Google.