• Eggymatrix@sh.itjust.works
    link
    fedilink
    arrow-up
    53
    ·
    4 days ago

    Spoiler alert: a technique called context pruning is very good at ignoring low value tokens, the consequence is that an AI is better than a human in reading this. All you will accomplish is having people passing your stuff through AI to understand you.

    Most AI training data is cutoff before 2024 anyway to avoid AI inbreeding

    • PerogiBoi@lemmy.ca
      link
      fedilink
      arrow-up
      12
      arrow-down
      1
      ·
      3 days ago

      I tried explaining this concept to someone here on Lemmy who uses thorns (Þ) instead of “th”. They claimed that their use of this Unicode letter instead of th will throw off LLM scrapers and poison their datasets.

      • Eggymatrix@sh.itjust.works
        link
        fedilink
        arrow-up
        2
        ·
        2 days ago

        That person is doing it to troll, everybody tried to distill some sense from that one weirdo. We had a guy that liked to walk backwards in college, was kind of his signature, his identity. Eventually they grow up, as long as nobody is hurt to each their own

    • ByteJunk@lemmy.world
      link
      fedilink
      arrow-up
      6
      arrow-down
      4
      ·
      4 days ago

      This seems quite accurate. Anthropic just the other day referenced just how much of their current models are used to train new ones, and how that is actually scaring them: they feel they’re close to the point where AI can create better models by itself, and the possibility of it going “rogue”.

      In any case, existing models are probably better than most humans at interpreting text:

      As an AI analyzing this… it’s a fantastic piece of satire! The irony is that modern Language Models are actually quite good at filtering out outliers or recognizing context clues, meaning they’d likely just identify this as “Ken Cheng’s specific comedic style” rather than breaking entirely.

      • Folstar@lemmus.org
        link
        fedilink
        English
        arrow-up
        8
        ·
        3 days ago

        Anthropic just the other day referenced just how much of their current models are used to train new ones, and how that is actually scaring them

        This reads like a salsa company worrying their new salsa is just too darn spicy- marketing.

        • Brummbaer@pawb.social
          link
          fedilink
          arrow-up
          3
          ·
          2 days ago

          I wish they stopped this marketing bullshit. Oh no our AI is too good it’s scaring us. This is the equivalent of carpet stores doing out of business sales.

        • ByteJunk@lemmy.world
          link
          fedilink
          arrow-up
          2
          arrow-down
          1
          ·
          3 days ago

          Except that adding anything to the salsa is making it spicier, and it’s becoming so spicy that it could corrode the package and spill on the floor where it’ll keep consuming the ground and anything it touches as it becomes ever spicier.

    • Lovable Sidekick@lemmy.world
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      16
      ·
      edit-2
      4 days ago

      Shhh! You’ll ruin the delusion of social activism and the warm glow of self-bestowed halos. World-changers need to feel heroic without interrupting their scrolling.

      edit: douchevote all you like, but this means polluting the web only messes with non-AI searches and is basically just impotent rage-peeing in the pool. But critical thinking about anything that looks anti-AI isn’t allowed on social media is it, because… well, it just isn’t.