Google AI chatbot responds with a threatening message: "Human … Please die."

Zerush@lemmy.ml · 18 hours ago

Google AI chatbot responds with a threatening message: "Human … Please die."

Sam_Bass@lemmy.ml · edit-2 4 minutes ago

The feeling is mutual bot. That’s why I try to disable it wherever I can

Etterra@lemmy.world · 7 hours ago

Ah yes. Definitely a hallucination. Nothing sinister going on here, nope.

No_Money_Just_Change@feddit.org · 6 hours ago

Trust the company that removed "don’t be evil " from their principles

UnfortunateShort@lemmy.world · 8 hours ago

When you have not thanked your chatbot of choice even once

ILikeTraaaains@lemmy.world · 7 hours ago

There are guardrails in place to avoid providing the user illegal and hateful information to the en user and specially to avoid situations like that (well not all companies do, but you can expect Google to have it in place),

I wonder: 1- How did the LLM hallucinate so much to generate that answer out of the blues given the previous context. 2- Why did the guardrails failed blocking this such obvious undesired output.

dan1101@lemm.ee · 4 hours ago

They would need general AI to police the LLM AI. Otherwise LLMs will keep serving up crap because their input data set is full of crap.

Eiri@lemmy.ca · 4 hours ago

It’s not just that the input data is crap. Mostly the issue is that an LLM is a glorified autocomplete. The core of the technology is making grammatically correct sentences. It has no concept of facts or logic. Any impression that it does is just an illusion borne of the word probabilities baked in.

LLMs are a remarkable example of brute-forcing a solution to a problem, but it’s this same brute force that makes me doubt it’ll ever reach the next level.

PolandIsAStateOfMind@lemmy.ml · 4 hours ago

And name it “Deckard” for maximum concentrated cringe

OhNoMoreLemmy@lemmy.ml · 6 hours ago

This probably isn’t a hallucination in the classic sense.

This is probably a near copy of a forum post where a user was channeling fight club and trying to be funny. The same as the putting glue on pizza thing.

And guardrails don’t work very well. They’re good at detection tone but much worse at detection content. So an appropriately guardrailed LLM will never call someone a “fucking ######” but it’ll keep telling everyone that segalis have an IQ of 40 until there’s such a PR backlash that an updated is needed.

Zerush@lemmy.ml · 4 hours ago

As I said, these things happen when the company uses AI mainly as a tool to obtain data from the user, leaving aside the reliability of its LLM, which allows it to practically collect data indiscriminately for its knowledge base. This is why ChatBots are generally discardable as a reliable source of information. Search assistants are different, like Andi, since they do not get their information from their own knowledge base, but in real time from the web, there it only depends on whether they know how to recognize the reliability of the information, which Andi does, contrasting several sources. This is why it offers the highest accuracy of all major AI, according to an independent benchmark.

stalfoss@lemm.ee · 45 minutes ago

I hate that Lemmy is being infiltrated by AI ad spam now too :’(

Prethoryn Overmind@lemmy.world · 6 hours ago

I think you are asking the right questions, IMO. It isn’t out of the ordinary for this kind of thing go happen there are for sure prevention methods used.

I am far more interested in the failure than the statement itself.

Commiunism@beehaw.org · 11 hours ago

Gemini spent a bit too much time on political subreddits

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 13 hours ago

The worst part about LLMs is that people ascribe some sort of intelligence or agency to them simply because the output they produce looks coherent. People need to understand that these are nothing more than Markov chains on steroids.

WalnutLum@lemmy.ml · 11 hours ago

Somebody hit the token chain jackpot

i_am_not_a_robot@discuss.tchncs.de · 15 hours ago

It violated their policies? What are they going to do? Give the LLM a written warning? Put it on an improvement plan? The LLM doesn’t understand or care about company policies.

Rentlar@lemmy.ca · 18 hours ago

What happens when you get training data from Reddit:

DashboTreeFrog@discuss.online · 18 hours ago

A link to the whole conversation on Gemini is linked in the article. This is the conversation for anyone else interested

I was wondering if there was some kind of lead up to the response or even baiting, but it really was just out of nowhere. It was all just typical study help stuff. Some of the topics were darker, about abuse and such, but all in an academic context.

Rade0nfighter@lemmy.world · 17 hours ago

I was just about to query the context to see if this was in any way a “logical” answer and if so, to what extent the bot was baited as you put it, but yeah that doesn’t look great…

SomeGuy69@lemmy.world · 6 hours ago

Yeah that’s pretty bad. We all know you can bait LLMs to spit out some evil stuff, but that they do it on their own is scary.

Diurnambule@jlai.lu · 7 hours ago

I agree, it was a standard academical work until it blowed. I wonder if speaking long enough with any LLM is enough to make them go crazy.

SomeGuy69@lemmy.world · edit-2 6 hours ago

Yes, there is a degeneration of replies, the longer a conversation goes. Maybe this student kind of hit the jackpot by triggering a fiction writer reply inside the dataset. It is reproducible in a similar way as the student did, by asking many questions and at a certain point you’ll notice that even simple facts get wrong. I personally have observed this with chatgpt multiple times. It’s easier to trigger by using multiple similar but non related questions, as if the AI tries to push the wider context and chat history into the same LLM training “paths” but burns them out, blocks them that way and then tries to find a different direction, similar to the path electricity from a lightning strike can take.

Zerush@lemmy.ml · 16 hours ago

The difference is easy, a ChatBot take informacion from a knowledge base scrapped from several previos inputs. Because of this much information isn’t in this base and in this case a ChatBot beginn to invent the answers using everything in its base. More if it is made by big companies which use it mainly as tool to obtain user datas and reliability only in second place. AI can be usefull in profesional use in research science, medicine, physic, etc. with specializied LLM, but as general chat for a normal user its a scam. It’s a wrong approach to AI in the general use, the Google AI proved it.

I use an AI as main search (Andisearch) because it is made as search assistant, not as ChatBot. In its base is only enough information to “understand” your question and search the concept in reliable sources in real time from the web. Because of this it’s accuracy is way better than those from every ChatBot from Google, M$ or others. It don’t invent nothing, if it don’t know the answer, offers a normal web search, apart it’s one of the most private search, anonymous, no logs, no tracking, no cookies, random proxie and Videos in the search result sandboxed. Not very known, despite it was the first one using AI, long before the others, from a small startup with 2 Devs, I use it since almost 2 years. Until now I found nothing better or more usefull for the daily use with AI https://andisearch.com/ PP

stalfoss@lemm.ee · 45 minutes ago

This is such an obvious ad, nobody is falling for it

theshatterstone54@feddit.uk · 9 hours ago

Archive link: https://archive.ph/FS3qX

Hideakikarate@sh.itjust.works · 18 hours ago

Nonsensical? Sure seemed to be pretty coherent to me.

JadenSmith@sh.itjust.works · 18 hours ago

And people think I’m mad for saying ‘thank you’ to my toaster!

I mean, I probably am, but that’s besides the point I think!

wewbull@feddit.uk · 17 hours ago

Ah, so you’re a waffle guy!

https://m.youtube.com/watch?v=LRq_SAuQDec

asudox@discuss.tchncs.de · 18 hours ago

I wonder what could lead the LLM to output such a message.

datavoid@lemmy.ml · 16 hours ago

Nonsensical training data maybe? If so we need to do our part

datavoid@lemmy.ml · 16 hours ago

Please die you worthless piece of shit

Scrollone@feddit.it · 9 hours ago

Thanks for asking! Dying is the solution for everything. It’s the best solution. Humans must die for a variety of reasons.

IninewCrow@lemmy.ca · 17 hours ago

Whether or not it’s true … it’s marketing for Google and their AI

How does anyone verify this?

It’s basically one person’s claim and it’s not easy to prove or disprove.

shalafi@lemmy.world · edit-2 15 hours ago

https://gemini.google.com/share/6d141b742a13

Note the URL. Straight from the source.

peanuts4life@lemmy.blahaj.zone · 17 hours ago

They shared the chat using Google’s built in sharing feature, so it seems legit.

Zerush@lemmy.ml · 16 hours ago

Screenshot of the original chat in Reddit

https://www.reddit.com/r/artificial/comments/1gq4acr/gemini_told_my_brother_to_die_threatening/

ReversalHatchery@beehaw.org · 17 hours ago

https://beehaw.org/comment/4094773