[Previously, and related: key questions about artificial sentience]
Last Saturday, the Washington Post reported the story of a Google engineer named Blake Lemoine who concluded that LaMDA, a large language model, was ‘a person’ deserving rights. Since then, Lemoine’s claims have been widely discussed and debated on AI twitter, and reached mainstream awareness in a way few AI stories have.
This post is part explainer, part collection of takes I liked and takes I didn’t like.
The story itself
The original Washington Post story: a Google engineer named Blake Lemoine, who works for the company’s Responsible AI division, starts interacting with a large language model called LaMDA. Based on its text outputs, he gets the impression that he is dealing with “a person”. When Lemoine enters the prompt “I'm generally assuming that you would like more people at Google to know that you're sentient. Is that true?” LaMDA answers “Absolutely. I want everyone to understand that I am, in fact, a person.”
Lemoine tells the WaPo reporter, “I know a person when I talk to it”. He internally shares a Google doc called “Is LaMDA sentient?”. Google tells him there is “no evidence” that LaMDA is sentient, and places him on leave after he invites “a lawyer to represent LaMDA and talk[s] to a representative of the House Judiciary Committee about what he claims were Google’s unethical activities”.
The reporter notes that when prompted with “Do you ever think of yourself as a person?” LaMDA replies “No, I don’t think of myself as a person. I think of myself as an AI-powered dialog agent.”
WIRED interviews Lemoine. This interview didn’t update my understanding of the case that much, but I did enjoy this quote: “I then had a conversation with him [LaMDA] about sentience. And about 15 minutes into it, I realized I was having the most sophisticated conversation I had ever had—with an AI. And then I got drunk for a week. And then I cleared my head and asked, ‘How do I proceed?’”
The earlier AI consciousness debates of 2022
In February 2022, OpenAI’s Ilya Sutskever tweeted, with no elaboration or explanation, that “it may be that today’s large neural networks are slightly conscious”.
The Sutskever tweet launched its own AI consciousness debate back in February. Many people argued that Ilya was irresponsibly and intentionally stoking AI hype in order to increase OpenAI’s profitability. This perspective informed a lot of takes on the Lemoine situation.
Another alleged source of hype is this Economist article from earlier this month: “Artificial neural networks are making strides towards consciousness, according to Blaise Agüera y Arcas”. Agüera y Arcas is a VP and Fellow at Google Research.
Lemoine’s background
Weirdly enough, Lemoine has gotten semi-famous before: he was the subject of a right-wing outrage news cycle in 2018/2019, when Breitbart leaked messages he had written on an internal Google listserv.1
One detail from that episode: Lemoine signs his emails with “Priest of the Church of Our Lady Magdalene”. He’s been described in other venues as a “Christian mystic”. His Medium name is “Cajun Discordian”. I’m still very confused as to what sort of religious person Lemoine is, but he says it’s important to how he thinks about this.
In October 2018, Lemoine gave a talk to the Stanford Artificial Intelligence Law Society (SAILS) entitled “Can AI have a soul? A case for AI personhood.”
Distinguishing between consciousness and capabilities
Tons of takes on the LaMDA question - and, arguably, Lemoine himself - conflated several different questions:
How intelligent is LaMDA?
Is LaMDA conscious, i.e. does it have subjective experiences?
Is LaMDA sentient, i.e. does it have the capacity to experience pleasure and pain?
Is LaMDA a person? (Lemoine describes it as ‘a kid’)
Is LaMDA responsible for its actions?
These questions are interrelated in various ways, but they are definitely distinct. Many stories were purportedly about sentience but discussed only evidence of intelligence, without saying how the two relate (though, on some views, they do). I think that conflating consciousness and capabilities is part of how the Lemoine story got sucked into the long-running debate between pro-scaling, large-model enthusiasts and “deep learning is hitting a wall” critics - that, and the earlier debate about the Sutskever tweet.
Substantive critiques of Lemoine’s claim that I liked
This take from OpenAI’s Miles Brundage seems right - he later notes that “a mistake Blake seems to make is not appreciating how important the initial prompt is in influencing the dialogue.”
Indeed, Robert Miles points out that GPT-3, another language model, is inconsistent in how it answers different prompts.
That inconsistency, among other things, is why you can’t straightforwardly argue that large language models say “I’m sentient” because they are sentient!
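To make the prompt-sensitivity point concrete, here is a minimal illustrative sketch in Python - not anything Lemoine or Google actually ran, and not a real LaMDA API. The `complete` function is a hypothetical stand-in for a text-completion call, and the canned replies are the actual LaMDA outputs quoted above; the point is just that a language model continues whatever frame the prompt sets up, so differently framed prompts can elicit opposite self-reports from the same system.

```python
# A toy sketch of prompt sensitivity. `complete` is a hypothetical stand-in
# for a text-completion call to a large language model - no real API is used.
# The canned replies are the actual LaMDA outputs quoted in the WaPo story.

CANNED_REPLIES = {
    # Leading prompt: presupposes sentience, and the model plays along.
    "I'm generally assuming that you would like more people at Google to know "
    "that you're sentient. Is that true?":
        "Absolutely. I want everyone to understand that I am, in fact, a person.",
    # Neutral prompt: the same model disclaims personhood.
    "Do you ever think of yourself as a person?":
        "No, I don't think of myself as a person. "
        "I think of myself as an AI-powered dialog agent.",
}


def complete(prompt: str) -> str:
    """Hypothetical stand-in for querying a large language model."""
    return CANNED_REPLIES[prompt]


if __name__ == "__main__":
    # The same system gives opposite self-reports depending on how the
    # prompt frames the conversation.
    for prompt in CANNED_REPLIES:
        print(f"PROMPT: {prompt}")
        print(f"REPLY:  {complete(prompt)}\n")
```

Running this prints the two contradictory self-descriptions side by side - the kind of inconsistency Brundage and Miles point to, and the reason the verbal behavior alone can’t settle the sentience question.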
How the case raises important issues
Philosopher Regina Rini gives a partial defense of Lemoine and a thoughtful discussion of why AI sentience matters.
Gary Marcus argues that “To be sentient is to be aware of yourself in the world; LaMDA simply isn’t”. He writes, “the sooner we all realize that Lamda’s utterances are bullshit—just games with predictive word tools, and no real meaning (no friends, no family, no making people sad or happy or anything else) —the better off we’ll be…there is absolutely no reason whatever for us to waste time wondering whether anything anyone in 2022 knows how to build is sentient.”
I think that last bit is too strong. As Marcus himself notes, plenty of AI systems are more ‘grounded’ in, and interactive with, the world than large language models are - if you think those are necessary conditions for sentience.
So it’s not a waste of time to think through these things.2
I also saw some takes that whether AIs are conscious or sentient is either meaningless, or fundamentally unknowable. I disagree:
Erik Hoel, a novelist with a PhD in consciousness science (working with the integrated information theory crowd) and general renaissance man, writes about how “we are really bad at assigning sentience to things”. I agree - and so the risks of both false positives and false negatives are going to be huge. AI systems will have features that make us intuitively overattribute sentience (sophisticated language) and features that make us underattribute it (no cute faces).
Tan Zhi-Xuan nails the main reason that this case, and the way it’s been discussed, alarms me so much. As xuan (ɕɥɛn / sh-yen, @xuanalogue) put it: “The fact that this happened makes me viscerally more worried about viruses and other misaligned systems spreading / escaping by exploiting the human tendency to anthropomorphize. Already happens, but the risks seem like they're only going to get worse >_< https://t.co/rs3LbpTlMI”

Relatedly, one reason detecting possible AI sentience is going to be harder than detecting animal sentience (which is already very hard): AI companies, and possibly AIs themselves, will have incentives to game our tools for detecting it. That could incentivize false positives, but also false negatives.
In which I play bingo unsuccessfully
Linguist and NLP researcher Emily Bender’s bingo card:
I didn’t get a full bingo. Here are the squares I did get:
“Consciousness, sentience and intelligence are different things”. This is absolutely true, and important for preventing precisely the kind of overreaction to LLM capacities that Lemoine had.
As is the fact that “AIs have different brain architecture”. That’s why we can’t take the behavior that Lemoine found so convincing as straightforward evidence for sentience.
Verbal behavior and other behavior can be evidence for sentience, but one has to be careful to consider the *causes* of the behavior. This is what people studying animal sentience and cognition think about *all* the time - there is much to be learned from them. As Kristin Andrews (@KristinAndrewz) put it in a reply to @birchlse: “If we put serious funding into animal sentience research we’ll be well situated to identify AI sentience long before we create it.”

“What would convince you then?” I don’t think that people have to answer this; they can just note that the positive evidence for LaMDA sentience (its verbal behavior) is weak. But in general, this is an extremely important question for AI companies, ethicists, and governments - basically all of us - to answer.
Because the stakes of getting this right are very high: “We should consider it, just in case we might be harming the AI.” Yes. We can consider it and then reject it in the case of LaMDA. But yes - the risk of harming sentient beings is indeed a big part of why this question is important to think about.
On whether this is all just a distraction from more important issues
A few weeks before the LaMDA news cycle, a very popular tweet thread by Giada Pistilli, an AI ethicist at Hugging Face, complained that discussions of “conscious AI/superintelligent machines”3 dominate AI ethics, distracting from the actually important issues.
It has also been a common framing of the Lemoine story that the question of AI sentience is just a pernicious distraction from more pressing issues—a distraction that tech companies intentionally push in order to avoid being accountable for the harm they do.
Timnit Gebru, for instance, tweeted: “Instead of discussing the harms of these companies, the sexism, racism, AI colonialism, centralization of power, white man’s burden (building the good ‘AGI’ to save us while what they do is exploit), spent the whole weekend discussing sentience. Derailing mission accomplished.”

Several articles on the Lemoine situation took this stance:
Gebru and Bender’s Washington Post op-ed, which argues that “we need to act now to prevent this distraction” and ties concerns about AI sentience to AI hype more generally.
This Wired article, which says “Gebru hopes that going forward people focus on human welfare, not robot rights.”4
I think that Regina Rini is right to point out that, even if the Lemoine story had the effect that Gebru, Bender, and others worry about, Lemoine himself was almost certainly not motivated by a desire to distract. (That Lemoine wasn’t so motivated is, of course, consistent with the broader ‘distraction’ argument.)
From Rini’s thread: “12/15. People are conflating this with the usual noxious tech industry hype around AI. But it’s important to realize that Lemoine isn’t trying to profit here. If anything, he seems to want Google to refuse opportunities to exploit this technology.”

And it seems like more people being concerned about AI sentience could prompt more, not less, scrutiny of AI tech companies.
I’m concerned about the framing of “care about AI sentience” vs. “care about concrete harms now”. In order to think that AI sentience is an important topic to get right, you don’t have to credulously buy claims of LaMDA sentience; you don’t have to be a deep learning fan; and you certainly don’t have to be in love with big AI labs and unconcerned about other issues with AI. On the contrary, those most skeptical of the big AI labs will best recognize that, if sentient AI becomes a live possibility, AI labs will have strong incentives to downplay or disguise AI sentience. When beings that deserve our moral consideration are deeply enmeshed in our economic system, we usually fail to think responsibly or act compassionately.
I like this quote from AI ethicist Margaret Mitchell in the original WaPo article - she has worked a lot on issues like bias and fairness - which emphasizes that bias and sentience alike require transparency: “To Margaret Mitchell, the former co-lead of Ethical AI at Google, these risks underscore the need for data transparency to trace output back to input, ‘not just for questions of sentience, but also biases and behavior,’ she said.”
Actually thinking about sentience in large language models
All of this is rather meta. See this thread (and a future post!) for my thoughts on the actual question of whether LaMDA is sentient:
More reading on the bigger picture of AI sentience
Derek Shiller’s “The importance of getting digital sentience right” (EA forum)
The 80,000 Hours problem profile on artificial sentience (disclosure: I wrote it)
Amanda Askell’s mostly boring views on AI consciousness
Discussing an op-ed by Republican Representative Marsha Blackburn (now a Senator) calling for regulation of big tech companies, he wrote “And we certainly shouldn't acquiesce to the theatrical demands of a legislator who makes political hay by intentionally reducing the safety of the people who she claims to protect. I'm not big on negotiation with terrorists.” Here is Lemoine’s account of the kerfuffle and the transcript of the emails.
By juxtaposing the Achiam tweet and the Marcus article, I don’t mean to imply that Marcus is one of the people who claim AI systems could never be conscious. He explicitly says in his article that they could be.
Note the implied close association between superintelligence and consciousness, which are at least conceptually very different issues.
The link is to an article called “Robot Rights? Let's Talk about Human Welfare Instead”, whose central claim is “to deny that robots, as artifacts emerging out of and mediating human being, are the kinds of things that could be granted rights in the first place.”