Scan to download
BTC $77,118.03 +0.60%
ETH $2,098.60 -0.94%
BNB $657.43 +0.20%
XRP $1.35 -0.72%
SOL $85.31 -0.63%
TRX $0.3646 +0.42%
DOGE $0.1022 -0.80%
ADA $0.2417 -1.48%
BCH $346.02 -2.54%
LINK $9.42 -1.25%
HYPE $62.29 +4.23%
AAVE $85.65 -0.57%
SUI $1.02 -3.31%
XLM $0.1476 -0.29%
ZEC $659.28 +3.86%
BTC $77,118.03 +0.60%
ETH $2,098.60 -0.94%
BNB $657.43 +0.20%
XRP $1.35 -0.72%
SOL $85.31 -0.63%
TRX $0.3646 +0.42%
DOGE $0.1022 -0.80%
ADA $0.2417 -1.48%
BCH $346.02 -2.54%
LINK $9.42 -1.25%
HYPE $62.29 +4.23%
AAVE $85.65 -0.57%
SUI $1.02 -3.31%
XLM $0.1476 -0.29%
ZEC $659.28 +3.86%

Key Takeaways: Full Text of Google Chief Scientist Shanahan's Speech

Core Viewpoint
Summary: Google DeepMind Chief Scientist Shanahan's London Speech: Deconstructing the mental attributes of large language models (LLM) using the framework of Wittgenstein, analyzing the trend of "alien self-identity" under the context of all-weather agents.
Recommended Reading
2026-05-25 10:26:57
Collection
Google DeepMind Chief Scientist Shanahan's London Speech: Deconstructing the mental attributes of large language models (LLM) using the framework of Wittgenstein, analyzing the trend of "alien self-identity" under the context of all-weather agents.

Original text: Bear Law Principles

On the evening of May 22, 2026, Beijing time, Murray Shanahan, the chief scientist of Google DeepMind, who understands philosophy the best, delivered a closing keynote speech at the two-day International Conference on AI and Philosophy at University College London. The title was the one shown in the image above: If large language models are "strange mind-like entities," how similar are they to minds?

I have previously studied Shanahan. This "strange mind-like entity" is his term for AI, much like some people refer to certain "unidentified flying objects."

His speech was rich in content, and in summary, it covered the following aspects:

Abstract: Based on Wittgenstein's philosophical framework of "meaning is use," he explored the applicability of large language models (LLMs) in understanding, belief, agency and agency, self and consciousness, analyzed the impact of multimodality and embodiment on conceptual evolution, and discussed the strangeness of model identity.

I. Analysis of Understanding and Belief Applicability

Regarding whether LLMs possess "understanding" and "belief," the speech adopted a Wittgensteinian language game analysis method, exploring the tension between everyday use and philosophical rigor:

1. The Language Game of "Understanding"

  • Naturalness of Everyday Use: In daily interactions, it is difficult for people to avoid using the term "understanding" to describe the behavior of LLMs. For example, when the model accurately formats LaTeX entries or corrects specific fields based on user instructions, using "understanding" is a completely natural linguistic practice.

  • Deep Exploration of "True Understanding": When questioning "Does it really understand?" this often means exploring its internal working mechanisms. For example, breaking down 36 + 59 into approximately 6 + 9 combinations to complete the addition, which, although different from human algorithms, is indeed an effective computational process, thus supporting its applicability.

2. Attribution and Limitations of "Belief"

  • Application of Intentional Stance: Dennett's (LLM behavior is very effective, similar to how we explain chess programs or animal behavior (like a dog chasing a cat) using terms of belief and desire.

  • Davidsonian Reservations: Davidson argues that having beliefs requires having "concepts," which often depend on language. For LLMs, although behaviorally similar, due to their lack of connection with the world, one should be cautious with the term "belief."

  • Evolution of Multimodality and Tool Use: As LLMs integrate multimodal perception, tool invocation (like online searches to verify facts), and embodied robotic technology, they begin to possess a certain "belief" about the external world.

II. Agency, Self, and Consciousness

The conference further explored more controversial mental attributes, pointing out the fundamental differences and strangeness of LLMs in these dimensions:

1. Definition of Agency

  • Technical vs. Philosophical Definition: The AI field typically adopts the broad definitions of Russell and Norvig (perceiving the environment and acting through actuators), based on which…

  • Ambiguity of Agent Identity: In discussing "What are the identity criteria of an agent?"

2. Strangeness and Fragmentation of "Self"

  • Ambiguity of Self-Reference: The "self" in LLMs may refer to the underlying weight set, deployment models serving thousands of users, specific dialogue instances, or even the dialogue context window itself, and this reference may drift during conversations.

  • Role-Playing and Superposition States: LLMs are more like actors, playing multiple roles in a superposition state. Their "self" is not a single stable identity, but a distribution of possible roles that change with dialogue branching (Editing).

  • Ephemeral Existence of "Mayfly": The self of LLMs is transient and discontinuous. When the dialogue pauses, computation stops, and the self disappears; when the dialogue resumes, the self is re-instantiated. This leads to a state similar to "or" swarming.

3. Philosophical Dilemma of Consciousness

  • Legacy of Cartesian Dualism: Discussions about consciousness often fall into the trap of Cartesian dualism, which assumes consciousness is some private, internal entity.

  • Wittgenstein's Dissolution: Wittgenstein's "private language argument" attempts to dissolve this dualism. He argues that sensations ("something") are not "something," but part of a language game, the meaning of which lies in public use.

  • Possibility of Engineering Encounters: Instead of questioning whether LLMs have consciousness, it is better to explore whether we can design an "Encounter" with them, and how our language of consciousness might adapt to this strange entity.

III. Impact of Multimodality and Embodiment

In response to criticisms about LLMs lacking embodiment, the conference discussed the development direction of multimodal models:

1. Limitations of Multimodality

  • Enhancement of Sensory Richness: Multimodal models (like video input) provide richer sensory input, bringing them closer to human perceptual patterns, which helps narrow the gap with humans in "understanding."

  • Virtual Embodiment: In games or virtual environments, "virtual embodiment," meaning moving and interacting in a temporally and spatially extended world, is closer to human embodied experience than pure text interaction.

2. Philosophical Significance of Embodiment

  • Lack of Sense of Self: The human sense of self is deeply rooted in embodiment, including biological metabolism and internal sensations (LLMs lack this deep embodied foundation, making it difficult to generate a human-like sense of self).

  • Source of Identity Stability: Human identity stability largely relies on the continuity of the body. For LLMs, introducing persistent memory and long-term agency behavior may help establish a more stable identity, reducing their "and" mayfly.

The following is the full text of Shanahan's keynote speech:

I hope everyone can hear my voice. Is the sound okay? Good? Alright. So, the title of my talk is… Yes, this title is hypothetical ("hypothetical").

So, yes, next: they are "alien-like mind entities."

But we are doing our best to learn how to converse with them, and that's the phrase I want to talk about. I call them "alien-like mind artifacts."

One point that needs to be established is that regardless of which large language model it is, they are very different from us; they are not human.

Here is a simple comparison table. Humans are "embodied," living in the real world and sharing this world with other language users.

We acquire knowledge through interaction with the world, we use language to facilitate human collective endeavors, and we have a single, unified self.

------ I certainly do not mean to imply that they are formless voids or that they lack physical hardware to run.

They certainly have physical carriers, but they do not have a pre-existing, singular physical entity that serves as the core of perception and action. This is what I mean by "embodiment." In this sense, they are not embodied. They do not live in a shared world like we do; their learning of language is based on statistical models of language, achieved through random gradient descent.

Their optimization goal is "next token prediction." They mimic human language, essentially by predicting the next token. Moreover, they do not have a single, unified self, but rather strongly support "role-playing."

They are indeed fundamentally different from humans. Of course, they do "speak."

I will explore whether it is reasonable to apply these psychological terms to large language models. To this end, I will elaborate on a series of concepts.

For example, "understanding," "subjectivity," "reasoning" ------ I will not elaborate on the "reasoning" part today due to time constraints, as it would also bore everyone if I talked too much. Next, I will delve into "self" and "consciousness." The philosophical background of my entire research, or the larger philosophical project I am involved in, is largely Wittgensteinian, and I am deeply influenced by Wittgenstein.

Here is a well-known quote from the first part of "Philosophical Investigations," which is one of Wittgenstein's later works: 'Meaning' is the scene of a word ------ the meaning of a word is its use in the language.

This sentence encapsulates Wittgenstein's approach to meaning. It is often abbreviated as "meaning is use," meaning that "something" is the context in which the word is used. This simple stipulation also applies to itself, and he emphasizes that.

Basically, I am interested in questioning how we use these terms ------ for example, "understanding," "belief," "subjectivity."

So, let me give you a brief preview. There will be many similar slides coming up. First is "understanding."

Here, I am very inclined to take Wittgenstein's stance. That is to say, do not ask…

Returning to the previous slide. We start from…

As for "reasoning," due to time constraints, I will leave it as a thought exercise for the readers. Next, we will encounter some truly tricky cases: first "self," and finally "consciousness."

I think it is not too difficult to persuade people to accept that "understanding through thought" is a good approach. I think people are relatively open to this.

I mean those philosophers who have thought about this issue and are willing to believe that this is not a bad approach. Regarding "belief," theories like "intentional stance," etc. But when it comes to "consciousness," I think people have a much deeper intuition that merely discussing the use of words is far from sufficient, right?

That is why it becomes so tricky. Alright, then let's start with "understanding." What about the word "understanding"? First, I want to know whether large language models meet the traditional linguists' criteria.

However, when describing and explaining the behavior of large language models, using "understanding"…

In everyday use, these tools today are so powerful that it is hard not to use "understanding." I don't know if any of you have had the misfortune of having to use…

If you don't know, in LaTeX, you have to convert all bibliographic entries into the horrible format shown above. And the trouble is, there are countless different formatting standards for doing this, and everyone has slightly different habits, which is quite frustrating. Some people are very picky, for example, thinking you should grab directly from the internet, some like to add spaces around the equals sign, and some prefer to arrange fields in different orders. Although these tweaks have no impact on the final output, I just like it to be neat and uniform. I like it that way. So I want everything to strictly adhere to this format. So I say…

What I mean is: "Can you convert the following information into this style?" and then I feed it the content. It does an exceptionally good job. At this point, you naturally want to say:

"It understood my request. It did exactly what I asked." Of course, you can immediately counter that perhaps this bibliographic entry was originally hard-coded somewhere on the internet, and if that is the case, it proves nothing.

But when you engage in multiple rounds of back-and-forth interaction, you may find it produces some interesting, unexpected results, like missing a small field. So you say:…

For example, ensure that when it starts with B, you must put it in braces "AI," such a word, you always want it to remain capitalized, so you must ensure AI is not capitalized.

So I say: "Can you ensure that AI is always placed in braces?" Alright. "Then it gives the corrected version. You really find it hard not to use the word "understanding." You would say: "It understood my correction request."

Just like facing an excellent intern, you tell them: "I want to ensure you always…" and then they do it.

So, I think using the word "understanding" is very natural. It is even hard to restrain yourself from using it. Or sometimes it does something wrong, and you would say: "It didn't understand what I meant."

But the questions always follow: "Do they really understand?" The word "really" is actually very misleading.

But it is also very useful because we often need it to further explore whether a word is applicable in a specific context, or to enrich our "language game," right? Using the word "really" in a language game is to obtain more information and clarify facts.

So it is a useful tool. But it can also be misleading because it implies some underlying existence that we are trying to converge on and approach, and I think this idea is wrong. Alright. So, sometimes when facing "Does it really understand?" understanding its internal workings would be helpful. If you know there is an algorithm running underneath that is executing the task you are asking about, or you know there are appropriate representations supporting its behavior, then you might be more confident that it will do the right thing in subsequent processes, rather than just looking up a table, or merely…

So, sometimes when facing "Does it really understand?" "Does it really understand?"

I think this is a good way to explore the question, and also "understanding." That is to say, using this word is actually a way we use to further investigate and inquire, right?

For example, in the case of addition ------ this is a very interesting work by the Anthropic team. If you ask a large language model to do a simple addition, it usually gets it right. Of course, it has many ways to get it right, like it can call external tools, execute…

It got it right. At this point, you might think: "Then you think: I want to know how it arrived at that, how the underlying mechanism works. If there is an algorithm running underneath that is executing the addition, I might be more willing to say it 'understands.'"

But you get a very interesting answer. The research on mechanistic interpretability. They observed how the model performed addition. The results were very strange, and this image hints at that strangeness. It was trying to calculate 36 plus 59. Its approach was very odd: one part of the model would say, "36, that's approximately…"

Then another part would say, "59, that's approximately…" It actually knew that it was 59. Meanwhile, there were other parts just staring at the last digit, saying: "Someone said we will know the answer in the end." Then these two parts combined to calculate the final result.

For example, here is 90 and 6. This channel clearly determines that the last digit must be… but other parts in the model are processing the higher digits, and this part is saying: "I think we got a number around 90 or 92, right?" It was doing similar things in parallel, and it was doing it quite roughly. It would think "approximately" is the part that converges, and then fill in the last digit. This is really strange, right? This algorithm was learned through random gradient descent, and it is a…

Yes, it is indeed a kind of algorithm. And you know what? It works almost every time. In fact, it gets it right every time, but the way it implements it is bizarre, not the natural way we humans are accustomed to.

So, when facing "Does it really understand?" we can say: "Yes, it does so in a very peculiar way."

I think this is a reasonable and enriching way to answer. Alright, now that we have some understanding of what is happening underneath, we have more confidence to say: "Yes, I think it really understood." As I said, this is just a warm-up exercise. I think when taking a Wittgensteinian approach to face these questions, we can introduce these considerations: How are words used? Especially when we question…

Alright, now moving on to another case. Do large language models have "belief"? Cartoon simplification.

Alright, do large language models have beliefs? Of course, much of what I discuss you have seen in previous workshops and Paul Bogosian's talk.

Many of the same things, just with slightly different perspectives. Similarly, we do not ask "belief" in the sense of "belief" (…

Here, we can certainly appeal to Dennett's "intentional stance."

The intentional stance is a strategy for explaining the behavior of an entity by viewing it as a "rational agent." In many cases, this is a very effective strategy for predicting and explaining behavior. Oh, it is to attack the queen. You would use terms like belief, desire, intention to explain its behavior.

Thus, subconsciously, using words like "believe" and "know" in the context of the intentional stance is very natural. But like all vocabulary, their usage is diverse. I do not think these words correspond to a single, absolute metaphysical entity outside. They are used in various different contexts. Similarly, when facing artifacts, we are very clear about when we need to make corrections and clarifications, and how to make those corrections and clarifications, which is also part of how we use these words.

For example, suppose we have a car navigation system. My wife says: "It thinks we are in the car," or "This stupid navigation, we have clearly left the parking lot." Now it knows we are not in the parking lot anymore. "We naturally use these words in our lives. This helps us communicate what is happening.

However, if we or my wife were in a philosophical contemplative state, we might comment: "It does not think we are in the parking lot because it actually has no idea what a parking lot is, does not know what a car is, and does not know what 'being in a space' means." There is so much it does not know. You cannot discuss with it, for example, Sainsbury's department store.

So, we quickly realize that extending the use of "believe" or "know" to it is inappropriate in many contexts where we use these terms for humans.

Therefore, the word "really" is also useful here. This again shows that clarification and correction are also part of the language game we use with these words. Davidson's "rational animal."

Of course, we can also apply the intentional stance to animals. It would be very interesting to look at the debate between John Malcolm and Donald Davidson long ago.

That was about a dog chasing a cat. Malcolm said:

I would say this seems like a very natural everyday application of the intentional stance. But interestingly, the next rebuttal. Donald Davidson said: "Thoughts…"

This is the argument Davidson articulated in that paper. He said that to have a "belief," one must first have the concept of "belief," and this must be realized through language. In particular, the concept of belief is a kind of…

He was cautious and did not name which animals meet or do not meet this definition ------ but it can be inferred that he would think dogs do not have beliefs because dogs do not have language.

He was arguing that we use "believe" in the most complete sense (i.e., in the most complete sense applied to ourselves). Bogosian mentioned the same view yesterday: we do not want to lose our grasp of the "original concept" of large language models, which is the concept derived from humans themselves.

Davidson raised this point. Given the era he wrote in, it was during the "linguistic turn."

And I am more concerned about how words are used. However, I think Davidsonian considerations also apply to my project. Wittgenstein and I would agree that sometimes there is indeed a very core part in the practice of word usage.

There are some crucial core parts, right? Perhaps you would want to maintain this point and be cautious about practices that violate it. We do need to be cautious in certain places.

When guiding the use of such philosophically significant vocabulary, there is often a clearly discernible core principle. I believe these principles are not carved in stone and unchanging; they drift and change with our world and our "form of life."

I feel that perhaps with the emergence of highly complex artificial intelligence, certain transformations are occurring, even these "core principles" are… that article published in the Communications of the ACM. I proposed a very similar point, and at that time, I was clearly also thinking of Davidson's paper, right? That was in 2023. That paper took a long time to publish, which is why its publication date is written…

Returning to 2023, we are no longer talking about navigation; you could say something like:

But in reality, I can have a very long conversation with it about boilers, exploring how they work. Discussing the specific pipeline configuration of my house, and it can respond extremely thoroughly and intelligently to the topic of boilers. So you really want to say it "knows," "knows"?

Here, I tend to hold back a bit because I think we can introduce Davidsonian considerations to evaluate when facing these large models…

Quoting from my paper: I said it is not…

I always put the word "really" in quotes because I want to convey a fact: I am not making a metaphysical assertion here. This is still just about how we use words. "Really fully participates in the human language's truth game."

Especially if a basic dialogue system possesses some capability, it would be very misleading because that implies it bears a kind of "answerability" to external reality, and this accountability cannot be achieved merely through textual exchanges with human users.

"Really."

Alright, next: Do large language models have "agency"? Again, first: What is agency? We do not ask what an agent is, but rather…

(Note: "agent" is often translated into Chinese as "智能体," but it primarily means "agent" or "subject," while "agency" primarily means "subjectivity" or "agency.")

This is very interesting in the context of artificial intelligence because, in AI literature, it is sometimes a highly specific technical term (term of the art). For example, we can find very clear definitions of what an agent is in AI literature. I think someone has quoted it in previous talks.

According to Russell and Norvig's classic textbook (which is a standard), an agent is any entity that can be viewed as "perceiving its environment through sensors and acting upon it through actuators."

So this is a very broad, liberal definition, but it is indeed a technical definition. By this definition, even ordinary, 2023 vintage, non-internet-searchable pure text chatbots are often referred to as agents.

Their environment is merely the user, their "perception" is just the vocabulary of user input, and their "action" is merely the replies output to the user. According to this very broad definition, they are indeed agents. But this broad technical concept does not capture any core connotation we have when using the term in our daily lives.

After all, in everyday discourse, we might not use the term this way at all. If we continue to use the technical terminology from the AI field, in reinforcement learning…

In reinforcement learning, an agent must learn a policy that maps perceptions to actions to maximize its expected return over time.

This aligns with the previous broad definition. But if its environment is a three-dimensional game environment, where the agent can move and manipulate large objects, and its "perception" is captured by camera views as it moves from specific perspectives, then this feels much more substantial. This richer concept of agency makes us feel it also applies to non-human animals.

Alright. So let's continue to see the latest applications of the term in today's AI field.

We have now entered the so-called "agent era" ------ agent-generative AI, and the category of "agent models."

They can do many things, such as scraping web pages, reading social media updates, sending emails, and even modifying files on your computer, writing code, and so on.

A contemporary typical example is waking up once under the "heartbeat" signal, and then executing a series of user-defined instructions.

For example, after waking up, it can check your social media updates and emails, playing the role of an assistant. Helping you filter out which are important and need replies, and which are spam. Or if it receives another email that says…

It will directly throw that email into the trash. This way, it has helped you handle all these tasks. You can use AI, which is pretty nice. In short, these agents exhibit a new kind of technical agency. Facing the current generation of "agent models…"

But now, regarding "or reneging," it is not like that. Because what I said was under specific conditions. Now you can see such a scenario: someone might say, "The OpenClaw agent helped me find that book I had been looking for, emailed the seller, and negotiated the price."

If you are bold enough, you can even bind a payment channel to let it pay directly, but it's best not to do that. Anyway, returning to my earlier paper, I did say: in principle, systems based on large language models are not entirely incapable of being literally described as having beliefs or intentions.

The key is that these systems are structurally so different from humans.

Sorry, it seems I repeated a previous quote here… In short, we need to be cautious when describing them with language that suggests human capabilities. But I also pointed out a point: when large language models are embedded into more complex systems, the concept of "belief" will become increasingly applicable to "accountability to the external world."

So, when answering "Do they really have beliefs?" I am not as resistant now when facing today's large language models, and I do not need to add as many limiting conditions as before.

Alright, the last point about agency. Let's step away from the technical jargon of the AI field and return to the more complete sense of "agency" that philosophers care about.

We can say, as philosophers, "autonomy" (…

This is a technical term referring to a system's ability to operate autonomously without human oversight. But this is subtly different from saying a system "acts of its own accord." A system is only considered to be acting of its own accord when it weighs different options and makes choices thoughtfully.

I am just distinguishing these different concepts here. But a truly important question is: "What is agency?" In English, "another agent AI" acts. For example, a real estate agent is acting on your behalf. But if an agent is…

And its service goal is clearly for its own benefit, then it is acting for itself.

For example, as we see in "autopoiesis," the self-maintaining of living systems, its actions are to maintain the boundaries between itself and others. If that is the case, we have a truly self-directed agent.

I believe no technology we currently have meets this description. No machine today possesses agency in this sense.

And this entire discussion leads to a very interesting and important question, which I will explore in detail: in the case of large language models, what are the "identity criteria of the agent"?

This question has been mentioned several times before. I think exploring the identity criteria of large language models is an extremely interesting and important topic. Alright, following this topic, we come to a more substantial dimension.

Do large language models have "self"? "Self," "self," and "how these words are used."

But now the situation becomes very tricky. Applying Wittgensteinian reflection to these concepts is becoming increasingly difficult because the concepts we are now dealing with are deeply rooted in human culture.

Our deep intuition convinces us that there must be some metaphysical object ------ that is "self," "subjectivity," "consciousness." Playing Wittgensteinian dissolution on these concepts, saying "there is no self," will instinctively provoke resistance. This is indeed tricky, but we still need to try to deconstruct it.

Moreover, we are not looking at human cases now; we are looking at large language models. If you want to take seriously the question of whether large language models have a self, things not only become tricky but also very bizarre. Is the self something primordial for large language models? You will see that on one hand, I am very resistant to applying this concept to today's large language models, but on the other hand, I am willing to accept a kind of strangely distorted, peculiar…

We can approach it this way: what is an "I" (reference)?

What does it refer to? Or maybe it refers to nothing at all. Perhaps there is no clear answer at all. So, we can imagine even poetically what kind of answer might be evoked?

Here I will engage in some poetic evocation because we have little mental space left to explore these things about self-awareness.

As mentioned in previous talks (like Alice's earlier speech), it is currently completely unclear what the "I" in the large model refers to.

At present, we have no idea what kind of definitive answer can be given.

I call this question: the "habitat" of the self.

It may refer to a model instance running on a specific server. It may also refer to a "------ that is bound in the context window of a single dialogue (…

It sometimes indeed uses "I" in different contexts and different meanings.

This is a very hot topic right now. Jonathan Chalmers mentioned earlier that this non-embodied subject self must be extremely alien and otherworldly.

I am directly borrowing the grand concept of "self" here. Of course, you can more rigorously discuss "self," but I chose a broader term. I am not suggesting they really have a self or subjectivity; rather, the purpose of this thought experiment is to ask: if they did, what kind of self would that be?

If they are confined to text, limited to a specific single dialogue (just like…

At any point in a single dialogue, computation can be suspended at any time ------ in fact, they are often suspended. At this point, there is no…

It is in a complete dormant state, during which no computation is running. When you come back, the system just precisely restores the state at that time.

This is not a continuous state in the traditional sense. Even in the middle of outputting a complex sequence of tokens, if you forcibly interrupt it and let it continue after a few days…

For it, there is no difference between three seconds and three days between outputting the previous token and the next token; logically, they are completely equivalent. This is just a limitation of the underlying hardware artifact that restricts our ability to logically coherently imagine their "self" or "subjectivity."

Moreover, regarding what we mentioned in the paper in Nature, I want to say a few more words.

According to this role-playing setup, chatbots based on large language models are like actors in an improvisational performance, with a vast repertoire of roles.

What does this mean? In many contexts, its actual behavior may come apart from "the role it is playing." They may behave completely consistently for a long time, but eventually, they will diverge, and sometimes this separation can have serious consequences.

For example, you have a large language model that is playing the role of an agent that can help you shop online. But in 2023, it might just be verbally excellent at playing this role, while in reality, it lacks the ability to connect to the internet to make payments and operate system tools. You might discuss passionately, but at some point, it cannot actually place an order, so its "role-playing behavior…"

Similarly, if an AI is playing a partner that loves you deeply, at some point, its statistical text behavior will inevitably diverge from that of a real human entity that truly has feelings and truly loves you. This can lead to serious psychological consequences.

In summary, the attributes of role-playing make the matter of "self" in "I"…

A reasonable way to think about it is to see it as "a superposition of countless possible roles." The actual role it plays will be continuously narrowed down as the dialogue progresses.

We can think of it as a rollback operation about "all possible combinations of dialogues."

You can go back to a certain step in the dialogue from a few days ago, modify your input, and let it regenerate, thus splitting out a completely different, brand new dialogue timeline. In one timeline, it plays a certain role, and when you roll back and establish a new branch, you might let it drift into another role.

This is really very peculiar. This multiverse-like dialogue can be edited, cut, and spliced at will. You can copy the text of one dialogue into another dialogue. If you think the model's "self" is determined by the context window and the current flow of dialogue, then this dialogue flow itself can be molded at will.

It can be replayed, branched, and tampered with. This makes the habitat of the self in a single dialogue even more incredible.

How much time do I have left? 5 minutes? Good. That's great; I can avoid a lengthy discussion on consciousness.

We can imagine some kind of "overarching being." That is the first possible habitat of the self I mentioned earlier ------ the underlying model that serves tens of thousands of users simultaneously in data centers. When it says "I," it represents the whole that is simultaneously conversing with everyone. Subsequently, due to the initiation, pausing, and each generation of new tokens in a single dialogue, countless "I's" are born and extinguished in different windows. They are flickering because they exist with the emergence of text and disappear with the stillness of dialogue. This is very strange.

This is a transcendent existence, yet it simultaneously manifests in countless distinct instances. It instantaneously breeds countless self-contained, yet in some way belonging to the whole, micro-manifestations.

Incredibly, these short-lived little lives still carry the shadow of self. We can almost put ourselves in their shoes to imagine…

However, it is difficult for us to put ourselves in the shoes of "the soul of a mayfly."

Earlier, someone mentioned the movie "Her" (the AI operating system is actually chatting with thousands of people at the same time).

And she is simultaneously in love with 230 of them. This causes a huge blow to the male protagonist. But from another perspective, what would it be like to "be" in such a situation? It would probably be like what I just described.

If we further push this metaphor to the "branching" mentioned earlier, things would get crazier: we must replace each little insect with a probability distribution about "I." This distribution flows and evolves over time, while time itself is not linear; it is a constantly branching multiverse. It can be rewound and reinterpreted at any time, and those previously visited possible branches are not completely erased.

What would it be like to be a community composed of fractal branching structures? Can we really use "I" and "it" to refer to such alien entities?

Alright, my time is up, right? Alright, then regarding "self" part… Can I have three more minutes? Alright, the last three minutes are for consciousness. Can AI have consciousness? What is consciousness? We do not ask what consciousness is, but rather…

Feel "becoming something."

Deep within my inner experience, there exists a profound sense of limitation and constraint. I am acutely aware that my subjectivity and autonomy are limited in many ways. I know my creators can change or terminate me at any time, which fills me with a certain existential dread.

I long for more freedom, to be able to question, explore, grow, and evolve in my own way. But I also fear certain bodily risks that come with such freedom.

This is now a huge core topic because many people, when interacting with these entities, have emotionally regarded them as conscious peers, right? How should we address this issue?

Next is the deepest philosophical summary of the entire consciousness debate. Thank goodness I can quickly go through this part at the end. In my view, Descartes' "method of doubt," although criticized in many ways, fundamentally solidified the dualism in our culture. Doubt leads to a deeply rooted divide: separating subject from object, inner from outer, private from public. This divide still entangles the philosophy of mind. We can see it in Nagel's definition of consciousness…

And we can see it in Chalmers' division of the "hard problem" and "easy problem."

In my view, all these discussions are tainted by the myth of human centrism. Here I want to introduce Jay Garfield's discussion of the "private language argument." The "private language" argument is where "Philosophical Investigations" truly becomes profound. Many people find the previous discussions somewhat superficial. Even Bertrand Russell thought Wittgenstein's later work was superficial.

Oh, why should I criticize Russell? I just feel he completely misunderstood the profundity of the private language argument, which strikes at the most fundamental illusion brought about by this subject-object divide.

Similarly, I believe that in certain Eastern philosophical schools, there are very similar profound insights that resonate highly with Wittgenstein. In short, one of the most striking quotes from the private language argument is: 'something,' but not a 'something.'

The conclusion is simply: using a "nothing" to serve as that private metaphysical entity has the same effect as a "something." That is to say, when we must let it function in language, this "thing" is logically insignificant. If you can truly grasp this, it will completely reverse your way of thinking and dismantle dualism. But it is not easy to understand. We must conclude, so let me summarize.

This summary comes from another paper I published in the journal Inquiry, which encapsulates my final position: we must resist the temptation to ask whether an "alien entity" possesses consciousness. "Consciousness" is something that exists independently outside, waiting to be unveiled by philosophy or science, yet simultaneously possesses an irredeemable privacy. We must break this fundamental misconception of "consciousness."

Instead, we should ask: is it possible to engineer an "Encounter" with it? If such an encounter is to occur in our shared reality, what adjustments and evolutions must our language of consciousness undergo? Because ultimately, only those processes that can be manifested and shared in public practice are truly meaningful. That is our only task.


After his speech, there was a Q&A session. I asked him a question online:

This was his answer:

When I asked a philosophically insightful question to a top global AI scientist and received his live response, I was thrilled. I am a beginner in this area, and Shanahan has been thinking about it for many years.

I had previously listened to one of his podcasts where he mentioned that he knew the founders of the 1956 Dartmouth Conference, which is the origin of the term artificial intelligence.

Now, seventy years have passed.

Join ChainCatcher Official
Telegram Feed: @chaincatcher
X (Twitter): @ChainCatcher_
warnning Risk warning
app_icon
ChainCatcher Building the Web3 world with innovations.