Attention as a schematic model in AI
Explanation
The Attention Schema Theory (AST) was proposed by the Princeton neuroscientist Michael Graziano in his book Consciousness and the Social Brain (2013) and developed in later work (Rethinking Consciousness, 2019). Although originally formulated as a neuroscientific theory of human consciousness, it has attracted considerable interest in AI because it suggests concrete steps for implementing consciousness-like functions in machines.
The central thesis: the brain constructs models of things in the world in order to predict and control them better. Among the things the brain models is its own attention. The attention schema is a simplified internal model of what the brain is attending to, and how, at each moment. This model is not attention itself but a representation of it, useful for cognitive control.
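To make that distinction concrete, here is a minimal toy sketch (an illustration, not Graziano's model): "attention" is a full weight distribution over items, while the "schema" is a compressed description of that distribution, usable for control and report without reproducing it in detail. The item labels and the entropy-based summary are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_schema(weights, labels):
    """Schematic model of attention: a small description of what is
    attended and how sharply, not the weight vector itself."""
    focus = int(np.argmax(weights))
    # Entropy as a crude measure of how diffuse attention is (an assumption).
    diffuseness = float(-(weights * np.log(weights + 1e-12)).sum())
    return {"focus": labels[focus], "diffuseness": diffuseness}

labels = ["face", "voice", "background"]       # hypothetical attended items
weights = softmax(np.array([2.5, 0.3, -1.0]))  # attention itself
print(attention_schema(weights, labels))
# -> {'focus': 'face', 'diffuseness': ...}: the schema describes
#    attention schematically rather than replicating it.
```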
Graziano suggests that this description, this internal schema of attention, is what we call consciousness. When we say 'I am conscious of X', we are describing the content of the attention schema: my attention is now on X. The schema is functionally useful: it enables self-regulation (directing attention where it is needed), social communication (explaining what we are attending to), and metacognition. And it is computationally relatively simple: a schematic model, not a detailed replica.
Philosophical consequences: phenomenal consciousness (qualia, 'something it is like') is, according to Graziano, a useful illusion generated by the attention schema. The schema represents us to ourselves as having subjective experience, even though no real qualia exist. This is an eliminativist position on phenomenal consciousness, comparable to Dennett's. It is one of the most direct responses to Chalmers's hard problem: there is no hard problem, because there is no phenomenal consciousness to explain, only its functional model.
For AI, AST suggests that to create systems with some consciousness-like properties, we should build agents with the capacity for selective attention (already present in transformer architectures as the attention mechanism) and with models of their own attention (self-attention schemas). This would give the system the capacity for self-monitoring, for reporting on its own cognitive processes, and for communicating about internal states; a sketch of the idea follows below. Some interpretability research on LLMs explores what models know about their own processing, which is analogous to an attention schema.
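As a hedged illustration of this proposal, the sketch below pairs a single-head self-attention layer with a hypothetical "schema head" that predicts the layer's own attention pattern from the token representations. Nothing here comes from Graziano's implementations; the weight names (Wq, Wk, Wv, W_schema) and the KL objective are assumptions chosen to make the idea concrete: training the schema head to match the true pattern would give the system a simplified, reportable model of its own attention.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n = 16, 5                                     # model width, sequence length
Wq, Wk, Wv = (rng.normal(size=(d, d)) / np.sqrt(d) for _ in range(3))
W_schema = rng.normal(size=(d, n)) / np.sqrt(d)  # hypothetical schema head

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

x = rng.normal(size=(n, d))                      # token representations
q, k, v = x @ Wq, x @ Wk, x @ Wv
attn = softmax(q @ k.T / np.sqrt(d))             # attention itself (n x n)
out = attn @ v                                   # ordinary attention output

schema = softmax(x @ W_schema)                   # the model's model of its attention
# Per-token divergence between attention and its schema; a training loss
# would minimize this so the schema tracks (schematically) real attention.
kl = (attn * (np.log(attn + 1e-12) - np.log(schema + 1e-12))).sum(axis=-1)
print("mean KL(attention || schema):", float(kl.mean()))
```

On this reading, the trained schema output, rather than the raw attention weights, is what would feed report or control modules: the functional role AST assigns to the schema.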
For the theory of consciousness, AST is a coherent and accessible proposal that attempts to demystify the hard problem by treating it as a problem of cognitive modelling. Although it has been criticised (critics argue that it simply relocates the problem: why should having an attention schema produce subjectivity?), it offers a scientifically tractable hypothesis with possible AI applications. Graziano and colleagues work on artificial implementations of the attention schema and study how these correlate with measures of consciousness in humans. As a theory combining cognitive neuroscience, philosophy of mind and implementation possibilities in AI, AST remains a stimulating contribution to the contemporary landscape.
Strengths
- Clear, buildable technical proposal.
- Direct continuity with cognitive neuroscience.
- Addresses verbal report and self-modelling simultaneously.
- Posits falsifiable criteria via implementation.
- Unified framework for human and artificial consciousness.
Main critiques
- Phenomenal eliminativism is contested.
- Qualia may be irreducible to schematic models.
- Verbal report as a criterion is vulnerable to imitation.
- The sufficiency of the attention schema has yet to be demonstrated.