Active Construction of Past Episodes

“The active construction of past episodes”

Episodic memories – declarative memories of past events, characterized by rich spatiotemporal context – play a central role in guiding perception and behaviour.
Here, we advance a model that integrates episodic memories within the active inference framework. We describe how episodic memories are incorporated into the generative models used in active inference to support the re-construction, replay and communication of past events. In doing so, we foreground two foundational themes.
The first is the message passing in deep temporal models that allow one to actively construct memories of episodes.
The second is the communicative aspect of declarative memories, and the way in which one might recount something from one’s autobiography.
In effect, this means that the message passing that supports episodic memory propagates information about what we have done – or what we would do – given past circumstances to draw inferences about how to communicate those beliefs.
Together, these themes emphasise that we are not passive recorders of the things that happen to us. We are active participants in the events we recall and in the telling of stories about them.


From the perspective of active inference, behaviour is accounted for in terms of the internal generative world model a brain uses to guide inference (by synaptic message passing) and action. The core idea is that, by predicting the sensory data one would expect if all were going well, our nervous systems can select the actions that realise those predictions. Framed in this way, the definition above tells us two things. A generative model for a creature capable of declarative memories must include a communicative component. If the requisite declarations are verbal, it would need to predict the sensory (e.g., auditory) consequences of speech. For it to be episodic, it must entail statistics about the place and time at which an event occurred.

Memory in natural creatures is a constructive process – not an exhaustive recording of all that has happened in the past. During encoding of memories, a compressed representation is formed that can subsequently be used for reconstruction of the event during recall. However, this poses a problem. If we draw inferences based upon a generative model, how do we go about reconstructing the inferences we made in the past when trying to explain current sensory data? This tells us that a further aspect of memory is the ability to transiently suppress the influence of current sensory data – i.e., to attenuate their precision – during the recall of a memory. Otherwise, the constraints imposed by the world, as it currently is, make it difficult to entertain inferences about the world as it was at some point in the past.

Deep temporal models and message passing. The upper part of this figure expresses a deep temporal model as a Forney factor graph. Each square represents a probability distribution. The lines (‘edges’) connecting those squares are the variables shared by different distributions. Squares containing equals (‘=’) signs enforce equality of all connected edges. For the first episode, the distributions are annotated as prior (including empirical priors) beliefs, transition probabilities, and likelihoods that give the probability of sensory data given some state of the world. The second episode is equipped with arrows that represent the process of inference – here assuming a belief-propagation like scheme. The red arrows show the ‘descending’ messages that predict sensory data. The blue arrows show ‘ascending’ messages that communicate the evidence from those data. Green arrows deal with predictions through time. The lower part of the figure reinterprets these arrows as neuronal populations, preserving the colour coding. We have not reproduced the entire network that would solve the model above – just the portion for which arrows are shown in the upper figure. This is the reason for the connections to populations not shown in the ‘across episodes’ panel. The numbering in black squares is for reference in a later figure, and we will not go through these in detail here except to say that population #6 follows a slightly different pattern to the other messages. Here, we have a series of (blue and green) populations representing the messages concerning anticipated future states under each alternative policy or plan. These are used to update beliefs about which policy to pursue.
Within episode generative model. This graphic illustrates the structure of the state-space available within a single episode. On the left, we see a set of states that can be occupied during the walk. Participants start at the junction (middle) and from here can either return to the signpost or select the left or right path. On the right, we show the available syntactic and semantic states for verbal communication. These make up a limited language comprising a small number of possible sentences. The initial state for each epoch is either a ‘no speech’ state, the ‘what’ state, or the ‘I’ state. The latter two progress through a defined sequence, with certain points at which alternative words can be substituted. The # states are the semantic states, and the alternative values each of these can take is shown in different shades of blue. The #3 state also doubles as a state for the behavioural task on the left, and is expected to generate both the context (left or right) determining whether the path home is to the left or right, and the spoken word in the corresponding syntax.
Depth and factorisation. The graphic on the left shows two kinds of factorisation and the variables in play. This is a non-standard graph notation that seeks to complement the information present in Figure 1. Here, we collapse over the temporal dimensions shown in Figure 1 in favour of an expansion of the factorial dimensions (some of which were unpacked in Figure 2). It can be read analogously to a Bayes net, but where the outer variables are conditioned upon the variables closer to the centre of the graph. Here, the radial dimension corresponds to the hierarchical level, getting faster (and closer to sensory data), while the circumferential dimension shows the factorisation of different kinds of states or sensory modalities. The edges (lines) indicate the conditional dependencies between the different states mediated through conditional (empirical) priors and likelihoods. On the right, a loose relationship is shown between different timescales (matching the colours on the left) and representative brain areas that might operate at episodic state (medial temporal), policy (striatal), fast state (frontal), and sensory (occipital) timescales. The arrows here indicate the reciprocal message passing (see Figure 1) expected between these regions.

Microcircuits and messages. This figure addresses the finer-scale anatomy of different cortical cytoarchitectures and of the message passing implied by a belief-propagation scheme that does and does not disambiguate between alternative trajectories. The numbers correspond to those in Figure 1.
The isocortical layers are shown from superficial (layer I) to deep (layer VI) in the vertical dimension.
The hippocampal formation is shown as a coronal slice.
For each of these, we preserve the connectivity implied by the model of Figure 1 and the patterns of laminar connectivity known from empirical research as summarised in the main text. The convergence of ascending and descending messages onto population #7 in the isocortex graphic facilitates prediction of several possible alternative futures (populations labelled #6). Perhaps the relative complexity of the 6 layered neocortex is that it predicts futures conditioned upon actions which, communicated to the basal ganglia via layer V, help in adjudication between plans. In contrast, 3 layered allocortex may predict sequences that offer compressed representations, that are only unpacked into predictions of policies and of states conditioned upon those policies through projections either to neocortex or basal ganglia.

[…] Emerging developments in theoretical neurobiology, specifically active inference, might play a role in our understanding of episodic memory.
The contribution of active inference here is the idea that memory is an active process in which one constructs imagery of the past based upon empirical prior beliefs about how one acted, or would have acted, in a given context.
The definition of declarative memory is inherently active in the sense that it implies the ability to declare or communicate the contents of a memory.
This underlines the importance of developing models that capture both the behaviour during the episode to be recollected but also the declarative process of communicating those events.
By modelling the communicative part of memory, we move a step closer to a computational account of the measurements taken by clinicians, implicitly, while listening to a patient’s story.

Leave a comment