Module 1

Generative Models of the World

Helmholtz in 1867 proposed that perception is unconscious inference: the brain projects sensory data through an inverted generative model to recover the underlying causes. This module develops the generative-model picture, from Helmholtz to hierarchical Bayesian networks to the modern Bayesian brain.

1. Helmholtz’s Unconscious Inference

Helmholtz observed that visual perception routinely adds information not present in the retinal image — depth from binocular disparity, object permanence across saccades, illumination correction for reflectance. He framed this as unconscious inversion of a generative model: the brain asks “what causes would have produced these sensations?” and returns the most probable answer. Roughly 150 years later, that idea was formalised as Bayesian perception (Knill & Pouget 2004).

2. Hierarchical Latent Variables

A world model is a hierarchy of latent variables where each level generates the level below:

\[ P(\text{sensation}) \;=\; \int P(\text{sensation}\mid z_1)\,P(z_1\mid z_2)\,P(z_2\mid z_3)\cdots dz_1\,dz_2\,\cdots \]

In vision, low-level latents are edges and textures; mid-level are objects; high-level are scenes and abstract concepts. Cortex is organised hierarchically along this gradient. Rao & Ballard 1999 showed that a specific generative- model hierarchy reproduces V1 receptive-field properties when trained on natural images. M3 covers predictive coding in detail.

3. Inference in the Brain

Exact Bayesian inference is generally intractable for high-dimensional world-models. The brain must approximate. Two major routes considered in later modules: (a) variational inference (M2, M4) — approximate the posterior with a tractable family; (b) sampling (not covered in depth) — explicit Monte-Carlo over latents. Predictive coding (M3) implements a particular form of variational inference that maps onto cortical microcircuits.

Simulation: 2-D Latent, High-D Observation

Python

script.py32 lines

import numpy as np, matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

# 2D latent -> high-dim observation (sinusoidal grating)
rng = np.random.default_rng(42)
n = 500
z = rng.normal(size=(n, 2))
# Observation is 16x16 image of oriented grating whose orientation + phase = z
def render(z_vec):
    x = np.linspace(-2, 2, 16)
    xx, yy = np.meshgrid(x, x)
    theta, phase = z_vec
    f = 1.5
    return np.sin(2*np.pi*f*(xx*np.cos(theta) + yy*np.sin(theta)) + phase)

imgs = np.array([render(zz) for zz in z])
# Sample 8
idx = rng.choice(n, 8, replace=False)

fig, axes = plt.subplots(2, 4, figsize=(13, 6), facecolor='#0a0a1a')
for i, ax in enumerate(axes.ravel()):
    ax.imshow(imgs[idx[i]], cmap='RdBu_r')
    ax.axis('off')
    ax.set_title(f'z=({z[idx[i],0]:.2f}, {z[idx[i],1]:.2f})',
                 color='#c4b5fd', fontsize=9)
fig.suptitle('Generative model: 2-D latent z -> 16x16 observation',
             color='#c4b5fd', fontweight='bold', fontsize=14)
plt.tight_layout()
plt.savefig('output.png', dpi=120, bbox_inches='tight', facecolor='#0a0a1a')
print('Generative model: P(image | z) via rendering function')
print('Inference: given image, find posterior over z')

Click Run to execute the Python code

Code will be executed with Python 3 on the server

Key References

• Helmholtz, H. von (1867). Handbuch der Physiologischen Optik.

• Rao, R. P. N. & Ballard, D. H. (1999). “Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects.” Nat. Neurosci., 2, 79–87.

• Friston, K. J. (2005). “A theory of cortical responses.” Phil. Trans. R. Soc. B, 360, 815–836.

• Lee, T. S. & Mumford, D. (2003). “Hierarchical Bayesian inference in the visual cortex.” J. Opt. Soc. Am. A, 20, 1434–1448.

Share:X Reddit LinkedIn

← Module 0 Module 2: Prediction Error →