Active Inference in the real world

Most AI systems are trained to map inputs to outputs, optimise rewards, or imitate behaviour.

Active inference takes a different approach.

Instead of learning a policy directly, the agent maintains a model of the world, continuously predicts what it expects to observe, and acts to reduce the mismatch between prediction and reality.

In simple terms:

the agent predicts → compares → updates → acts → and repeats

Behaviour emerges from this loop.

Open the full writeup (PDF)

From idea to mechanism

(I work with industry partners applying active inference and generative modelling to real-world systems. Consulting)

The agent receives sensory input from the world (here, ray-cast distances acting as a crude form of vision), and uses a generative model to predict what those inputs should be under different actions.

When predictions are wrong, the model updates. When actions can reduce prediction error, the agent moves.

There is no explicit reward function, and no pretraining. Behaviour arises from maintaining consistency between beliefs and observations.

Embodiment and physics

Moving from 2D environments to a 3D physical world changes everything.

The agent must deal with inertia, drag, collisions, and partial observability. It cannot simply “jump” between states — it has to act through a body, in space, over time.

In this setting, behaviour becomes visibly different:

small corrections and reorientation
hesitation near obstacles
gradual convergence to goals
recovery when predictions fail

The world is not given. The transition dynamics are learned online from interaction with the environment.

Example: collect and return

In this task, the robot must locate an object, grasp it, and return to a home zone.

All behaviour (exploration, obstacle avoidance, grasping, and return) emerges from the same underlying inference process.

Example: polyphonic drone navigation

The same active inference principles can also be extended to aerial agents operating in more open 3D environments.

In this drone setting, behaviour emerges from online inference, short-horizon rollout, and the balancing of multiple control pressures during navigation.

Mathematical formulation

Active inference is grounded in variational inference.

The agent maintains beliefs over hidden states \(x\) and minimises a quantity known as variational free energy:

\( F = \mathbb{E}_{q(x)}[\log q(x) - \log p(y, x)] \)

This can be interpreted as:

\( F = \text{prediction error} + \text{complexity} \)

Minimising \(F\) corresponds to finding the best explanation of sensory data.

Action is selected by minimising expected free energy:

\( G(\pi) = \mathbb{E}[\text{risk}] + \mathbb{E}[\text{ambiguity}] \)

This drives behaviour that both achieves goals (low risk) and reduces uncertainty (low ambiguity).

Why this matters

Active inference provides a unified framework for perception, learning, and action.

The same principles used to model brain function can be used to build agents that operate in complex, uncertain environments.

Rather than separating learning, planning, and control, these processes emerge from a single objective: maintaining coherent beliefs about the world.