This is one of the 100+ free recipes of the IPython Cookbook, Second Edition, by Cyrille Rossant, a guide to numerical computing and data science in the Jupyter Notebook. The ebook and printed book are available for purchase at Packt Publishing.
▶ Text on GitHub with a CC-BY-NC-ND license
▶ Code on GitHub with a MIT license
Chapter 13 : Stochastic Dynamical Systems
Discrete-time Markov chains are stochastic processes that undergo transitions from one state to another in a state space. Transitions occur at every time step. Markov chains are characterized by their lack of memory in that the probability to undergo a transition from the current state to the next depends only on the current state, not the previous ones. These models are widely used in scientific and engineering applications.
Continuous-time Markov processes also exist and we will cover particular instances later in this chapter.
Markov chains are relatively easy to study mathematically and to simulate numerically. In this recipe, we will simulate a simple Markov chain modeling the evolution of a population.
- Let's import NumPy and matplotlib:
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
- We consider a population that cannot comprise more than
$N=100$ individuals, and define the birth and death rates:
N = 100 # maximum population size
a = .5 / N # birth rate
b = .5 / N # death rate
- We simulate a Markov chain on the finite space
${0, 1, ..., N}$ . Each state represents a population size. Thex
vector will contain the population size at each time step. We set the initial state to$x_0=25$ (that is, there are 25 individuals in the population at initialization time):
nsteps = 1000
x = np.zeros(nsteps)
x[0] = 25
- Now we simulate our chain. At each time step
$t$ , there is a new birth with probability$ax_t$ , and independently, there is a new death with probability$bx_t$ . These probabilities are proportional to the size of the population at that time. If the population size reaches 0 or N, the evolution stops:
for t in range(nsteps - 1):
if 0 < x[t] < N - 1:
# Is there a birth?
birth = np.random.rand() <= a * x[t]
# Is there a death?
death = np.random.rand() <= b * x[t]
# We update the population size.
x[t + 1] = x[t] + 1 * birth - 1 * death
# The evolution stops if we reach $0$ or $N$.
else:
x[t + 1] = x[t]
- Let's look at the evolution of the population size:
fig, ax = plt.subplots(1, 1, figsize=(8, 4))
ax.plot(x, lw=2)
We see that, at every time step, the population size can stay stable, increase, or decrease by 1.
- Now, we will simulate many independent trials of this Markov chain. We could run the previous simulation with a loop, but it would be very slow (two nested
for
loops). Instead, we vectorize the simulation by considering all independent trials at once. There is a single loop over time. At every time step, we update all trials simultaneously with vectorized operations on vectors. Thex
vector now contains the population size of all trials, at a particular time. At initialization time, the population sizes are set to random numbers between 0 and N:
ntrials = 100
x = np.random.randint(size=ntrials,
low=0, high=N)
- We define a function that performs the simulation. At every time step, we find the trials that undergo births and deaths by generating random vectors, and we update the population sizes with vector operations:
def simulate(x, nsteps):
"""Run the simulation."""
for _ in range(nsteps - 1):
# Which trials to update?
upd = (0 < x) & (x < N - 1)
# In which trials do births occur?
birth = 1 * (np.random.rand(ntrials) <= a * x)
# In which trials do deaths occur?
death = 1 * (np.random.rand(ntrials) <= b * x)
# We update the population size for all trials
x[upd] += birth[upd] - death[upd]
- Now, let's look at the histograms of the population size at different times. These histograms represent the probability distribution of the Markov chain, estimated with independent trials (the Monte Carlo method):
bins = np.linspace(0, N, 25)
nsteps_list = [10, 1000, 10000]
fig, axes = plt.subplots(1, len(nsteps_list),
figsize=(12, 3),
sharey=True)
for i, nsteps in enumerate(nsteps_list):
ax = axes[i]
simulate(x, nsteps)
ax.hist(x, bins=bins)
ax.set_xlabel("Population size")
if i == 0:
ax.set_ylabel("Histogram")
ax.set_title(f"{nsteps} time steps")
Whereas, initially, the population sizes look uniformly distributed between 0 and
Mathematically, a discrete-time Markov chain on a space
A (stationary) Markov chain is characterized by the probability of transitions
Simulating a single Markov chain in Python is not particularly efficient because we need a for
loop. However, simulating many independent chains following the same process can be made efficient with vectorization and parallelization (all tasks are independent, thus the problem is embarrassingly parallel). This is useful when we are interested in statistical properties of the chain (example of the Monte Carlo method).
There is a vast literature on Markov chains. Many theoretical results can be established with linear algebra and probability theory.
Many generalizations of discrete-time Markov chains exist. Markov chains can be defined on infinite state spaces, or with a continuous time. Also, the Markov property is important in a broad class of stochastic processes.
Here are a few references:
- Markov chains on Wikipedia, available at https://en.wikipedia.org/wiki/Markov_chain
- Absorbing Markov chains on Wikipedia, available at https://en.wikipedia.org/wiki/Absorbing_Markov_chain
- Monte-Carlo methods on Wikipedia, available at https://en.wikipedia.org/wiki/Monte_Carlo_method
- Simulating a Brownian motion