Markov Chains &amp; Random Walks — Stochastic Processes Explained | 3D
      Simulations

3D Simulations

Stochastic Processes · Probability · Statistics

📅 April 2026 ⏱ ≈ 12 min read 🎯 Beginner–Intermediate · Last updated: 22 June 2026

Markov Chains & Random Walks

A Markov chain is a stochastic process with one beautiful property: the future depends only on the present, never on the past. This "memoryless" condition makes the mathematics tractable while still modelling an enormous range of real-world systems — from Google's PageRank and weather sequences to the Metropolis-Hastings sampler underlying Bayesian statistics. This article builds the theory from transition matrices up to random walks and absorbing states, with complete JavaScript implementations.

Written by MySimulator Team · Reviewed by MySimulator Editorial Review

1. The Markov Property and Transition Matrix

A sequence of random variables X₀, X₁, X₂, … taking values in a state space S is a Markov chain if:

P(Xₙ₊₁ = j | Xₙ = i, Xₙ₋₁ = iₙ₋₁, \dots, X₀ = i₀) = P(Xₙ₊₁ = j | Xₙ = i)

All information about the future is captured by the current state. The transition probability P(j | i) = pᵢⱼ is the probability of moving from state i to state j in one step. Collecting these into a matrix gives the transition matrix P where P[i][j] = pᵢⱼ and each row sums to 1 (stochastic matrix).

Example — Sunny/Rainy Weather

States: {Sunny, Rainy} P = | 0.8 0.2 | // row "Sunny": stay sunny 80%, go rainy 20% | 0.4 0.6 | // row "Rainy": go sunny 40%, stay rainy 60%

The distribution after n steps starting from an initial distribution π₀ is:

πₙ = π₀ \cdot Pⁿ

Matrix powers can be computed by repeated multiplication. More efficiently, eigenvalue decomposition (P = V Λ V⁻¹, so Pⁿ = V Λⁿ V⁻¹) separates the components.

2. State Classification

Recurrent

The chain is guaranteed to return to state i infinitely often. All states in a finite, irreducible chain are recurrent.

Transient

There is a positive probability of never returning to state i. Transient states are visited only finitely many times.

Absorbing

Once entered, the chain never leaves. Probability 1 of staying: pᵢᵢ = 1. Used in gambler's ruin and absorbing Markov chain analysis.

Periodic / Aperiodic

A state has period d if returns occur only at multiples of d steps. Aperiodic (d = 1) states are needed for unique stationary distributions.

A chain is irreducible if every state is reachable from every other state (the underlying directed graph is strongly connected). An irreducible, aperiodic chain on a finite state space has a unique stationary distribution.

3. Stationary Distributions and Convergence

A distribution π* is stationary if it is unchanged by the transition:

π* = π* \cdot P // eq. for row vector π* equivalently: Pᵀ π* = π* // π* is left eigenvector of P with eigenvalue 1

The Perron-Frobenius theorem guarantees that an irreducible, aperiodic stochastic matrix has 1 as its largest eigenvalue, with a unique positive eigenvector — the stationary distribution. For the weather example above:

π*[Sunny] = 0.4 / (0.2 + 0.4) = 2/3 \approx 66.7% π*[Rainy] = 0.2 / (0.2 + 0.4) = 1/3 \approx 33.3%

This is computed analytically for 2-state chains using the balance equations. For larger state spaces, power iteration is the practical algorithm: repeatedly multiply a distribution vector by P until it stops changing.

Mixing time is how long the chain takes to get close to π* (measured in total variation distance). Chains with good mixing properties converge quickly; poorly connected chains (e.g. those with near-absorbing states) mix slowly. Mixing time is controlled by the second-largest eigenvalue magnitude |λ₂| — the spectral gap 1 − |λ₂| governs the mixing rate.

4. PageRank — Markov Chains on the Web

Google's original PageRank algorithm models a random surfer walking the web by following hyperlinks uniformly at random. The web graph defines a transition matrix; the stationary distribution gives each page its rank.

Two complications arise:

Dangling nodes: pages with no outgoing links trap the surfer. Fixed by redistributing probability equally to all nodes when a dangling node is reached.
Disconnected components: the chain may not be irreducible. Fixed by the damping factor d (typically 0.85): with probability 1 − d the surfer teleports to a uniformly random page.

G = d \cdot P + (1-d)/n \cdot 𝟏𝟏ᵀ // Google matrix PageRank = stationary distribution of G

The Google matrix is now irreducible and aperiodic regardless of the link structure, so power iteration is guaranteed to converge. In practice ~50–100 iterations of matrix-vector multiplication suffice for the web-scale graph.

5. Random Walks and Gambler's Ruin

1-D Random Walk

At each step the walker moves +1 with probability p and −1 with probability q = 1 − p. Starting at position k, the walk is recurrent (returns to start with probability 1) when p = q = 0.5; when p ≠ q it drifts toward +∞ or −∞ and is transient.

Gambler's Ruin

A gambler starts with £k and plays against a casino with infinite funds. Each round the gambler wins £1 with probability p or loses £1 with probability q. What is the probability of reaching a fortune of £N before going bankrupt (state 0)?

ruin probability from state k: P(ruin | start = k) = 1 - Pₖ Pₖ = { k/N if p = q = 0.5 (fair game) { (1-(q/p)ᵏ) / (1-(q/p)ᴺ) if p \neq q

Even with a tiny edge against the player (p slightly below 0.5), the probability of reaching the target N before ruin plummets exponentially. Casino games are designed exactly so that p < 0.5.

Higher-Dimensional Random Walks

In 1D and 2D, a symmetric random walk is recurrent (returns to origin with probability 1). In 3D and above it becomes transient — the drunken sailor always finds home, but the intoxicated bird is lost forever. This is Pólya's theorem (1921).

6. JavaScript: Power Iteration and Random Walk

Power Iteration for Stationary Distribution

// Power iteration: find the stationary distribution π* of a Markov chain
// P[i][j] = probability of transitioning from state i to state j
function stationaryDistribution(P, tol = 1e-9, maxIter = 10000) {
  const n = P.length;
  let pi = new Float64Array(n).fill(1 / n);  // uniform start

  for (let iter = 0; iter < maxIter; iter++) {
    const next = new Float64Array(n);
    for (let j = 0; j < n; j++)
      for (let i = 0; i < n; i++)
        next[j] += pi[i] * P[i][j];    // πP row-vector multiplication

    // Check convergence via L1 norm
    let err = 0;
    for (let j = 0; j < n; j++) err += Math.abs(next[j] - pi[j]);
    pi = next;
    if (err < tol) break;
  }
  return pi;
}

// Weather example
const P = [[0.8, 0.2], [0.4, 0.6]];
console.log(stationaryDistribution(P));  // ≈ [0.667, 0.333]

Simulating a Random Walk

// 1-D Gambler's Ruin simulation — estimate ruin probability empirically
function simulateRuin(start, target, p, trials = 10_000) {
  let ruined = 0;
  for (let t = 0; t < trials; t++) {
    let x = start;
    while (x > 0 && x < target) {
      x += Math.random() < p ? 1 : -1;
    }
    if (x === 0) ruined++;
  }
  return ruined / trials;
}

// Theoretical ruin probability (fair game)
function ruinProbFair(k, N) { return 1 - k / N; }
function ruinProb(k, N, p) {
  const r = (1 - p) / p;
  return p === 0.5
    ? ruinProbFair(k, N)
    : (1 - Math.pow(r, k)) / (1 - Math.pow(r, N));
}

console.log('Simulated:',  simulateRuin(10, 20, 0.5).toFixed(3));  // ≈ 0.500
console.log('Theoretical:', ruinProb(10, 20, 0.5).toFixed(3));  // 0.500

PageRank with Power Iteration

// Tiny 4-page web graph (0-indexed adjacency list)
function pageRank(edges, n, d = 0.85, iters = 100) {
  // Build row-stochastic transition matrix with dangling-node fix
  const out = new Array(n).fill(0);
  edges.forEach(([i]) => out[i]++);

  let rank = new Float64Array(n).fill(1 / n);

  for (let it = 0; it < iters; it++) {
    const next = new Float64Array(n).fill((1 - d) / n);
    for (const [i, j] of edges) next[j] += d * rank[i] / out[i];
    rank = next;
  }
  return rank;
}

const graph = [[0,1],[0,2],[1,2],[2,0],[3,2]];
console.log(pageRank(graph, 4));  // node 2 should rank highest

7. Continuous-Time Markov Chains

In discrete-time chains steps happen at integer times. A continuous-time Markov chain (CTMC) can jump at any t ≥ 0. The key object is the generator matrix Q (also called the rate matrix):

Qᵢⱼ = transition rate from i to j (j \neq i) [units: per unit time] Qᵢᵢ = -Σⱼ\neqᵢ Qᵢⱼ [rows sum to zero]

The probability distribution evolves via the forward equation (Kolmogorov):

d/dt π(t) = π(t) · Q Solution: π(t) = π(0) · e^{Qt}

CTMCs model queueing systems (M/M/1 queue), chemical reaction networks (via the chemical master equation), and failure/repair processes in reliability engineering.

Connection to MCMC: The Metropolis-Hastings algorithm and Gibbs sampling construct Markov chains whose stationary distributions match a target distribution π* — using them to sample from distributions that are impossible to sample from directly. See the Monte Carlo Methods article for the full derivation.

8. Applications

Natural Language Processing

N-gram language models are Markov chains: the probability of the next word depends only on the previous n − 1 words. Bigram (n = 2) and trigram models were the backbone of statistical NLP before neural language models and remain useful for data-scarce domains.

Reinforcement Learning

A Markov Decision Process (MDP) extends Markov chains with actions and rewards. The agent's policy defines a Markov chain; value functions are stationary expectations under this chain. Policy evaluation, policy iteration, and value iteration all solve the underlying Markov system.

Queueing and Traffic

The M/M/1 queue (Poisson arrivals, exponential service, 1 server) is a birth-death CTMC. Its stationary distribution gives average queue length, waiting times, and utilisation as closed-form expressions — foundational results in capacity planning for networks, call-centres, and cloud infrastructure.

Molecular Dynamics / Protein Folding

Markov State Models (MSMs) discretise conformational space of a protein into metastable states. By decomposing long MD trajectories into short parallel simulations, MSMs extend accessible timescales by orders of magnitude and reveal folding pathways.