Question 1

What does this simulation actually show?

Accepted Answer

It shows how cooperation can emerge, collapse or cycle when many agents repeatedly play the Prisoner's Dilemma on a grid. Each coloured cell is an agent running one of five strategies, and over successive rounds the more successful strategies spread across the population through imitation.

Question 2

How does the evolution rule work?

Accepted Answer

After every round each agent looks at its own score and the scores of its eight neighbours, then adopts the strategy of whichever one earned the most points that round. This imitate the best neighbour rule is a standard form of spatial evolutionary dynamics, with no genetics or reproduction, just copying of successful behaviour.

Question 3

What do the T, R, P and S sliders control?

Accepted Answer

They set the four payoffs of the dilemma: Temptation (defect against a cooperator), Reward (mutual cooperation), Punishment (mutual defection) and Sucker (cooperate against a defector). A genuine dilemma needs T greater than R greater than P greater than S, and ideally 2R greater than T plus S so that mutual cooperation beats alternating exploitation.

Question 4

What are the five strategies?

Accepted Answer

Always Cooperate (green) and Always Defect (red) ignore the opponent. Tit-for-Tat (blue) repeats the neighbour's last move towards it. Pavlov, or Win-Stay Lose-Shift (purple), keeps its move if the last payoff was good and switches if it was poor. Random (amber) cooperates or defects with a 50/50 coin flip each interaction.

Question 5

Why does Tit-for-Tat usually do so well?

Accepted Answer

In Robert Axelrod's 1980 tournaments Tit-for-Tat beat 62 rival strategies because it is nice (never defects first), retaliatory (punishes defection at once), forgiving (returns to cooperation immediately) and clear. On a grid, clusters of TFT agents shield one another from exploitation, letting cooperation hold a foothold even amid defectors.

Question 6

What is the difference between a Nash equilibrium and a Pareto optimum here?

Accepted Answer

In a one-shot game mutual defection is the Nash equilibrium: neither player can gain by switching alone. Mutual cooperation is the Pareto optimum: you cannot improve one agent without harming the other. The tragedy of the dilemma is that rational self-interest drives players to the worse, Nash outcome.

Question 7

What does the noise slider do?

Accepted Answer

Noise sets a mutation rate, from 0 to 20 per cent. After the imitation step, each agent has that probability of instead adopting a randomly chosen strategy. A little noise keeps the system from freezing into a single absorbing state and lets new strategies be re-seeded so the dynamics stay lively.

Question 8

Is the spatial structure important?

Accepted Answer

Yes. Because agents only interact with eight immediate neighbours rather than the whole population, cooperators can form protective clusters whose interior members all cooperate and score well. This local structure is exactly why cooperation survives spatially when it would be wiped out in a well-mixed population.

Question 9

What do the presets do?

Accepted Answer

All Defectors seeds a sea of defectors with a tiny TFT core to test invasion; TFT Invasion uses a larger cooperative cluster; Pavlov World fills the grid mostly with Pavlov agents; and Mixed starts from a weighted blend of all five strategies. They are quick ways to explore which configurations let cooperation take hold.

Question 10

How accurate is this compared with real game theory?

Accepted Answer

The payoff matrix, strategy definitions and Nash and Pareto analysis are faithful to standard theory, and the imitate-best dynamic is a recognised model used by Nowak and May. It is a simplified teaching tool, though: real ecologies involve continuous strategies, memory of many past rounds and reproduction rather than pure copying.

Question 11

Where does the Prisoner's Dilemma appear in the real world?

Accepted Answer

It models any situation where private incentives undermine shared gains: nations emitting carbon despite a collective interest in restraint, rivals locked in arms races, bacteria producing public goods that cheaters exploit, and firms tempted to undercut a mutually profitable price. The simulation hints at how repetition and structure can rescue cooperation.

	C	D
C	R=3	S=0
D	T=5	P=1

🎮 Game Theory — Prisoner's Dilemma & Evolutionary Strategies

Legend

Payoff Matrix

Grid & Speed

Presets

Stats

The Prisoner's Dilemma

Nash Equilibrium vs. Pareto Optimum

Why Tit-for-Tat wins

Real-world applications

About Evolutionary Game Theory

Frequently Asked Questions