Tit for tat

Tit for tat is a highly effective strategy in game theory for the iterated prisoner's dilemma. It was first introduced by Anatol Rapoport in Robert Axelrod's two tournaments, held around 1980. Based on the English saying meaning "equivalent retaliation" ("tit for tat"), an agent using this strategy will initially cooperate, then respond in kind to an opponent's previous action. If the opponent previously was cooperative, the agent is cooperative. If not, the agent is not. This is similar to reciprocal altruism in biology.

Overview

This strategy is dependent on four conditions that has allowed it to become the most prevalent strategy for the Prisoner's Dilemma:

Unless provoked, the agent will always cooperate
If provoked, the agent will retaliate
The agent is quick to forgive
The agent must have a good chance of competing against the opponent more than once.

In the last condition, the definition of "good chance" depends on the payoff matrix of the prisoner's dilemma. The important thing is that the competition continues long enough for repeated punishment and forgiveness to generate a long-term payoff higher than the possible loss from cooperating initially.

A fifth condition applies to make the competition meaningful: if an agent knows that the next play will be the last, it should naturally defect for a higher score. Similarly if it knows that the next two plays will be the last, it should defect twice, and so on. Therefore the number of competitions must not be known in advance to the agents.

Against a variety of alternative strategies, tit for tat was the most effective, winning in several annual automated tournaments against (generally far more complex) strategies created by teams of computer scientists, economists, and psychologists. Game theorists informally believed the strategy to be optimal (although no proof was presented).

It is important to know that tit for tat still is the most effective strategy if you compare the average performance of each competing team. The team which recently won over a pure tit for tat team only outperformed it with some of their algorithms because they submitted multiple algorithms which would recognize each other and assume a master and slave relationship (one algorithm would "sacrifice" itself and obtain a very poor result in order for the other algorithm to be able to outperform Tit for Tat on an individual basis, but not as a pair or group). Still, this "group" victory illustrates an important limitation of the Prisoner's Dilemma in representing social reality, namely, that it does not include any natural equivalent for friendship or alliances. The advantage of "tit for tat" thus pertains only to a Hobbesian world of rational solutions, not to a world in which humans are inherently social.

Example of play

	Cooperate	Defect
Cooperate	3, 3	0, 5
Defect	5, 0	1, 1
Prisoner's dilemma example

Assume there are 4 agents: 2 are Tit for Tat players ("variables") and 2 are "Defectors", simply trying to maximize their own winnings by always giving evidence against the other. Assume that each player faces the other 3 in a match lasting 6 games. If one player gives evidence against a player who does not, the former gains 5 points and the latter nets 0. If both refrain from giving evidence, both gain 3 points. If both give evidence against each other, both gain 1 point.

When a variable faces off against a defector, the former refrains from giving evidence in the first game while the defector does the opposite, gaining the control 5 points. In the remaining 5 games, both players give evidence against each other, netting 1 point each game. The final score is: Defector - 10 | Variable - 5.

When the variables face off against each other, each refrains from giving evidence in all 6 games. 6 * 3 = 18 points, the final score being Variable(1) - 18 | Variable(2) - 18.

When the defectors face off, each gives evidence against the other in all 6 games. 6 * 1 = 6 points, the final score being Defector(1) - 6 | Defector(2) - 6.

The final score for each variable is 5 (game against defector(1)) + 5 (game against defector(2)) + 18 (game against variable) = 28 points. The final score for each defector is 10 (against variable(1)) + 10 (against variable(2)) + 6 (against defector) = 26 points.

Despite the fact that the variables never won a match and the defectors never lost a match, the variables still came out ahead, because the final score is not determined by the winner of matches, but the scorer of points. Simply put, the variables gained more points tying with each other than they lost to the defectors. The more variables that there are in the game, the more advantage it is to be a variable.

(This example was taken from Piers Anthony's novel, Golem in the Gears.)

Implications

The success of the strategy, which is largely cooperative, took many by surprise. In successive competitions various teams produced complex strategies which attempted to "cheat" in a variety of cunning ways, but Tit for Tat eventually prevailed in every competition.

Some theorists believe this result may give insight into how groups of animals (and particularly human societies) have come to live in largely (or entirely) cooperative societies, rather than the individualistic "red in tooth and claw" way that might be expected from individual engaged in a Hobbesian state of nature. This, and particularly its application to human society and politics, is the subject of Robert Axelrod's book The Evolution of Cooperation. Also the theory can give insight in how technological innovation have taken place in history, and in particular, why the modern age evolved in the many competing kingdoms of Europe, but not for example in China.

Problems

While it has been empirically shown (by Axelrod) that the strategy is optimal in some cases, two agents playing tit for tat remain vulnerable. A one-time, single-bit error in either player's interpretation of events can lead to an unending "death spiral". In this symmetric situation, each side perceives itself as preferring to cooperate, if only the other side would. But each is forced by the strategy into repeatedly punishing an opponent who continues to attack despite being punished in every game cycle. Both sides come to think of themselves as innocent and acting in self-defense, and their opponent as either evil or too stupid to learn to cooperate.

This situation frequently arises in real world conflicts, ranging from schoolboy fights to civil and regional wars. Tit for two tats could be used to avoid this problem.

"Tit for Tat with forgiveness" is sometimes superior. When the opponent defects, on the next move, the player sometimes cooperates anyway, with a small probability (around 1%-5%). This allows for occasional recovery from getting trapped in a cycle of defections. The exact probability depends on the line-up of opponents. "Tit for Tat with forgiveness" is best when miscommunication is introduced to the game — when one's move is incorrectly reported to the opponent.

Tit for two tats

Tit for Two Tats is similar to Tit for Tat in that it is nice, retaliating, forgiving and non-envious, the only difference between the two being how nice the strategy is.

In a tit for tat strategy once an opponent defects, the tit for tat player immediately responds by defecting on the next move. This has the unfortunate consequence of causing two retaliatory strategies to continuously defect against one another resulting in a poor outcome for both players. A tit for two tats player will let the first defection go unchallenged as a means to avoid the "death spiral" of the previous example. If the opponent defects twice in a row, the tit for two tats player will respond by defecting.

This strategy was put forward by Robert Axelrod during his second round of computer simulations at RAND. After analyzing the results of the first experiment he determined that had a participant entered the tit for two tats strategy it would have emerged with a higher cumulative score than any other program. As a result he himself entered it with high expectations in the second tournament. Unfortunately due to the more aggressive nature of the programs entered in the second round, tit for two tats did significantly worse than tit for tat due to aggressive strategies being able to take advantage of its highly forgiving nature.

Popular culture

The tit for tat strategy was employed in an episode of Numb3rs, where FBI agents were interrogating and attempting to obtain information from an inmate on death row. The strategy was working, but the FBI would not implement a "tit for two tats".

Live and Let Live

The tit for tat strategy has been detected by analysts in the spontaneous non-violent behaviour, called Live and Let Live (World war I) that arose during the First World War.

External links

References

The Evolution of Cooperation, Robert Axelrod, Basic Books, ISBN 0-465-02121-2
The Selfish Gene, Richard Dawkins (1990), second edition -- includes two chapters about the evolution of cooperation, ISBN 0-19-286092-5
The Origins of Virtue, Matt Ridley, Penguin Books Ltd, ISBN 0-14-024404-2
How Are We to Live?, Peter Singer, Prometheus Books, ISBN 0-87975-966-6

v t e Topics of game theory
Definitions	Congestion game Cooperative game Determinacy Escalation of commitment Extensive-form game First-player and second-player win Game complexity Graphical game Hierarchy of beliefs Information set Normal-form game Preference Sequential game Simultaneous game Simultaneous action selection Solved game Succinct game Mechanism design
Equilibrium concepts	Bayes correlated equilibrium Bayesian Nash equilibrium Berge equilibrium Core Correlated equilibrium Coalition-proof Nash equilibrium Epsilon-equilibrium Evolutionarily stable strategy Gibbs equilibrium Mertens-stable equilibrium Markov perfect equilibrium Nash equilibrium Pareto efficiency Perfect Bayesian equilibrium Proper equilibrium Quantal response equilibrium Quasi-perfect equilibrium Risk dominance Satisfaction equilibrium Self-confirming equilibrium Sequential equilibrium Shapley value Strong Nash equilibrium Subgame perfection Trembling hand equilibrium
Strategies	Appeasement Backward induction Bid shading Collusion Cheap talk De-escalation Deterrence Escalation Forward induction Grim trigger Markov strategy Pairing strategy Dominant strategies Pure strategy Mixed strategy Strategy-stealing argument Tit for tat
Classes of games	Auction Bargaining problem Global game Intransitive game Mean-field game n-player game Perfect information Large Poisson game Potential game Repeated game Screening game Signaling game Strictly determined game Stochastic game Symmetric game Zero-sum game
Games	Go Chess Infinite chess Checkers All-pay auction Prisoner's dilemma Gift-exchange game Optional prisoner's dilemma Traveler's dilemma Coordination game Chicken Centipede game Lewis signaling game Volunteer's dilemma Dollar auction Battle of the sexes Stag hunt Matching pennies Ultimatum game Electronic mail game Rock paper scissors Pirate game Dictator game Public goods game Blotto game War of attrition El Farol Bar problem Fair division Fair cake-cutting Bertrand competition Cournot competition Stackelberg competition Deadlock Diner's dilemma Guess 2/3 of the average Kuhn poker Nash bargaining game Induction puzzles Trust game Princess and monster game Rendezvous problem
Theorems	Aumann's agreement theorem Folk theorem Minimax theorem Nash's theorem Negamax theorem Purification theorem Revelation principle Sprague–Grundy theorem Zermelo's theorem
Key figures	Albert W. Tucker Amos Tversky Antoine Augustin Cournot Ariel Rubinstein Claude Shannon Daniel Kahneman David K. Levine David M. Kreps Donald B. Gillies Drew Fudenberg Eric Maskin Harold W. Kuhn Herbert Simon Hervé Moulin John Conway Jean Tirole Jean-François Mertens Jennifer Tour Chayes John Harsanyi John Maynard Smith John Nash John von Neumann Kenneth Arrow Kenneth Binmore Leonid Hurwicz Lloyd Shapley Melvin Dresher Merrill M. Flood Olga Bondareva Oskar Morgenstern Paul Milgrom Peyton Young Reinhard Selten Robert Axelrod Robert Aumann Robert B. Wilson Roger Myerson Samuel Bowles Suzanne Scotchmer Thomas Schelling William Vickrey
Search optimizations	Alpha–beta pruning Aspiration window Principal variation search max^n algorithm Paranoid algorithm Lazy SMP
Miscellaneous	Bounded rationality Combinatorial game theory Confrontation analysis Coopetition Evolutionary game theory Glossary of game theory List of game theorists List of games in game theory No-win situation Topological game Tragedy of the commons