|
CATEGORIES Afterlife (1)
Art (9)
Artificial Life (2)
Bioethics (2)
Consciousness (5)
Copyright (1)
Cyberspace (1)
Economics (6)
Education (1)
Entertainment (3)
Fiction (13)
Food (6)
Future (12)
Gadgets (2)
Game Theory (1)
Law (4)
Literature (1)
Mathematics (1)
Media (1)
Medicine (1)
Miscellanea (3)
Outer Space (4)
Physics (6)
Astrophysics (1) Particles (6) Quantum Theory (5) Relativity (3) Special Relativity (1) VBHT (19) Politics (4)
Science Fiction (4)
Television (4)
Travel (5)
|
The Prisoner's Dilemma
A brief description of a simple yet profound problem in game theory.
ARTICLE INFO
The basic idea of the Prisoner's Dilemma is very simple. Both players have a choice between two actions: cooperate or defect. At the end of the game each player gets a number of points that depends on the choices made by both. If both cooperate then they get three points each. If they both defect then they each get one point. If one cooperates and one defects then the defector gets five points and the cooperator gets nothing. You can, of course, change the payoffs, but you have to keep the temptation to defect higher than the reward for cooperating, which must be higher than the punishment for mutual defection, which has to be higher than the amount given to a cooperator when the other player defects. Also, you have to arrange things so that if one cooperates and the other defects and then shares the loot with the cooperator then they don't both make more than they would be both cooperating.
(The initial posing of the problem was in terms of a pair of prisoners making deals with prosectors in a situation in which each could get a reduced sentence by betraying the other to the authorities. I think it was originally studied as a model for superpower confrontation.) Now, in a single game of the Dilemma the optimal strategy is clearly to defect. If your opponent cooperates then you can sucker him by defecting and make more points, but if he defects then you can at best equal his score by also defecting. The whole thing doesn't seem very interesting. Things become much more complex when the same pair of players play against each other repeatedly and can modify their behaviour based on the previous rounds. This is the Iterated Prisoner's Dilemma. In a game of IPD it turns out that the best strategies incorporate the ideas of "niceness" (start by cooperating), "retaliation" (punish those who defect), "forgiveness" (don't get caught up in vendettas - cooperation is better) and "transparency" (randomness is a very bad strategy, and if your strategy is too complex then it behaves pretty much like a random strategy). There are even more complex variants of the game with populations of strategies playing against each other. It would probably be fun to enter an algorithm in an IPD tournament.
|
add comment