# Difference between revisions of "Prisoner's dilemma"

Orthonormal (talk | contribs) (→Blog posts)
(→Blog posts: Added Slepnev's post solving PD in no-oracles ADT, grouped with Nisan's post)

Line 8:

*[http://lesswrong.com/lw/tn/the_true_prisoners_dilemma/ The True Prisoner's Dilemma]

*[http://lesswrong.com/lw/to/the_truly_iterated_prisoners_dilemma/ The Truly Iterated Prisoner's Dilemma]

− *[http://lesswrong.com/lw/do/reformalizing_pd/ Re-formalizing PD] by cousin_it

+ *[http://lesswrong.com/lw/do/reformalizing_pd/ Re-formalizing PD] by [http://lesswrong.com/user/cousin_it/ cousin_it]

*[http://lesswrong.com/lw/1w6/blackmail_nukes_and_the_prisoners_dilemma/ Blackmail, Nukes, and the Prisoner's Dilemma] by Stuart Armstrong

*[http://lesswrong.com/lw/7f2/prisoners_dilemma_tournament_results/ Prisoner's Dilemma Tournament Results] by prase

*[http://lesswrong.com/lw/1cq/the_continued_misuse_of_the_prisoners_dilemma/ The continued misuse of the Prisoner's Dilemma] by [http://silasx.blogspot.com/ Silas Barta]

+

+ Solution in [[ADT]]:

+ *[http://lesswrong.com/lw/2ip/ai_cooperation_in_practice/ AI cooperation in practice] by [http://lesswrong.com/user/cousin_it/ cousin_it]

*[http://lesswrong.com/lw/9o7/formulas_of_arithmetic_that_behave_like_decision/ Formulas of arithmetic that behave like decision agents] by Nisan
## Revision as of 10:43, 25 March 2012

The **Prisoner's dilemma** is a classic problem in game theory. Two players independently have the option of *cooperating* with or *defecting* against the other player. If both players cooperate, they each receive a payoff **C** from a third party (a "banker" or benevolent nature); if both defect, they each receive payoff **D**; if one cooperates and one defects, the defector receives payoff **T** and the cooperator receives payoff **S**. The prisoner's dilemma is the situation in which **T** > **C** > **D** > **S**. The payoffs are assumed to be in utilons, and each player wants only to maximize her own payoff, without any regard in either direction for the other player.
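The payoff structure described above can be written down as a small lookup table. The following is a minimal sketch, not a canonical formalization: the function names `is_prisoners_dilemma` and `payoffs` and the numeric values used in the usage note are illustrative assumptions, not part of the original article.

```python
from typing import Tuple

COOPERATE, DEFECT = "cooperate", "defect"


def is_prisoners_dilemma(T: float, C: float, D: float, S: float) -> bool:
    """Return True when the payoffs satisfy the ordering T > C > D > S."""
    return T > C > D > S


def payoffs(move_a: str, move_b: str,
            T: float, C: float, D: float, S: float) -> Tuple[float, float]:
    """Return (payoff to player A, payoff to player B) for one round."""
    table = {
        (COOPERATE, COOPERATE): (C, C),  # both cooperate
        (COOPERATE, DEFECT): (S, T),     # A cooperates, B defects
        (DEFECT, COOPERATE): (T, S),     # A defects, B cooperates
        (DEFECT, DEFECT): (D, D),        # both defect
    }
    return table[(move_a, move_b)]
```

For example, with the assumed values T=5, C=3, D=1, S=0, `is_prisoners_dilemma(5, 3, 1, 0)` holds, and `payoffs(DEFECT, COOPERATE, 5, 3, 1, 0)` gives the defector T and the cooperator S.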

Notice that if you treat the other player's decision as completely independent of yours, defecting is better whatever the other player does: if the other player defects, you score higher by defecting as well (**D** > **S**), and if the other player cooperates, you also do better by defecting (**T** > **C**). So it would seem that the rational decision is to defect (at least if the game is to be played only once), and indeed, this is what classical causal decision theory says. And yet, if only both players could somehow agree to cooperate, they would both do better than if they both defected (**C** > **D**). If the players are timeless decision agents, they can.
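The dominance argument can be checked mechanically. This sketch assumes illustrative payoff values (5, 3, 1, 0), chosen only because they satisfy **T** > **C** > **D** > **S**; the helper `best_reply` is a hypothetical name. It models only the causal, move-by-move comparison, not timeless decision theory.

```python
# Illustrative payoffs (assumed values, chosen only to satisfy T > C > D > S).
T, C, D, S = 5, 3, 1, 0

MOVES = ("cooperate", "defect")

# PAYOFF[(my_move, their_move)] is my own payoff for that pair of moves.
PAYOFF = {
    ("cooperate", "cooperate"): C,
    ("cooperate", "defect"): S,
    ("defect", "cooperate"): T,
    ("defect", "defect"): D,
}


def best_reply(their_move: str) -> str:
    """My payoff-maximizing move when the other player's move is held fixed."""
    return max(MOVES, key=lambda my_move: PAYOFF[(my_move, their_move)])


# Defection is the best reply to either move, so it is a dominant strategy.
defect_dominates = all(best_reply(m) == "defect" for m in MOVES)

# Yet each player gets more from mutual cooperation than from mutual defection.
mutual_cooperation_better = (
    PAYOFF[("cooperate", "cooperate")] > PAYOFF[("defect", "defect")]
)
```

Both flags come out true for any payoffs with the required ordering, which is exactly the tension the paragraph describes: the individually dominant move leads to an outcome that is worse for both players.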

## Blog posts

- The True Prisoner's Dilemma
- The Truly Iterated Prisoner's Dilemma
- Re-formalizing PD by cousin_it
- Blackmail, Nukes, and the Prisoner's Dilemma by Stuart Armstrong
- Prisoner's Dilemma Tournament Results by prase
- The continued misuse of the Prisoner's Dilemma by Silas Barta

Solution in ADT:

- AI cooperation in practice by cousin_it
- Formulas of arithmetic that behave like decision agents by Nisan

## External links

- Prisoner's dilemma (Stanford Encyclopedia of Philosophy)

## See also

- Game theory
- Decision theory
- Newcomb's problem
- Counterfactual mugging
- Parfit's hitchhiker
- Smoking lesion
- Absentminded driver
- Pascal's mugging

## References

- Drescher, Gary (2006). *Good and Real*. Cambridge: The MIT Press. ISBN 0262042339.