Difference between revisions of "Prisoner's dilemma"

From Lesswrongwiki
Jump to: navigation, search
(Blog posts)
(Blog posts: Added Slepnev's post solving PD in no-oracles ADT, grouped with Nisan's post)
Line 8: Line 8:
 
*[http://lesswrong.com/lw/tn/the_true_prisoners_dilemma/ The True Prisoner's Dilemma]
 
*[http://lesswrong.com/lw/tn/the_true_prisoners_dilemma/ The True Prisoner's Dilemma]
 
*[http://lesswrong.com/lw/to/the_truly_iterated_prisoners_dilemma/ The Truly Iterated Prisoner's Dilemma]
 
*[http://lesswrong.com/lw/to/the_truly_iterated_prisoners_dilemma/ The Truly Iterated Prisoner's Dilemma]
*[http://lesswrong.com/lw/do/reformalizing_pd/ Re-formalizing PD] by cousin_it
+
*[http://lesswrong.com/lw/do/reformalizing_pd/ Re-formalizing PD] by [http://lesswrong.com/user/cousin_it/ cousin_it]
 
*[http://lesswrong.com/lw/1w6/blackmail_nukes_and_the_prisoners_dilemma/ Blackmail, Nukes, and the Prisoner's Dilemma] by Stuart Armstrong
 
*[http://lesswrong.com/lw/1w6/blackmail_nukes_and_the_prisoners_dilemma/ Blackmail, Nukes, and the Prisoner's Dilemma] by Stuart Armstrong
 
*[http://lesswrong.com/lw/7f2/prisoners_dilemma_tournament_results/ Prisoner's Dilemma Tournament Results] by prase
 
*[http://lesswrong.com/lw/7f2/prisoners_dilemma_tournament_results/ Prisoner's Dilemma Tournament Results] by prase
 
*[http://lesswrong.com/lw/1cq/the_continued_misuse_of_the_prisoners_dilemma/ The continued misuse of the Prisoner's Dilemma] by [http://silasx.blogspot.com/ Silas Barta]
 
*[http://lesswrong.com/lw/1cq/the_continued_misuse_of_the_prisoners_dilemma/ The continued misuse of the Prisoner's Dilemma] by [http://silasx.blogspot.com/ Silas Barta]
 +
 +
Solution in [[ADT]]:
 +
*[http://lesswrong.com/lw/2ip/ai_cooperation_in_practice/ AI cooperation in practice] by [http://lesswrong.com/user/cousin_it/ cousin_it]
 
*[http://lesswrong.com/lw/9o7/formulas_of_arithmetic_that_behave_like_decision/ Formulas of arithmetic that behave like decision agents] by Nisan
 
*[http://lesswrong.com/lw/9o7/formulas_of_arithmetic_that_behave_like_decision/ Formulas of arithmetic that behave like decision agents] by Nisan
  

Revision as of 10:43, 25 March 2012

Smallwikipedialogo.png
Wikipedia has an article about

The Prisoner's dilemma is a classic problem in game theory. Two players independently have the option of cooperating with or defecting against the other player. If both players cooperate, they each receive a payoff C from a third party (a "banker" or benevolent nature); if they each defect, they both receive payoff D; if one cooperates and one defects, the defector receives payoff T and the cooperator receives payoff S. The prisoner's dilemma is the situation in which T > C > D > S. The payoffs are assumed to be in utilons, each player only wants to maximize her own payoff, without any regard in either direction for the other player.

Notice that if you treat the other player's decision as completely independent from yours, if the other player defects, then you score higher if you defect as well, whereas if the other player cooperates, you do better by defecting. So it would seem that the rational decision would be to defect (at least if the game is to be played only once), and indeed, this is what classical causal decision theory says. And yet—and yet, if only somehow both players could agree to cooperate, they would both do better than if they both defected. If the players are timeless decision agents, they can.

Blog posts

Solution in ADT:

External links

See also

References

  • Drescher, Gary (2006). Good and Real. Cambridge: The MIT Press. ISBN 0262042339.