Misplaced Pages

Talk:Prisoner's dilemma: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editContent deleted Content addedVisualWikitext
Revision as of 03:36, 9 March 2024 editCewbot (talk | contribs)Bots7,346,969 editsm Maintain {{WPBS}}: 8 WikiProject templates. Remove 2 deprecated parameters: field, frequentlyviewed.Tag: Talk banner shell conversion← Previous edit Latest revision as of 12:38, 23 November 2024 edit undoLowercase sigmabot III (talk | contribs)Bots, Template editors2,294,402 editsm Archiving 1 discussion(s) to Talk:Prisoner's dilemma/Archive 5) (bot 
(15 intermediate revisions by 10 users not shown)
Line 1: Line 1:
{{Talkheader|archive_age=1|archive_units=year}} {{Talkheader}}
{{ArticleHistory {{ArticleHistory
|action1=RBP |action1=RBP
Line 42: Line 42:
|small=yes}} |small=yes}}


== what? ==
== Where does "Outcome D" arise from? ==


''A purely selfish strategy will not "cheat" on its opponent for purely self-interested reasons first.''
Outcome D: If A and B both betray the other, they will share the sentence and serve 5 years


Why is that sentence in the "Nice" section? How is a purely selfish strategy "nice?" ] (]) 04:39, 24 November 2023 (UTC)
I am failing to see where outcome D comes from? Is it implied in the text?


:I also don't understand the logic of that sentence. How is a ''purely'' selfish strategy not purely selfish? ] (]) 04:42, 24 November 2023 (UTC)
Premise:
::I didn't find many mentions of "purely selfish" in decent sources after a quick search, although I think it means does not consider the other party at all, that is, it always chooses the best outcome--defect. I am changing this to "This strategy will not "cheat" on its opponent for purely self-interested reasons first" for now. ] (]) 02:02, 19 June 2024 (UTC)
:Yeah, this section is a bit convoluted. "Nice" simply means the algorithm will not defect first, compared to algorithms which will try to defect. ] (]) 01:50, 19 June 2024 (UTC)


== Christmas truce in "See also" ==
William Poundstone described the game in his 1993 book Prisoner's Dilemma:


No inline citation is given for why the ] is mentioned at ] of this article, and mentioning it in this article seems inappropriate - can anyone explain? As I understand it, the Christmas truce was a ] and therefore should instead be mentioned in that article:
Two members of a criminal gang, A and B, are arrested and imprisoned. Each prisoner is in solitary confinement with no means of communication with their partner. The principal charge would lead to a sentence of ten years in prison; however, the police do not have the evidence for a conviction. They plan to sentence both to two years in prison on a lesser charge but offer each prisoner a Faustian bargain: If one of them confesses to the crime of the principal charge, betraying the other, they will be pardoned and free to leave while the other must serve the entirety of the sentence instead of just two years for the lesser charge.


- if both sides cooperate, everybody involved gets to live that day and enjoy the truce;
] (]) 12:02, 17 June 2023 (UTC)


- if both sides betray, then the war continues as usual with some casualties being expected;
: I noticed the same when I read it a few days ago but didn't have the time or energy to investigate it or even mention it on the talk page. But I agree completely: There is a non-stated premise that if both are convicted they will share the 10 year sentence in the given hypothetical justice system, and it is confusing that this premise only shows up in Outcome D, out of the blue.
: So now I looked further into it, and it turns out that the quote has been messed with. It is not a proper quote from the stated book. The book (William Poundstone: Prisoner's Dilemma, February 1993, ISBN 0-385-41580-X) says this at page 118:
: "A typical contemporary version of the story goes like this: Two members of a criminal gang are arrested and imprisoned. Each prisoner is in solitary confinement with no means of speaking to or exchanging messages with the other. The police admit they don't have enough evidence to convict the pair on the principal charge. They plan to sentence both to a year in prison on a lesser charge. Simultaneously, the police offer each prisoner a Faustian bargain. If he testifies against his partner, he will go free while the partner will get three years in prison on the main charge. Oh, yes, there is a catch ... If ''both'' prisoners testify against each other, both will be sentenced to two years in jail."
: So I guess we should either (1) correct the quote and then also correct the text after the quote to match it, or (2) state our completely own version of the dilemma, without referring to any book, but then I think we could border on something like ]. I guess option (1) seems to be the most straightforward to ensure we align with reliable sources.
: We should also make clear that the quote is just a "typical contemporary version" mentioned in that book, because right now, without reading the article lead, the text can give the impression that this book is the original source of the prisoner's dilemma (even though the book of course makes no such claim; on the contrary, it clearly states the origin as Merrill Flood and Melvin Dresher in 1950, with the name coming from Albert W. Tucker). --] (]) 11:43, 18 June 2023 (UTC)
::I have adjusted the article now with the correct quote and correction of the actual numbers mentioned in that specific version of the story. I also changed outcomes ABCD to 1234 to lessen confusion with prisoners A and B. ] (]) 13:33, 18 June 2023 (UTC)


- if one side cooperates with the other betraying, then the cooperating side suffers massive casualties due to an ill-fated attempt to cross ], while the betraying side can shoot the enemy with impunity, but does not enjoy a truce that day. ] (]) 08:53, 7 January 2024 (UTC)
== I can't reconcile these 2 statements in the article ==


As it turns out, the mentioning of the Christmas truce was introduced in with no reason given. ] (]) 19:45, 2 February 2024 (UTC)
I can't reconcile these 2 statements:


== The grammar is wrong. ==
"For ] to emerge between rational players, the number of rounds must be unknown or infinite."


] redirects to ]. The plural possessive, however, is the correct form (regardless of how many times the mistake has been repeated in the past) and the singular possessive is wrong.
...


While there's only one dilemma (it's the same dilemma), it ''must'' be faced by multiple prisoners, otherwise the entire thought experiment is meaningless. Therefore multiple prisoners face the dilemma in question. It's the prisoners' dilemma, not any one prisoner's dilemma.
"Another aspect of the iterated prisoner's dilemma is that this tacit agreement between players has always been established successfully even when the number of iterations is made public to both sides."


This grammatical atrocity must be ferreted out and stopped anywhere it can, and redirecting from the correct grammar to the wrong grammar is enabling the reign of the kingdom of the Doofuses.
As an aside, I see the principles discussed as similar to the concept of ] ] ] (]) 21:37, 20 July 2023 (UTC)


Next step: figuring out how to make Chrome stop underlining the correct form.
:Yeah I think ] belongs in a "see also" for Prinsoners Dilemma, it links back here and provides context:
] (]) 03:35, 18 February 2024 (UTC)
:"
:Etymology | ]
:The word can be found being used in the context of playing zero-sum games in a publication by the Mental Health Research Institute in 1956. It is alternately claimed that the first known use of the term was in Nigel Howard's book ''Paradoxes of Rationality: Theory of Metagames and Political Behavior'' published in 1971, where Howard used the term in his analysis of the ] political landscape using a variation of the ]., however Howard used the term in ''Metagame Analysis in Political Problems'' published in 1966. In 1967, the word appeared in a study by Russell Lincoln Ackoff and in the Bulletin of the Operations Research Society of America
:" ] (]) 21:40, 20 July 2023 (UTC)


:Hi, What you are commenting is not a grammar issue, it is just a preference for naming the game. Grammatically both expressions are correct. We could also argue that the dilemma is not really about prisoners but about criminals or defendants, and argue in favor of changing the name of the game accordingly. Now, i personally prefer singular, as it puts emphasis on the point of view of one (arbitrary) criminal/defendant/prisoner, and the concept of Nash equilibrium also puts emphasis on the issue with (isolated/distributed) individual decisions. I checked the corresponding wiki page in about 10 other languages and singular is used in all. ] (]) 06:36, 7 May 2024 (UTC)
== what? ==


== Plus/minus and payoff/cost ==
''A purely selfish strategy will not "cheat" on its opponent for purely self-interested reasons first.''


I suspect that many may be puzzled comparing the "payoff" matrix in the ], displaying years in prison as payoffs, and the one in the ], displaying, as expected, a more desirable payoff with a greater value than a less desirable one.
Why is that sentence in the "Nice" section? How is a purely selfish strategy "nice?" ] (]) 04:39, 24 November 2023 (UTC)
We ask for instance in a general prisoner's dilemma that <math>R>P</math>, where ''R'' is the payoff for cooperating=staying silent, and ''P'' that for defecting=testifying; this is inconsistent with payoffs being years in prison. The matrix displaying years in prison should rather be called "cost matrix", or be qualified with some other negative word like "punishment/penalty/sentence/...", and then the page should have a word of explanation to relate payoffs and costs. Alternatively we can display negatives of years in prison, and keep calling it "payoff matrix": this is consistent with the definition of a general prisoner's dilemma. ] (]) 05:46, 7 May 2024 (UTC)


== Moral philosophy ==
:I also don't understand the logic of that sentence. How is a ''purely'' selfish strategy not purely selfish? ] (]) 04:42, 24 November 2023 (UTC)
I was reading today about the applications of prisoner's dilemma in moral philosophy, and noticed that there is no section about it on this page!
I just added a new section, along with some opinions. Feel free to expand and discuss


== I still can't reconcile these 2 statements in the current article ==
== what person? ==


I still can't reconcile these 2 statements in the current article:
''For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, '''that''' person is at a slight disadvantage because of the loss on the first turn. In such a population, the optimal strategy is to defect every time.'' ] (]) 04:45, 24 November 2023 (UTC)


"For cooperation to emerge between rational players, the number of rounds must be unknown or infinite.
== Christmas truce in "See also" ==


"Another aspect of the iterated prisoner's dilemma is that this tacit agreement between players has always been established successfully even when the number of iterations is made public to both sides."
No inline citation is given for why the ] is mentioned at ] of this article, and mentioning it in this article seems inappropriate - can anyone explain? As I understand it, the Christmas truce was a ] and therefore should instead be mentioned in that article:


<nowiki>~~~~</nowiki> ] (]) 01:17, 23 July 2024 (UTC)
- if both sides cooperate, everybody involved gets to live that day and enjoy the truce;


:I think the idea is that when real people play repeated rounds they do not follow the rational / dominant strategy, and co-operation still emerges. I can believe this is true, but it would probably be better supported by a reference. ] (]) 08:01, 23 July 2024 (UTC)
- if both sides betray, then the war continues as usual with some casualties being expected;


::I can find references in which co-operation emerges in a finite game (I'll leave this reference here as an example<ref>{{cite journal |last1=Cooper |first1=Russell |last2=DeJong |first2=Douglas V. |last3=Forsythe |first3=Robert |last4=Ross |first4=Thomas W. |title=Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games |journal=Games and Economic Behavior |date=1996 |volume=12 |issue=2 |pages=187-218 |doi=10.1006/game.1996.0013}}</ref>) but the word ''always'' needs some justification. I'm tempted to rephrase without the ''always'' but equally good if there is a citation to support it. ] (]) 08:32, 23 July 2024 (UTC)
- if one side cooperates with the other betraying, then the cooperating side suffers massive casualties due to an ill-fated attempt to cross ], while the betraying side can shoot the enemy with impunity, but does not enjoy a truce that day. ] (]) 08:53, 7 January 2024 (UTC)


As it turns out, the mentioning of the Christmas truce was introduced in with no reason given. ] (]) 19:45, 2 February 2024 (UTC)


{{Reflist-talk}}<!-- Template:Reflist-talk creates a section-level reference list box. Please add comments and references for this section's discussion above this template. When a new discussion begins, the new section will be added below this template. Add a new {{Reflist-talk}} at the end of that section if needed. -->
== The grammar is wrong. ==


== Suppose a prisoner testified but perjured their testimony with false extraneous details. ==
] redirects to ]. The plural possessive, however, is the correct form (regardless of how many times the mistake has been repeated in the past) and the singular possessive is wrong.


Just saying. Hope no policy has been broken by my use of this. ] (]) 17:16, 11 September 2024 (UTC)
While there's only one dilemma (it's the same dilemma), it ''must'' be faced by multiple prisoners, otherwise the entire thought experiment is meaningless. Therefore multiple prisoners face the dilemma in question. It's the prisoners' dilemma, not any one prisoner's dilemma.


== Lacks reference ==
This grammatical atrocity must be ferreted out and stopped anywhere it can, and redirecting from the correct grammar to the wrong grammar is enabling the reign of the kingdom of the Doofuses.


The following paragraph lacks reference.
Next step: figuring out how to make Chrome stop underlining the correct form.

] (]) 03:35, 18 February 2024 (UTC)
"In contrast to the one-time prisoner's dilemma game, the optimal strategy in the iterated prisoner's dilemma depends upon the strategies of likely opponents, and how they will react to defections and cooperation. For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, that person is at a slight disadvantage because of the loss on the first turn. In such a population, the optimal strategy is to defect every time. More generally, given a population with a certain percentage of always-defectors with the rest being tit-for-tat players, the optimal strategy depends on the percentage and number of iterations played." ] (]) 02:52, 25 September 2024 (UTC)

Latest revision as of 12:38, 23 November 2024

This is the talk page for discussing improvements to the Prisoner's dilemma article.
This is not a forum for general discussion of the article's subject.
Article policies
Find sources: Google (books · news · scholar · free images · WP refs· FENS · JSTOR · TWL
Archives: 1, 2, 3, 4, 5Auto-archiving period: 12 months 
Former featured articlePrisoner's dilemma is a former featured article. Please see the links under Article milestones below for its original nomination page (for older articles, check the nomination archive) and why it was removed.
Main Page trophyThis article appeared on Misplaced Pages's Main Page as Today's featured article on March 16, 2004.
Article milestones
DateProcessResult
January 19, 2004Refreshing brilliant proseKept
April 16, 2008Featured article reviewDemoted
Current status: Former featured article
This  level-5 vital article is rated C-class on Misplaced Pages's content assessment scale.
It is of interest to the following WikiProjects:
WikiProject iconGame theory Top‑importance
WikiProject iconThis article is part of WikiProject Game theory, an attempt to improve, grow, and standardize Misplaced Pages's articles related to Game theory. We need your help!
Join in | Fix a red link | Add content | Weigh inGame theoryWikipedia:WikiProject Game theoryTemplate:WikiProject Game theorygame theory
TopThis article has been rated as Top-importance on the importance scale.
WikiProject iconMathematics High‑priority
WikiProject iconThis article is within the scope of WikiProject Mathematics, a collaborative effort to improve the coverage of mathematics on Misplaced Pages. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.MathematicsWikipedia:WikiProject MathematicsTemplate:WikiProject Mathematicsmathematics
HighThis article has been rated as High-priority on the project's priority scale.
WikiProject iconEconomics High‑importance
WikiProject iconThis article is within the scope of WikiProject Economics, a collaborative effort to improve the coverage of Economics on Misplaced Pages. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.EconomicsWikipedia:WikiProject EconomicsTemplate:WikiProject EconomicsEconomics
HighThis article has been rated as High-importance on the project's importance scale.
WikiProject iconComputer science High‑importance
WikiProject iconThis article is within the scope of WikiProject Computer science, a collaborative effort to improve the coverage of Computer science related articles on Misplaced Pages. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.Computer scienceWikipedia:WikiProject Computer scienceTemplate:WikiProject Computer scienceComputer science
HighThis article has been rated as High-importance on the project's importance scale.
Things you can help WikiProject Computer science with:

Here are some tasks awaiting attention:
WikiProject iconEnvironment High‑importance
WikiProject iconThis environment-related article is part of the WikiProject Environment to improve Misplaced Pages's coverage of the environment. The aim is to write neutral and well-referenced articles on environment-related topics, as well as to ensure that environment articles are properly categorized.
Read Misplaced Pages:Contributing FAQ and leave any messages at the project talk page.EnvironmentWikipedia:WikiProject EnvironmentTemplate:WikiProject EnvironmentEnvironment
HighThis article has been rated as High-importance on the project's importance scale.
WikiProject iconSociology Mid‑importance
WikiProject iconThis article is within the scope of WikiProject Sociology, a collaborative effort to improve the coverage of sociology on Misplaced Pages. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.SociologyWikipedia:WikiProject SociologyTemplate:WikiProject Sociologysociology
MidThis article has been rated as Mid-importance on the project's importance scale.
WikiProject iconCorrection and Detention Facilities (defunct)
WikiProject iconThis article is within the scope of WikiProject Correction and Detention Facilities, a project which is currently considered to be defunct.Correction and Detention FacilitiesWikipedia:WikiProject Correction and Detention FacilitiesTemplate:WikiProject Correction and Detention FacilitiesCorrection and Detention Facilities
WikiProject iconPsychology Mid‑importance
WikiProject iconThis article is within the scope of WikiProject Psychology, a collaborative effort to improve the coverage of Psychology on Misplaced Pages. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.PsychologyWikipedia:WikiProject PsychologyTemplate:WikiProject Psychologypsychology
MidThis article has been rated as Mid-importance on the project's importance scale.

Media mentionThis article has been mentioned by a media organization:

what?

A purely selfish strategy will not "cheat" on its opponent for purely self-interested reasons first.

Why is that sentence in the "Nice" section? How is a purely selfish strategy "nice?" 107.77.195.33 (talk) 04:39, 24 November 2023 (UTC)

I also don't understand the logic of that sentence. How is a purely selfish strategy not purely selfish? 107.77.195.33 (talk) 04:42, 24 November 2023 (UTC)
I didn't find many mentions of "purely selfish" in decent sources after a quick search, although I think it means does not consider the other party at all, that is, it always chooses the best outcome--defect. I am changing this to "This strategy will not "cheat" on its opponent for purely self-interested reasons first" for now. Winston (talk) 02:02, 19 June 2024 (UTC)
Yeah, this section is a bit convoluted. "Nice" simply means the algorithm will not defect first, compared to algorithms which will try to defect. Winston (talk) 01:50, 19 June 2024 (UTC)

Christmas truce in "See also"

No inline citation is given for why the Christmas truce is mentioned at the bottom of this article, and mentioning it in this article seems inappropriate - can anyone explain? As I understand it, the Christmas truce was a stag hunt and therefore should instead be mentioned in that article:

- if both sides cooperate, everybody involved gets to live that day and enjoy the truce;

- if both sides betray, then the war continues as usual with some casualties being expected;

- if one side cooperates with the other betraying, then the cooperating side suffers massive casualties due to an ill-fated attempt to cross no man's land, while the betraying side can shoot the enemy with impunity, but does not enjoy a truce that day. TheDestroyer111 (talk) 08:53, 7 January 2024 (UTC)

As it turns out, the mentioning of the Christmas truce was introduced in a June 2010 edit with no reason given. TheDestroyer111 (talk) 19:45, 2 February 2024 (UTC)

The grammar is wrong.

The Prisoners' Dilemma redirects to The Prisoner's Dilemma. The plural possessive, however, is the correct form (regardless of how many times the mistake has been repeated in the past) and the singular possessive is wrong.

While there's only one dilemma (it's the same dilemma), it must be faced by multiple prisoners, otherwise the entire thought experiment is meaningless. Therefore multiple prisoners face the dilemma in question. It's the prisoners' dilemma, not any one prisoner's dilemma.

This grammatical atrocity must be ferreted out and stopped anywhere it can, and redirecting from the correct grammar to the wrong grammar is enabling the reign of the kingdom of the Doofuses.

Next step: figuring out how to make Chrome stop underlining the correct form. 2601:1C2:5000:8CC7:F5EF:AC6C:C8:5638 (talk) 03:35, 18 February 2024 (UTC)

Hi, What you are commenting is not a grammar issue, it is just a preference for naming the game. Grammatically both expressions are correct. We could also argue that the dilemma is not really about prisoners but about criminals or defendants, and argue in favor of changing the name of the game accordingly. Now, i personally prefer singular, as it puts emphasis on the point of view of one (arbitrary) criminal/defendant/prisoner, and the concept of Nash equilibrium also puts emphasis on the issue with (isolated/distributed) individual decisions. I checked the corresponding wiki page in about 10 other languages and singular is used in all. 37.67.33.80 (talk) 06:36, 7 May 2024 (UTC)

Plus/minus and payoff/cost

I suspect that many may be puzzled comparing the "payoff" matrix in the Prisoner's dilemma#Premise, displaying years in prison as payoffs, and the one in the Prisoner's dilemma#Generalized form, displaying, as expected, a more desirable payoff with a greater value than a less desirable one. We ask for instance in a general prisoner's dilemma that R > P {\displaystyle R>P} , where R is the payoff for cooperating=staying silent, and P that for defecting=testifying; this is inconsistent with payoffs being years in prison. The matrix displaying years in prison should rather be called "cost matrix", or be qualified with some other negative word like "punishment/penalty/sentence/...", and then the page should have a word of explanation to relate payoffs and costs. Alternatively we can display negatives of years in prison, and keep calling it "payoff matrix": this is consistent with the definition of a general prisoner's dilemma. 37.67.33.80 (talk) 05:46, 7 May 2024 (UTC)

Moral philosophy

I was reading today about the applications of prisoner's dilemma in moral philosophy, and noticed that there is no section about it on this page! I just added a new section, along with some opinions. Feel free to expand and discuss

I still can't reconcile these 2 statements in the current article

I still can't reconcile these 2 statements in the current article:

"For cooperation to emerge between rational players, the number of rounds must be unknown or infinite.

"Another aspect of the iterated prisoner's dilemma is that this tacit agreement between players has always been established successfully even when the number of iterations is made public to both sides."

~~~~ Mathiastck (talk) 01:17, 23 July 2024 (UTC)

I think the idea is that when real people play repeated rounds they do not follow the rational / dominant strategy, and co-operation still emerges. I can believe this is true, but it would probably be better supported by a reference. Mgp28 (talk) 08:01, 23 July 2024 (UTC)
I can find references in which co-operation emerges in a finite game (I'll leave this reference here as an example) but the word always needs some justification. I'm tempted to rephrase without the always but equally good if there is a citation to support it. Mgp28 (talk) 08:32, 23 July 2024 (UTC)


References

  1. Cooper, Russell; DeJong, Douglas V.; Forsythe, Robert; Ross, Thomas W. (1996). "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games". Games and Economic Behavior. 12 (2): 187–218. doi:10.1006/game.1996.0013.

Suppose a prisoner testified but perjured their testimony with false extraneous details.

Just saying. Hope no policy has been broken by my use of this. Tathagata1st (talk) 17:16, 11 September 2024 (UTC)

Lacks reference

The following paragraph lacks reference.

"In contrast to the one-time prisoner's dilemma game, the optimal strategy in the iterated prisoner's dilemma depends upon the strategies of likely opponents, and how they will react to defections and cooperation. For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, that person is at a slight disadvantage because of the loss on the first turn. In such a population, the optimal strategy is to defect every time. More generally, given a population with a certain percentage of always-defectors with the rest being tit-for-tat players, the optimal strategy depends on the percentage and number of iterations played." 202.92.130.118 (talk) 02:52, 25 September 2024 (UTC)

Categories: