Revision as of 06:39, 7 May 2024 editPlm203 (talk | contribs)137 edits →Plus/minus and payoff/cost← Previous edit | Latest revision as of 12:38, 23 November 2024 edit undoLowercase sigmabot III (talk | contribs)Bots, Template editors2,294,256 editsm Archiving 1 discussion(s) to Talk:Prisoner's dilemma/Archive 5) (bot | ||
(11 intermediate revisions by 8 users not shown) | |||
Line 1: | Line 1: | ||
{{Talkheader |
{{Talkheader}} | ||
{{ArticleHistory | {{ArticleHistory | ||
|action1=RBP | |action1=RBP | ||
Line 41: | Line 41: | ||
| url=http://www.metrowestdailynews.com/columnists/view.bg?articleid=81042 | | url=http://www.metrowestdailynews.com/columnists/view.bg?articleid=81042 | ||
|small=yes}} | |small=yes}} | ||
== Where does "Outcome D" arise from? == | |||
Outcome D: If A and B both betray the other, they will share the sentence and serve 5 years | |||
I am failing to see where outcome D comes from? Is it implied in the text? | |||
Premise: | |||
William Poundstone described the game in his 1993 book Prisoner's Dilemma: | |||
Two members of a criminal gang, A and B, are arrested and imprisoned. Each prisoner is in solitary confinement with no means of communication with their partner. The principal charge would lead to a sentence of ten years in prison; however, the police do not have the evidence for a conviction. They plan to sentence both to two years in prison on a lesser charge but offer each prisoner a Faustian bargain: If one of them confesses to the crime of the principal charge, betraying the other, they will be pardoned and free to leave while the other must serve the entirety of the sentence instead of just two years for the lesser charge. | |||
] (]) 12:02, 17 June 2023 (UTC) | |||
: I noticed the same when I read it a few days ago but didn't have the time or energy to investigate it or even mention it on the talk page. But I agree completely: There is a non-stated premise that if both are convicted they will share the 10 year sentence in the given hypothetical justice system, and it is confusing that this premise only shows up in Outcome D, out of the blue. | |||
: So now I looked further into it, and it turns out that the quote has been messed with. It is not a proper quote from the stated book. The book (William Poundstone: Prisoner's Dilemma, February 1993, ISBN 0-385-41580-X) says this at page 118: | |||
: "A typical contemporary version of the story goes like this: Two members of a criminal gang are arrested and imprisoned. Each prisoner is in solitary confinement with no means of speaking to or exchanging messages with the other. The police admit they don't have enough evidence to convict the pair on the principal charge. They plan to sentence both to a year in prison on a lesser charge. Simultaneously, the police offer each prisoner a Faustian bargain. If he testifies against his partner, he will go free while the partner will get three years in prison on the main charge. Oh, yes, there is a catch ... If ''both'' prisoners testify against each other, both will be sentenced to two years in jail." | |||
: So I guess we should either (1) correct the quote and then also correct the text after the quote to match it, or (2) state our completely own version of the dilemma, without referring to any book, but then I think we could border on something like ]. I guess option (1) seems to be the most straightforward to ensure we align with reliable sources. | |||
: We should also make clear that the quote is just a "typical contemporary version" mentioned in that book, because right now, without reading the article lead, the text can give the impression that this book is the original source of the prisoner's dilemma (even though the book of course makes no such claim; on the contrary, it clearly states the origin as Merrill Flood and Melvin Dresher in 1950, with the name coming from Albert W. Tucker). --] (]) 11:43, 18 June 2023 (UTC) | |||
::I have adjusted the article now with the correct quote and correction of the actual numbers mentioned in that specific version of the story. I also changed outcomes ABCD to 1234 to lessen confusion with prisoners A and B. ] (]) 13:33, 18 June 2023 (UTC) | |||
⚫ | == I can't reconcile these 2 statements in the article == | ||
⚫ | I can't reconcile these 2 statements: | ||
⚫ | "For |
||
... | |||
⚫ | "Another aspect of the iterated prisoner's dilemma is that this tacit agreement between players has always been established successfully even when the number of iterations is made public to both sides." | ||
As an aside, I see the principles discussed as similar to the concept of ] ] ] (]) 21:37, 20 July 2023 (UTC) | |||
:Yeah I think ] belongs in a "see also" for Prinsoners Dilemma, it links back here and provides context: | |||
:" | |||
:Etymology | ] | |||
:The word can be found being used in the context of playing zero-sum games in a publication by the Mental Health Research Institute in 1956. It is alternately claimed that the first known use of the term was in Nigel Howard's book ''Paradoxes of Rationality: Theory of Metagames and Political Behavior'' published in 1971, where Howard used the term in his analysis of the ] political landscape using a variation of the ]., however Howard used the term in ''Metagame Analysis in Political Problems'' published in 1966. In 1967, the word appeared in a study by Russell Lincoln Ackoff and in the Bulletin of the Operations Research Society of America | |||
⚫ | |||
== what? == | == what? == | ||
Line 88: | Line 49: | ||
:I also don't understand the logic of that sentence. How is a ''purely'' selfish strategy not purely selfish? ] (]) 04:42, 24 November 2023 (UTC) | :I also don't understand the logic of that sentence. How is a ''purely'' selfish strategy not purely selfish? ] (]) 04:42, 24 November 2023 (UTC) | ||
::I didn't find many mentions of "purely selfish" in decent sources after a quick search, although I think it means does not consider the other party at all, that is, it always chooses the best outcome--defect. I am changing this to "This strategy will not "cheat" on its opponent for purely self-interested reasons first" for now. ] (]) 02:02, 19 June 2024 (UTC) | |||
:Yeah, this section is a bit convoluted. "Nice" simply means the algorithm will not defect first, compared to algorithms which will try to defect. ] (]) 01:50, 19 June 2024 (UTC) | |||
== what person? == | |||
⚫ | ''For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, |
||
== Christmas truce in "See also" == | == Christmas truce in "See also" == | ||
Line 122: | Line 81: | ||
I suspect that many may be puzzled comparing the "payoff" matrix in the ], displaying years in prison as payoffs, and the one in the ], displaying, as expected, a more desirable payoff with a greater value than a less desirable one. | I suspect that many may be puzzled comparing the "payoff" matrix in the ], displaying years in prison as payoffs, and the one in the ], displaying, as expected, a more desirable payoff with a greater value than a less desirable one. | ||
We ask for instance in a general prisoner's dilemma that <math>R>P</math>, where ''R'' is the payoff for cooperating=staying silent, and ''P'' that for defecting=testifying; this is inconsistent with payoffs being years in prison. The matrix displaying years in prison should rather be called "cost matrix", or be qualified with some other negative word like "punishment/penalty/sentence/...", and then the page should have a word of explanation to relate payoffs and costs. Alternatively we can display negatives of years in prison, and keep calling it "payoff matrix": this is consistent with the definition of a general prisoner's dilemma. ] (]) 05:46, 7 May 2024 (UTC) | We ask for instance in a general prisoner's dilemma that <math>R>P</math>, where ''R'' is the payoff for cooperating=staying silent, and ''P'' that for defecting=testifying; this is inconsistent with payoffs being years in prison. The matrix displaying years in prison should rather be called "cost matrix", or be qualified with some other negative word like "punishment/penalty/sentence/...", and then the page should have a word of explanation to relate payoffs and costs. Alternatively we can display negatives of years in prison, and keep calling it "payoff matrix": this is consistent with the definition of a general prisoner's dilemma. ] (]) 05:46, 7 May 2024 (UTC) | ||
== Moral philosophy == | |||
I was reading today about the applications of prisoner's dilemma in moral philosophy, and noticed that there is no section about it on this page! | |||
I just added a new section, along with some opinions. Feel free to expand and discuss | |||
⚫ | == I still can't reconcile these 2 statements in the current article == | ||
⚫ | I still can't reconcile these 2 statements in the current article: | ||
⚫ | "For cooperation to emerge between rational players, the number of rounds must be unknown or infinite. | ||
⚫ | "Another aspect of the iterated prisoner's dilemma is that this tacit agreement between players has always been established successfully even when the number of iterations is made public to both sides." | ||
⚫ | <nowiki>~~~~</nowiki> ] (]) 01:17, 23 July 2024 (UTC) | ||
:I think the idea is that when real people play repeated rounds they do not follow the rational / dominant strategy, and co-operation still emerges. I can believe this is true, but it would probably be better supported by a reference. ] (]) 08:01, 23 July 2024 (UTC) | |||
::I can find references in which co-operation emerges in a finite game (I'll leave this reference here as an example<ref>{{cite journal |last1=Cooper |first1=Russell |last2=DeJong |first2=Douglas V. |last3=Forsythe |first3=Robert |last4=Ross |first4=Thomas W. |title=Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games |journal=Games and Economic Behavior |date=1996 |volume=12 |issue=2 |pages=187-218 |doi=10.1006/game.1996.0013}}</ref>) but the word ''always'' needs some justification. I'm tempted to rephrase without the ''always'' but equally good if there is a citation to support it. ] (]) 08:32, 23 July 2024 (UTC) | |||
{{Reflist-talk}}<!-- Template:Reflist-talk creates a section-level reference list box. Please add comments and references for this section's discussion above this template. When a new discussion begins, the new section will be added below this template. Add a new {{Reflist-talk}} at the end of that section if needed. --> | |||
== Suppose a prisoner testified but perjured their testimony with false extraneous details. == | |||
Just saying. Hope no policy has been broken by my use of this. ] (]) 17:16, 11 September 2024 (UTC) | |||
== Lacks reference == | |||
The following paragraph lacks reference. | |||
⚫ | "In contrast to the one-time prisoner's dilemma game, the optimal strategy in the iterated prisoner's dilemma depends upon the strategies of likely opponents, and how they will react to defections and cooperation. For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, that person is at a slight disadvantage because of the loss on the first turn. In such a population, the optimal strategy is to defect every time. More generally, given a population with a certain percentage of always-defectors with the rest being tit-for-tat players, the optimal strategy depends on the percentage and number of iterations played." ] (]) 02:52, 25 September 2024 (UTC) |
Latest revision as of 12:38, 23 November 2024
This is the talk page for discussing improvements to the Prisoner's dilemma article. This is not a forum for general discussion of the article's subject. |
|
Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL |
Archives: 1, 2, 3, 4, 5Auto-archiving period: 12 months |
Prisoner's dilemma is a former featured article. Please see the links under Article milestones below for its original nomination page (for older articles, check the nomination archive) and why it was removed. | |||||||||||||
This article appeared on Misplaced Pages's Main Page as Today's featured article on March 16, 2004. | |||||||||||||
| |||||||||||||
Current status: Former featured article |
This level-5 vital article is rated C-class on Misplaced Pages's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
This article has been mentioned by a media organization:
|
what?
A purely selfish strategy will not "cheat" on its opponent for purely self-interested reasons first.
Why is that sentence in the "Nice" section? How is a purely selfish strategy "nice?" 107.77.195.33 (talk) 04:39, 24 November 2023 (UTC)
- I also don't understand the logic of that sentence. How is a purely selfish strategy not purely selfish? 107.77.195.33 (talk) 04:42, 24 November 2023 (UTC)
- I didn't find many mentions of "purely selfish" in decent sources after a quick search, although I think it means does not consider the other party at all, that is, it always chooses the best outcome--defect. I am changing this to "This strategy will not "cheat" on its opponent for purely self-interested reasons first" for now. Winston (talk) 02:02, 19 June 2024 (UTC)
- Yeah, this section is a bit convoluted. "Nice" simply means the algorithm will not defect first, compared to algorithms which will try to defect. Winston (talk) 01:50, 19 June 2024 (UTC)
Christmas truce in "See also"
No inline citation is given for why the Christmas truce is mentioned at the bottom of this article, and mentioning it in this article seems inappropriate - can anyone explain? As I understand it, the Christmas truce was a stag hunt and therefore should instead be mentioned in that article:
- if both sides cooperate, everybody involved gets to live that day and enjoy the truce;
- if both sides betray, then the war continues as usual with some casualties being expected;
- if one side cooperates with the other betraying, then the cooperating side suffers massive casualties due to an ill-fated attempt to cross no man's land, while the betraying side can shoot the enemy with impunity, but does not enjoy a truce that day. TheDestroyer111 (talk) 08:53, 7 January 2024 (UTC)
As it turns out, the mentioning of the Christmas truce was introduced in a June 2010 edit with no reason given. TheDestroyer111 (talk) 19:45, 2 February 2024 (UTC)
The grammar is wrong.
The Prisoners' Dilemma redirects to The Prisoner's Dilemma. The plural possessive, however, is the correct form (regardless of how many times the mistake has been repeated in the past) and the singular possessive is wrong.
While there's only one dilemma (it's the same dilemma), it must be faced by multiple prisoners, otherwise the entire thought experiment is meaningless. Therefore multiple prisoners face the dilemma in question. It's the prisoners' dilemma, not any one prisoner's dilemma.
This grammatical atrocity must be ferreted out and stopped anywhere it can, and redirecting from the correct grammar to the wrong grammar is enabling the reign of the kingdom of the Doofuses.
Next step: figuring out how to make Chrome stop underlining the correct form. 2601:1C2:5000:8CC7:F5EF:AC6C:C8:5638 (talk) 03:35, 18 February 2024 (UTC)
- Hi, What you are commenting is not a grammar issue, it is just a preference for naming the game. Grammatically both expressions are correct. We could also argue that the dilemma is not really about prisoners but about criminals or defendants, and argue in favor of changing the name of the game accordingly. Now, i personally prefer singular, as it puts emphasis on the point of view of one (arbitrary) criminal/defendant/prisoner, and the concept of Nash equilibrium also puts emphasis on the issue with (isolated/distributed) individual decisions. I checked the corresponding wiki page in about 10 other languages and singular is used in all. 37.67.33.80 (talk) 06:36, 7 May 2024 (UTC)
Plus/minus and payoff/cost
I suspect that many may be puzzled comparing the "payoff" matrix in the Prisoner's dilemma#Premise, displaying years in prison as payoffs, and the one in the Prisoner's dilemma#Generalized form, displaying, as expected, a more desirable payoff with a greater value than a less desirable one. We ask for instance in a general prisoner's dilemma that , where R is the payoff for cooperating=staying silent, and P that for defecting=testifying; this is inconsistent with payoffs being years in prison. The matrix displaying years in prison should rather be called "cost matrix", or be qualified with some other negative word like "punishment/penalty/sentence/...", and then the page should have a word of explanation to relate payoffs and costs. Alternatively we can display negatives of years in prison, and keep calling it "payoff matrix": this is consistent with the definition of a general prisoner's dilemma. 37.67.33.80 (talk) 05:46, 7 May 2024 (UTC)
Moral philosophy
I was reading today about the applications of prisoner's dilemma in moral philosophy, and noticed that there is no section about it on this page! I just added a new section, along with some opinions. Feel free to expand and discuss
I still can't reconcile these 2 statements in the current article
I still can't reconcile these 2 statements in the current article:
"For cooperation to emerge between rational players, the number of rounds must be unknown or infinite.
"Another aspect of the iterated prisoner's dilemma is that this tacit agreement between players has always been established successfully even when the number of iterations is made public to both sides."
~~~~ Mathiastck (talk) 01:17, 23 July 2024 (UTC)
- I think the idea is that when real people play repeated rounds they do not follow the rational / dominant strategy, and co-operation still emerges. I can believe this is true, but it would probably be better supported by a reference. Mgp28 (talk) 08:01, 23 July 2024 (UTC)
- I can find references in which co-operation emerges in a finite game (I'll leave this reference here as an example) but the word always needs some justification. I'm tempted to rephrase without the always but equally good if there is a citation to support it. Mgp28 (talk) 08:32, 23 July 2024 (UTC)
References
- Cooper, Russell; DeJong, Douglas V.; Forsythe, Robert; Ross, Thomas W. (1996). "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games". Games and Economic Behavior. 12 (2): 187–218. doi:10.1006/game.1996.0013.
Suppose a prisoner testified but perjured their testimony with false extraneous details.
Just saying. Hope no policy has been broken by my use of this. Tathagata1st (talk) 17:16, 11 September 2024 (UTC)
Lacks reference
The following paragraph lacks reference.
"In contrast to the one-time prisoner's dilemma game, the optimal strategy in the iterated prisoner's dilemma depends upon the strategies of likely opponents, and how they will react to defections and cooperation. For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, that person is at a slight disadvantage because of the loss on the first turn. In such a population, the optimal strategy is to defect every time. More generally, given a population with a certain percentage of always-defectors with the rest being tit-for-tat players, the optimal strategy depends on the percentage and number of iterations played." 202.92.130.118 (talk) 02:52, 25 September 2024 (UTC)
Categories:- Misplaced Pages former featured articles
- Featured articles that have appeared on the main page
- Featured articles that have appeared on the main page once
- C-Class level-5 vital articles
- Misplaced Pages level-5 vital articles in Mathematics
- C-Class vital articles in Mathematics
- C-Class game theory articles
- Top-importance game theory articles
- C-Class mathematics articles
- High-priority mathematics articles
- C-Class Economics articles
- High-importance Economics articles
- WikiProject Economics articles
- C-Class Computer science articles
- High-importance Computer science articles
- WikiProject Computer science articles
- C-Class Environment articles
- High-importance Environment articles
- C-Class sociology articles
- Mid-importance sociology articles
- C-Class psychology articles
- Mid-importance psychology articles
- WikiProject Psychology articles
- Misplaced Pages pages referenced by the press