Wikipedia:Featured article review/Monty Hall problem/archive3

Monty Hall problem

Notified: User:Rick Block, User:Gill110951, User:Glopk, (top 3, the next two are topic banned) Wikipedia talk:WikiProject Statistics Wikipedia talk:WikiProject Mathematics Wikipedia talk:WikiProject Television Game Shows Wikipedia talk:WikiProject Game theory Wikipedia talk:WikiProject Psychology

WP:WIAFA concerns (1a, 1b, 1c, 1d, possibly 1e, 2a) are detailed below. But before delving into the details, I'll point out that there are major, long-term disagreements among a number of editors on how to present the topic involving the tension between WP:MTAA and WP:NPOV that have been going on for years, and culminated in an ArbCom case where two of the long-term contributors have been sanctioned, and several placed on restrictions or reprimanded. Unfortunately, the departure a couple of ArbCom-sanctioned editors has changed nothing of substance in topics in disagreement between the remaining editors. The issue that WP:FAR should be concerned with is that those disagreements have negatively impacted the end product: the article itself. Below, I organize my presentation by topic rather than WIAFA criteria, because some topics involve multiple criteria (even butting heads depending how one chooses to favor clarity or their view of npov) but I'll point them out. Before continuing, I disclose that I have contributed a couple of large-block-size diffs to this article, and a few minor ones. Of course, whether a change is semantically major or not doesn't exclusively depend on how much text is changed, so below I review the revision of the article before I had contributed anything, particularly because several long-term contributors to this article think that some of my edits were not an improvement.

1) Lead clarity vs. npov issues on the multiple interpretations/variants of the natural language statement leading to distinct mathematical problems. (WIAFA 1a, 2a, and 1d) Can you tell what do the words "randomly" and "overall" mean in the following chunk of the lead (emphasis mine)?

“

Although not explicitly stated in this version, solutions are often based on the additional assumptions that the car is initially equally likely to be behind each door and that the host must open a door showing a goat, must randomly choose which door to open if both hide goats, and must make the offer to switch.

As the player cannot be certain which of the two remaining unopened doors is the winning door, and initially all doors were equally likely, most people assume that each of two remaining closed doors has an equal probability and conclude that switching does not matter; hence the usual answer is "stay with your original door". However the player should switch—doing so doubles the overall probability of winning the car from 1/3 to 2/3.

”

The part of the lead quoted I have quoted is supposed to communicate a mathematical result, as opposed to the vos Savant version just above it. Can you tell if the emphasized words are there for a purpose or are superfluous? What is hidden behind those two is the mathematical equivalent of WP:WEASEL wording that is trying hide a dispute in the interpretation of vos Savant's words, as I tried to explain [1]

“

Although not explicitly stated in this version, solutions are often based on the additional assumptions that the car is initially equally likely to be behind each door and that the host must open a door showing a goat, must uniformly choose which door to open if both hide goats, and must make the offer to switch. As the player cannot be certain which of the two remaining unopened doors is the winning door, and initially all doors were equally likely, most people assume that each of two remaining closed doors has an equal probability and conclude that switching does not matter; hence the usual answer is "stay with your original door". However the player should switch—doing so doubles the probability of winning the car from 1/3 to 2/3.

A common variant of the problem, assumed by several academic authors as the canonical problem, does not make the simplifying assumption that host must uniformly choose the door to open, but instead that he uses some other strategy. The confusion as to which formalization is authoritative has led to considerable acrimony, particularly because this variant makes proofs more involved without altering the optimality of the always-switch strategy for the player. In this variant, the player can have different probabilities of winning depending on the observed choice of the host, but in any case the probability of winning by switching is at least 1/2 (and can be as high as 1), while the overall probability of winning by switching is still exactly 2/3.

”

Rick Block writes that the distinction between the two interpretations (and thus two mathematical problem-objects, one a subset, i.e. particular case of the other) is not important enough for the lead. It may not be immediately apparent why this is also a WIAFA 2a issue, but it becomes evident once you try to read the rest of article: the lead simply fails to prepare the reader for the problem variations which the proponents of various methods argue that their method is "the best". David Hilbert said "He who seeks for methods without having a definite problem in mind seeks for the most part in vain." The major problem variants don't have to be in the lead, but they should be certainly be stated before the several solutions are given, because these also try to convince the reader that the other approaches are wrong or superfluous.

There are plenty of secondary sources that make this separation, e.g. Rosenhouse (2009) ISBN 0195367898 by chapter, but not Wikipedia. In a similar vein, User:Kmhkmh argues that even presenting one of the variants ahead of the other is "nothing but a subtle POV pushing". If you wonder how this could possibly be so, the answer is the next paragraph.

2) Another lead issue is that MHP made it to the pages of the New York Times (the first time around) in no small part because mathematicians disagreed on what math problem vos Savant's words should translate to, with some of them proposing even other variants besides the above two. Of course, I'm not proposing a list in the lead, but a large part of MHP's notability is due to its confusing language, at least in its original formulation, which should be said in the lead (WIAFA 2a/1d).

3) Using a degenerate case to illustrate the use of the "best" proof method for a more general problem/variant. (Like insisting on solving $0x^{2}+2x+1=3$ using the quadratic formula). I argue that doing this is confusing for the reader (WIAFA 1a) and I gave a list of RSes not doing this (besides Rosenhouse), i.e. who explicitly introduce the general case or a particular non-uniform (usually deterministic) strategy for Monte, (e.g. always showing preference for one of the doors when he has a choice) before using a general method. Rick Block however says that doing so in not npov (WIAFA 1d trumping 1a), arguing that the majority of math sources do this. Even assuming this is true (a claim for which he provided no evidence), it's still not clear that a head count is the best selection criteria for proofs. I tried to engage him in a more detailed discussion on how he compares two proofs for authority by asking him to compare one from Ken Binmore with one (Morgan's) Rick seems to favor, but so far without getting any reply on that.

(I'll stop with giving WIAFA callouts from here on because they are obvious for the remainder of this review.)

4) Inconsistent terminology throughout the article. Examples include referring to overall/average probability but also calling the case/path probability total. Requests to synchronize terminology from different sources met with "I'd rather not change it". As a results, the article is a terminological pastiche, contravening WP:MOSMATH.

5) Disorganized presentation and tangentiality. Before and after giving some solutions with quote/paraphrases whether the "simple" solutions are wrong, without ever trying to present this matter coherently (because that would require explicitly stating several problem variants). Text reads like "blah, blah, blah, did I mention the simple solutions are wrong?, blah, blah, ... , did I mention the simple solutions are wrong?, blah, blah, ..., did I mention the simple solutions are wrong?, blah ..."

6) The above is the result of endless POV wars between true believers in the various solutions, who frankly seem to be clueless that are talking about different problems. ArbCom didn't ban all of them, unfortunately, only the worst offenders. The counterpoint to the oft repeated (from one source!) claim that simple solutions are wrong, i.e. "the simple solution is right in the case of equal conditional probabilities (because the overall probab. is their average)" has been recently deleted from the article as "unverifiable" even though it was cited. Quite amusing chutzpah, given that several other sources concur with that, e.g. Rosenthal 2005/2008 [2] and those are free on-line. Ironically Rosenhouse cites Wikipeida for inspiration when making this point at p. 52 in his book. I guess this makes him completely unreliable per WP:CIRCULAR! He read the WP:Wrong version too! Someone email Science (journal) (which published a reviewed of the book doi:10.1126/science.1177947) right away! Never mind he is a math prof at James Madison University, and can probably evaluate whether a argument like this is convincing or not.

7) The problem variants from given the large table are poorly organized and some are of questionable relevance. Never mind they repeat the snuck-through-the-back door variant that this-or-that solution was really solving. E.g. "The host acts as noted in the specific version of the problem." I have no idea what that refers to. Or "The host opens a door and makes the offer to switch 100% of the time if the contestant initially picked the car, and 50% the time if she didn't. Switching wins 1/2 the time at the Nash equilibrium." If you assume that the host strategy is fixed, it's a little silly to speak of a Nash equilibrium. It's a one-player game against nature, something that many game theory books don't even consider a game. Some decent secondary source like Chun 1999 or Rosenhouse should be used to organize variants.

8) Excessive formulism (for lack of a better term) is one of the Bayes' solution #1. Self-evident eye sore.

9) The "Sources of confusion" section is confusing if not downright POV. There are two issues: people being confused after being presented a definite math problem, and people being confused by the ambiguous formulation(s). No attempt is made to separate these. The implicit assumption there is that those making different assumptions about the game are idiots (other than Morgan of course, we are again reminded that the simple solutions are wrong!), including the profs from the NYT piece, and sources like Chun 1999.

10) Poorly researched from a formal sciences perspective (i.e. limited to STAT 101). Trivial variations between a bunch of proofs are presented as something of note. My note on the lack of serious game theory treatment in the article, which (I think) stymies understanding and only prolongs the absurd discussions, have been met with repetitions of the same obsessive "unconditional vs. conditional" mantras which have nothing to do with this issue. As David Eppstein put in on the ArbCom page, the so-called "advanced solution" using Bayes' theorem is the "basic of basics" as far as probability & decision theory is concerned.

Bayes' theorem vs is like mom's meat grinder next to a food processing plant when up against extensive-form games and Markov decision processes, (no need to skin the pig or chop the meat manually before the machinery takes over). As Ken Binmore puts it, once you formulate it as an EFG you hardly have to (creatively) think at all, meaning you just apply a well known algorithm to solve it. (Same goes for MDP or formulating it as a Bayesian game). Sources usable for this:

Ken Binmore, Playing for real, ISBN 0195300572, pp. 77-79, 84-85, 91-92, 385-386 (uses MHP as a running example) -- EFG approach, the most insightful
Chun 1999 [3] -- Bayesian (matrix) game approach with linear programming solution (with some "information economics" chaff that can be ignored).

The fact that the above two are equivalent approaches is non-trivial in general, a result that played no small part in these guys getting a Nobel prize.

Handbook of weighted automata, doi:10.1007/978-3-642-01492-5, pp. 527-536 -- MDP approach, iterated value solution (uses MHP as running example to introduce the notions) This works because Monty has only one move.

10) Issues related to interpretation of probability not discussed. Suggested source: Georgii ISBN 3110191458 (3rd ed.) pp. 54-56 (More correctly these are framed as issues stemming from what can or cannot be assumed common knowledge (logic). Crucially, the definition of a game like EFG assumes the rules are known by all player.) Also Rosenhouse pp. 84-88, but it's less useful. Olofsson ISBN 0470040017 pp. 50-52 discusses it the same way as Georgii, but with less formalism.

11) Issues stemming from bounded rationality not discussed. E.g., doi:10.1002/bdm.451 Related to this, Rosenhouse p. 135-136 discusses "feeling bad about switching" (Olofsson also mentions this), and with Chun 1999 codifies this as an alternate game where the Von Neumann–Morgenstern utility does not equal the lottery probability. Chugh and Bazerman 2007 [4] is a good overview here.

12) For the uniform problem a combinatorial argument (with 6 layouts by numbering the goats) is given in Rosenhouse p. 54 (taken from Williams ISBN 052100618X pp. 73-74). Richard Isaac discusses a slightly different sample space counting approach in ISBN 038794415X, pp. 8-10. These are sufficiently different proofs to include I think. (Isaac also discusses the Gillman, OMG subtle POV variant, if you're curious, on p. 27)

I hope some article improvements come out of the above, but I'm not holding my breath. A fair number of editors repeat on talk the article is fine. Others make weird edits with strange if not misleading summaries reminiscent of WP:ARBPIA articles and stonewall to perfection on talk. Of course, this article may well deserve its FA star as "the best Wikipedia could ever produce on this topic given its social dynamics", but the answer to the question: "is this article a good presentation of the topic based on the sources available", the answer is clearly no in my mind. Overall the article reads to me like it was produced by a committee of humanities journalists who read a few math articles, and cobbled them together without really understanding what they are saying or trying to integrate them in a coherent (mathematical) presentation.

And as a courtesy for my time investment, please do not edit what I wrote above to either strike anything or interject your replies between my paragraphs. I have numbered the issues, so you can easily address them in the unlimited space below, and let the FA(R) director/delegates decide. Tijfo098 ([[User talk: |talk]]) 23:23, 7 April 2011 (UTC)[reply]

Tijfo, I cannot seem to find a notification from you of the possibility of a FAR on the article talk page. This notification is required as the first step in the FAR process. If you have not made this notification, and allowed time for the (obviously) active editors to work on the article, then this review should be placed on hold. FAR is for articles that have degraded to the point that they no longer meet the FA criteria, and do not have editors interested in working on them, as shown by a lack of response to the initial notification. Dana boomer (talk) 00:32, 8 April 2011 (UTC)[reply]

I do not see that requirement (that I must explicitly "threaten" with a FAR) at WP:FAR [5], only "In this step, concerned editors attempt to directly resolve issues with the existing community of article editors, and to informally improve the article. Articles in this step are not listed on this page." I have tried that. See the Talk:Monty_Hall_problem/Archive_23. The same discussion has been going in circles since then on the non-archived talk. I hoped the remaining participants would come up with something other than reiterating what they have been saying for 22 talk page archives. Slim chance of that, I'd say. Tijfo098 (talk) 01:18, 8 April 2011 (UTC)[reply]

@Tijfo098: Since I'm incorrectly cited above, I'd correct that and clear up some possible misunderstandings as well.

I did not argue that "presenting one of the variants ahead of the other is 'nothing but a subtle POV pushing'". Instead I've argued declaring one variant as "the" (canonical) MHP and the others "mere" variants is subtle POV pushing, because it suggests to reader that the variants are solving slightly different problems from that ("original") one posed by Whitaker in vos Savant's column (note its Whitaker's words/problem not vos Savant's), which is not simply not correct as some of those variants deal precisely with Whitaker's problem rather than a modified version. The crux is here that Whitaker's problem is ambiguous and there are different possibilities to address the ambiguity leading to different solutions or "variants". Whether this squabbling over the use of the word variant is of any importance depends how the article is written. If the lead is written as suggested above, I have no issue with that. However in the past there was a push to move anything regarding "variants" (and the ambiguity of the problem) completely out of the lead and first chapters ("keept it simply for the less educated reader"). That means a reader not reading the complete article but just the lead and/or first chapters would not have been aware of the ambiguity of the problem, various variants adressing the same ambiguous wording and the disagreement among mathematcians themselves, instead he would have learned only about the spat between vos savants and some acdemics and the related media storm and switching is the best strategy.
I agree that the article appears a bit like patchwork (as a result of the "eternal disagreements". Imho the best solution would be if all old editors voluntarily withdraw from the article (other than commenting) and a few new qualified editors attempt a complete overhaul. However you'll have to keep in mind that the MHP somehow works like magnet attracting plenty of editors who feel the need to leave "their" mark on the article often with a almost religious fervor. So even if the overhaul succeeds, unless the article is not closely guarded chances are over time it will turn into a patchwork again.

--Kmhkmh (talk) 03:24, 8 April 2011 (UTC)[reply]

Well, a fair number of sources: Rosenhouse (chapter title is "Classical Monty"), Isaac ("The most common version of the problem ..." p. 27), Rosenthal (gives other names to the variants like Monty Small) do make a choice as to what they consider the common interpretation; I can easily scrounge for more RS remarks like that. I do see the point that others (sources and Wikipedians alike) don't see it that way; they see their version as canonical. I think it's a minor issue of wording and perhaps WP:INTEXT attribution. Instead, at least two long term contributors hold the intelligibility of the article hostage over this issue. Tijfo098 (talk) 12:01, 8 April 2011 (UTC)[reply]

Their line of POV pushing is to argue that old papers, who usually have more citations simply because of that (first-mover advantage), are cited because they are right. Even if they the had a calculation error that stood [officially] uncorrected for 20 years, which may well have affected the authors' judgement of the simple solutions, because their fancy solution gave a different numerical result. Oops. Tijfo098 (talk) 12:01, 8 April 2011 (UTC)[reply]

There is also a problem of POV pushing based on Kraus and Wang. The Bayes formula lovers insist that K & W empirically prove that most people don't get the simple proof(s). [6] At the same time, these Wikipedians ignore the fact that K & W also show that while common people "buy" the Bayes's formula proof, they are utterly unable to apply it to a similar problem (considered a more realistic test of comprehension by K & W). So K & W is an RS when it agrees with their POV, but not when it doesn't. [7] Classic signs of POV pushing right there. No point in me arguing with these guys ad infinitum. (The diffs I gave are representative, but the same arguments are being repeated on the current talk page by the same main contributors. Typical edit from one of them: [8]) Tijfo098 (talk) 12:01, 8 April 2011 (UTC)[reply]

Well as I said, with your suggested lead I have no issue with use of the term of variant in there, but my comment to referring to (older) tendencies by (other) authors. I agree that there is also "pov pushing" from other side(s) as well, the problem is that too many editors insist on their personally favoured way to read & treat MHP and try to marginalize everything else and various sources just serve as tools in that battle. Editors often don't seem to care what a particular source is saying overall, but they just pick snippets suiting their agenda. Imho a stable and reasonably good article will only be achieved if all participants (including us) are willing accept that the article cannot match their preferred treatment of MHP. Since there seem to be various righteous and almost religious beliefs and a lot of invested ego from all sides, I think the article would need to rewritten by new authors, old authors might help out by providing information but they need to refrain from editing and give up on seeing their favored treatment in the article. Doing so (egowise) might be easier with respected new authors rather than "giving in" to other side of the longterm conflict.--Kmhkmh (talk) 12:41, 8 April 2011 (UTC)[reply]

We're in agreement here. Tijfo098 (talk) 12:57, 8 April 2011 (UTC)[reply]

Yes! The nail is hit on the head. Richard Gill (talk) 10:33, 9 April 2011 (UTC)[reply]

Comment. For the record, and to provide a clear link - this article was the subject of a recent arbitration case - Wikipedia:Arbitration/Requests/Case/Monty Hall problem. The final decision, filed March 25 2011, may be found at Wikipedia:Arbitration/Requests/Case/Monty_Hall_problem#Final_decision. UltraExactZZ ^Said~ Did 13:02, 8 April 2011 (UTC)[reply]

I agree that the article, as it currently stands, is not representative of "the very best of Wikipedia". Its present structure was "frozen in time" in the middle of an edit war (the "Aids to Understanding" section was being bounced up by and down). The content of several paragraphs is obscure, if not contorted (my favorite is: "they do not explicitly address their interpretation of vos Savant's rewording of Whitaker's original question (Seymann)."). Because of said edit war, and the intervening edit semi-freezes during mediation and arbitration, no clarity is shed any longer on the criticism of the simple solution. Overall I agree with Tijfo098's assesment, although not with his assesment of the editors' motives. So yes, this is 'not an FA, it hasn't ben for quite some time. glopk (talk) 01:16, 10 April 2011 (UTC)[reply]

Comment. I think a desirable outcome here would be a clear consensus of a path for article improvement. Normally, this is what a featured article review achieves. If, as seems likely, that path is still very contentious on certain points, then I think FAR may not be the right instrument to deal with these (real or perceived) problems with the article. It seems to me that the sort of criticisms raised in the FAR here are the kinds of things that would be, for the most part, better addressed on the discussion page of the article first. Sławomir Biały (talk) 13:22, 10 April 2011 (UTC)[reply]

This is overly optimistic view of the editing environment on Wikipedia. The plain reason I started this FAR is that I think the article fails WIAFA in many ways, and that no reasonable time frame for addressing the issues seems forthcoming. Endless circular discussions on talk page is not a criteria for promoting or keeping articles as FA. Tijfo098 (talk) 16:10, 10 April 2011 (UTC)[reply]