Jump to content

Jury theorem: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Line 60: Line 60:
The above result is due to Kaniovski and Zaigraev. They also discuss optimal jury design for homogenous juries with correlated votes.<ref name=":0" />
The above result is due to Kaniovski and Zaigraev. They also discuss optimal jury design for homogenous juries with correlated votes.<ref name=":0" />


=== Bounded correlation ===
There are other jury theorems that do not require voter independence, but take into account the degree to which votes may be correlated.<ref>{{cite web|author=James Hawthorne|title=Voting In Search of the Public Good: the Probabilistic Logic of Majority Judgments|url=http://faculty-staff.ou.edu/H/James.A.Hawthorne-1/Hawthorne--Jury-Theorems.pdf|url-status=dead|archive-url=https://web.archive.org/web/20160323044630/http://faculty-staff.ou.edu/H/James.A.Hawthorne-1/Hawthorne--Jury-Theorems.pdf|archive-date=2016-03-23|accessdate=2009-04-20}}</ref><ref>see for example: {{cite journal|author=Krishna K. Ladha|date=August 1992|title=The Condorcet Jury Theorem, Free Speech, and Correlated Votes|journal=American Journal of Political Science|volume=36|issue=3|pages=617–634|doi=10.2307/2111584|jstor=2111584}}</ref>
One way to weaken the Independence Axiom is to allow correlation, but require that the correlation be bounded. A jury theorem by Pivato<ref>{{Cite journal|date=2017-10-01|title=Epistemic democracy with correlated voters|url=https://www.sciencedirect.com/science/article/abs/pii/S0304406816301094|journal=Journal of Mathematical Economics|language=en|volume=72|pages=51–69|doi=10.1016/j.jmateco.2017.06.001|issn=0304-4068}}</ref> shows that, if the average covariance between voters becomes small as the population becomes large, then Crowd Infallibility holds (for some voting rule).

There are other jury theorems that take into account the degree to which votes may be correlated.<ref>{{cite web|author=James Hawthorne|title=Voting In Search of the Public Good: the Probabilistic Logic of Majority Judgments|url=http://faculty-staff.ou.edu/H/James.A.Hawthorne-1/Hawthorne--Jury-Theorems.pdf|url-status=dead|archive-url=https://web.archive.org/web/20160323044630/http://faculty-staff.ou.edu/H/James.A.Hawthorne-1/Hawthorne--Jury-Theorems.pdf|archive-date=2016-03-23|accessdate=2009-04-20}}</ref><ref>see for example: {{cite journal|author=Krishna K. Ladha|date=August 1992|title=The Condorcet Jury Theorem, Free Speech, and Correlated Votes|journal=American Journal of Political Science|volume=36|issue=3|pages=617–634|doi=10.2307/2111584|jstor=2111584}}</ref>


=== Problem-sensitive independence and competence ===
=== Problem-sensitive independence and competence ===

Revision as of 08:39, 27 May 2021

A jury theorem is a mathematical theorem proving that, under certain assumptions, a decision attained using majority voting in a large group is more likely to be correct than a decision attained by a single expert. It serves as a formal argument for the idea of wisdom of the crowd, and for democracy in general.[1]

The first and most famous jury theorem is Condorcet's jury theorem. It assumes that all voters have independent probabilities to vote for the correct alternative, these probabilities are larger than 1/2, and are the same for all voters. Under these assumptions, the probability that the majority decision is correct is strictly larger when the group is larger; and when the group size tends to infinity, the probability that the majority decision is correct tends to 1.

There are many other jury theorems, relaxing some or all of these assumptions.

Setting

The premise of all jury theorems is that there is an objective truth, which is unknown to the voters. Most theorems focus on binary issues (issues with two possible states), for example, whether a certain defendant is guilty or innocent, whether a certain stock is going to rise or fall, etc. There are voters (or jurors), and their goal is to reveal the truth. Each voter has an opinion about which of the two options is correct. The opinion of each voter is either correct (i.e., equals the true state), or wrong (i.e., differs than the true state). This is in contrast to other settings of voting, in which the opinion of each voter represents his/her subjective preferences and is thus always "correct" for this specific voter. The opinion of a voter can be considered a random variable: for each voter, there is a positive probability that his opinion equals the true state.

The group decision is determined by the majority rule. For example, if a majority of voters says "guilty" then the decision is "guilty", while if a majority says "innocent" then the decision is "innocent". To avoid ties, it is often assumed that the number of voters is odd. Alternatively, if is even, then ties are broken by tossing a fair coin.

Jury theorems are interested in the probability of correctness - the probability that the majority decision coincides with the objective truth. Typical jury theorems make two kinds of claims on this probability:[1]

  1. Growing Reliability: the probability of correctness is larger when the group is larger.
  2. Crowd Infallibility: the probability of correctness goes to 1 when the group size goes to infinity.

Claim 1 is often called the non-asymptotic part and claim 2 is often called the asymptotic part of the jury theorem.

Obviously, these claims are not always true, but they are true under certain assumptions on the voters. Different jury theorems make different assumptions.

Independence, competence, and uniformity

Condorcet's jury theorem makes the following three assumptions:

  1. Unconditional Independence: the voters make up their minds independently. In other words, their opinions are independent random variables.
  2. Unconditional Competence: the probability that the opinion of a single voter coincides with the objective truth is larger than 1/2 (i.e., the voter is smarter than a random coin-toss).
  3. Uniformity: all voters have the same probability of being correct.

The jury theorem of Condorcet says that these three assumptions imply Growing Reliability and Crowd Infallibility.

Correlated votes: weakening the independence assumption

Truth-sensitive independence and competence

In binary decision problems, there is often one option that is easier to detect that the other one. For example, it may be easier to detect that a defendant is guilty (as there is clear evidence for guilt) than to detect that he is innocent. In this case, the probability that the opinion of a single voter is correct is represented by two different numbers: probability given that option #1 is correct, and probability given that option #2 is correct. This also implies that opinions of different voters are correlated. This motivates the following relaxations of the above assumptions:

  1. Conditional Independence: for each of the two options, the voters' opinions given that this option is the true one are independent random variables.
  2. Conditional Competence: for each of the two options, the probability that a single voter's opinion is correct given that this option is true is larger than 1/2.
  3. Conditional Uniformity: for each of the two options, all voters have the same probability of being correct given that this option is true.

Growing Reliability and Crowd Infallibility continue to hold under these weaker assumptions.[1]

Failure for large correlation

In addition to the dependence on the true option, there are many other reasons for which voters' opinions may be correlated. For example:

  • Deliberation among voters;
  • Peer pressure;
  • False evidence (e.g. a guilty defendant that excells at pretending to be innocent);
  • External conditions (e.g. poor weather affecting their judgement).
  • Any other common cause of votes.

In these cases, even Conditional Independence may not hold, and the Growing Reliability claim might fail.[2] As an example, let be the probability of a juror voting for the correct alternative and be the (second-order) correlation coefficient between any two correct votes. If all higher-order correlation coefficients in the Bahadur representation[3] of the joint probability distribution of votes equal to zero, and is an admissible pair, then the probability of the jury collectively reaching the correct decision under simple majority is given by:

where is the regularized incomplete beta function.

Example: Take a jury of three jurors , with individual competence and second-order correlation . Then . The competence of the jury is lower than the competence of a single juror, which equals to . Moreover, enlarging the jury by two jurors decreases the jury competence even further, . Note that and is an admissible pair of parameters. For and , the maximum admissible second-order correlation coefficient equals .

The above example shows that when the individual competence is low but the correlation is high:

  • The collective competence under simple majority may fall below that of a single juror;
  • Enlarging the jury may decrease its collective competence.

The above result is due to Kaniovski and Zaigraev. They also discuss optimal jury design for homogenous juries with correlated votes.[2]

Bounded correlation

One way to weaken the Independence Axiom is to allow correlation, but require that the correlation be bounded. A jury theorem by Pivato[4] shows that, if the average covariance between voters becomes small as the population becomes large, then Crowd Infallibility holds (for some voting rule).

There are other jury theorems that take into account the degree to which votes may be correlated.[5][6]

Problem-sensitive independence and competence

One strategy to handle voter correlation is to weaken the Conditional Independence assumption, and conditionalize on all common causes of the votes. In other words, the votes are now independent conditioned on the specific decision problem. However, in a specific problem, the Conditional Competence assumption may not be valid. For example, in a specific problem with false evidence, it is likely that most voters will have a wrong opinion. Thus, the two assumptions - conditional independence and conditional competence - are not justifiable simultaneously (under the same conditionalization).[7]

A possible solution is to weaken Conditional Competence as follows. For each voter and each problem x, there is a probability p(x) that the voter's opinion is correct in this specific problem. Since x is a random variable, p(x) is a random variable too. Conditional Competence requires that p(x) > 1/2 with probability 1. The weakened assumption is:

  • Tendency to Competence: for each voter, and for each r>0, the probability that p(x) = 1/2+r is at least as large as the probability that p(x) = 1/2-r.

A jury theorem by Dietrich and Spiekerman[8] says that Conditional Independence, Tendency to Competence, and Conditional Uniformity, together imply Growing Reliability. Note that Crowd Infallibility is not implied. In fact, the probability of correctness tends to a value which is below 1, if and only of Conditional Competence does not hold.

Diverse capabilities: weakening the uniformity assumption

Different voters often have different competence levels, so the Uniformity assumption does not hold. In this case, both Growing Reliability and Crowd Infallibility may not hold. This may happen if new voters have much lower competence than existing voters, so that adding new voters decreases the group's probability of correctness. In some cases, the probability of correctness might converge to 1/2 (- a random decision) rather than to 1.[9]

Stronger competence requirements

Uniformity can be dismissed if the Competence assumption is strengthened. There are several ways to strengthen it:

  • Strong Competence: for each voter i, the probability of correctness pi is at least 1/2+e, where e>0 is fixed for all voters. In other words: the competence is bounded away from a fair coin toss. A jury theorem by Paroush[9] shows that Strong Competence and Conditional Independence together imply Crowd Infallibility (but not Growing Reliability).
  • Average Competence: the average of the individual competence levels of the voters (i.e. the average of their individual probabilities of deciding correctly) is slightly greater than half, or converges to a value above 1/2. Jury theorems by Grofman, Owen and Feld,[10] and Berend and Paroush,[11] show that Average Competence and Conditional Independence together imply Crowd Infallibility (but not Growing Reliability).

Random voter selection

instead of assuming that the voter identity is fixed, one can assume that there is a large pool of potential voters with different competence levels, and the actual voters are selected at random from this pool (as in sortition).

A jury theorem by Ben Yashar and Paroush[12] shows that, under certain conditions, the correctness probability of a jury, or of a subset of it chosen at random, is larger than the correctness probability of a single juror selected at random. A more general jury theorem by Berend and Sapir[13] proves that Growing Reliability holds in this setting: the correctness probability of a random committee increases with the committee size. The theorem holds, under certain conditions, even with correlated votes.[14]

A jury theorem by Owen, Grofman and Feld[15] analyzes a setting where the competence level is random. They show what distribution of individual competence maximizes or minimizes the probability of correctness.

Weighted majority rule

When the competence levels of the voters are known, the simple majority rule may not be the best decision rule. There are various works on identifying the optimal decision rule - the rule maximizing the group correctness probability. Nitzan and Paroush[16] show that, under Unconditional Independence, the optimal decision rule is a weighted majority rule, where the weight of each voter with correctness probability pi is log(pi/(1-pi)), and an alternative is selected iff the sum of weights of its supporters is above some threshold. Grofman and Shapley[17] analyze the effect of interdependencies between voters on the optimal decision rule. Ben-Yashar and Nitzan[18] prove a more general result.

Dietrich[19] generalizes this result to a setting that does not require prior probabilities of the 'correctness' of the two alternative. The only required assumption is Epistemic Monotonicity, which says that, if under certain profile alternative x is selected, and the profile changes such that x becomes more probable, then x is still selected. Dietrich shows that Epistemic Monotonicity implies that the optimal decision rule is weighted majority with a threshold. In the same paper, he generalizes the optimal decision rule to a setting that does not require the input to be a vote for one of the alternatives. It can be, for example, a subjective degree of belief. Moreover, competence parameters do not need to be known. For example, if the inputs are subjective beliefs x1,...,xn, then the optimal decision rule sums log(xi/(1-xi)) and checks whether the sum is above some threshold. Epistemic Monotonicity is not sufficient for computing the threshold itself; the threshold can be computed by assuming expected-utility maximization and prior probabilities.

Indirect majority systems

Condorcet's theorem considers a direct majority system, in which all votes are counted directly towards the final outcome. Many countries use an indirect majority system, in which the voters are divided into groups. The voters in each group decide on an outcome by an internal majority vote; then, the groups decide on the final outcome by a majority vote among them. For example,[20] suppose there are 15 voters. In a direct majority system, a decision is accepted whenever at least 8 votes support it. Suppose now that the voters are grouped into 3 groups of size 5 each. A decision is accepted whenever at least 2 groups support it, and in each group, a decision is accepted whenever at least 3 voters support it. Therefore, a decision may be accepted even if only 6 voters support it.

Boland, Proschan and Tong[21] prove that, when the voters are independent and p>1/2, a direct majority system - as in Condorcet's theorem - always has a higher chance of accepting the correct decision than any indirect majority system.

Berg and Paroush[22] consider multi-tier voting hierarchies, which may have several levels with different decision-making rules in each level. They study the optimal voting structure, and compares the competence against the benefit of time-saving and other expenses.

More than two options

Most theorems do not directly apply to decisions between more than two outcomes. This critical limitation was in fact recognized by Condorcet (see Condorcet's paradox), and in general it is very difficult to reconcile individual decisions between three or more outcomes (see Arrow's theorem), although List and Goodin[23] and Everaere, Konieczny and Marquis[24] present evidence to the contrary. This limitation may also be overcome by means of a sequence of votes on pairs of alternatives, as is commonly realized via the legislative amendment process. (However, as per Arrow's theorem, this creates a "path dependence" on the exact sequence of pairs of alternatives; e.g., which amendment is proposed first can make a difference in what amendment is ultimately passed, or if the law—with or without amendments—is passed at all.)

Strategic voting

The behaviour that everybody in the jury votes according to his own beliefs might not be a Nash equilibrium under certain circumstances.[25]

Limitations

The notion of "correctness" may not be meaningful when making policy decisions, as opposed to deciding questions of fact.[citation needed] Some defenders of the theorem hold that it is applicable when voting is aimed at determining which policy best promotes the public good, rather than at merely expressing individual preferences. On this reading, what the theorem says is that although each member of the electorate may only have a vague perception of which of two policies is better, majority voting has an amplifying effect. The "group competence level", as represented by the probability that the majority chooses the better alternative, increases towards 1 as the size of the electorate grows assuming that each voter is more often right than wrong.

Despite these objections, Condorcet's jury theorem provides a theoretical basis for democracy, even if somewhat idealized, as well as a basis of the decision of questions of fact by jury trial, and as such continues to be studied by political scientists.

Further reading

  • Majority systems and the Condorcet jury theorem:[26] discusses non-homogeneous and correlated voters, and indirect majority systems.
  • Evolution in collective decision making.[27]
  • Law of large numbers: a mathematical generalization of jury theorems.
  • Realizing Epistemic Democracy: a criticism on the assumptions of jury theorems.[28]
  • The Epistemology of Democracy: a comparison of jury theorems to two other epistemic models of democracy: experimentalism and Diversity trumps ability.[29]

References

  1. ^ a b c Franz Dietrich and Kai Spiekermann (2019-07-19). Jury Theorems. Routledge. doi:10.4324/9781315717937-38/jury-theorems-franz-dietrich-kai-spiekermann. ISBN 978-1-315-71793-7.
  2. ^ a b Kaniovski, Serguei; Alexander, Zaigraev (2011). "Optimal Jury Design for Homogeneous Juries with Correlated Votes" (PDF). Theory and Decision. 71 (4): 439–459. CiteSeerX 10.1.1.225.5613. doi:10.1007/s11238-009-9170-2. S2CID 9189720.
  3. ^ Bahadur, R.R. (1961). "A representation of the joint distribution of responses to n dichotomous items". H. Solomon (Ed.), Studies in Item Analysis and Prediction: 158–168.
  4. ^ "Epistemic democracy with correlated voters". Journal of Mathematical Economics. 72: 51–69. 2017-10-01. doi:10.1016/j.jmateco.2017.06.001. ISSN 0304-4068.
  5. ^ James Hawthorne. "Voting In Search of the Public Good: the Probabilistic Logic of Majority Judgments" (PDF). Archived from the original (PDF) on 2016-03-23. Retrieved 2009-04-20.
  6. ^ see for example: Krishna K. Ladha (August 1992). "The Condorcet Jury Theorem, Free Speech, and Correlated Votes". American Journal of Political Science. 36 (3): 617–634. doi:10.2307/2111584. JSTOR 2111584.
  7. ^ Dietrich, Franz (2008). "The Premises of Condorcet's Jury Theorem Are Not Simultaneously Justified". Episteme: A Journal of Social Epistemology. 5 (1): 56–73. doi:10.1353/epi.0.0023. ISSN 1750-0117.
  8. ^ Dietrich, Franz; Spiekermann, Kai (2013-03-01). "Epistemic democracy with defensible premises". Economics and Philosophy. 29 (1): 87–120. ISSN 0266-2671.
  9. ^ a b Paroush, Jacob (1998). "Stay away from fair coins: A Condorcet jury theorem". Social Choice and Welfare. 15 (1): 15–20. ISSN 0176-1714.
  10. ^ Bernard Grofman; Guillermo Owen; Scott L. Feld (1983). "Thirteen theorems in search of the truth" (PDF). Theory & Decision. 15 (3): 261–78. doi:10.1007/BF00125672. S2CID 50576036.
  11. ^ Berend, Daniel; Paroush, Jacob (1998). "When is Condorcet's Jury Theorem valid?". Social Choice and Welfare. 15 (4): 481–488. ISSN 0176-1714.
  12. ^ Ben-Yashar, Ruth; Paroush, Jacob (2000-03-01). "A nonasymptotic Condorcet jury theorem". Social Choice and Welfare. 17 (2): 189–199. doi:10.1007/s003550050014. ISSN 1432-217X. S2CID 32072741.
  13. ^ Berend, Daniel; Sapir, Luba (2005). "Monotonicity in Condorcet Jury Theorem". Social Choice and Welfare. 24 (1): 83–92. ISSN 0176-1714.
  14. ^ Berend, Daniel; Sapir, Luba (2007). "Monotonicity in Condorcet's Jury Theorem with dependent voters". Social Choice and Welfare. 28 (3): 507–528. ISSN 0176-1714.
  15. ^ Owen, Guillermo; Grofman, Bernard; Feld, Scott L. (1989-02-01). "Proving a distribution-free generalization of the Condorcet Jury Theorem". Mathematical Social Sciences. 17 (1): 1–16. doi:10.1016/0165-4896(89)90012-7. ISSN 0165-4896.
  16. ^ Nitzan, Shmuel; Paroush, Jacob (1982). "Optimal Decision Rules in Uncertain Dichotomous Choice Situations". International Economic Review. 23 (2): 289–297. doi:10.2307/2526438. ISSN 0020-6598.
  17. ^ Shapley, Lloyd; Grofman, Bernard (1984-01-01). "Optimizing group judgmental accuracy in the presence of interdependencies". Public Choice. 43 (3): 329–343. doi:10.1007/BF00118940. ISSN 1573-7101.
  18. ^ Ben-Yashar, Ruth C.; Nitzan, Shmuel I. (1997). "The Optimal Decision Rule for Fixed-Size Committees in Dichotomous Choice Situations: The General Result". International Economic Review. 38 (1): 175–186. doi:10.2307/2527413. ISSN 0020-6598.
  19. ^ Dietrich, Franz (2006). "General representation of epistemically optimal procedures". Social Choice and Welfare. 26 (2): 263–283. ISSN 0176-1714.
  20. ^ Boland, Philip J. (1989). "Majority Systems and the Condorcet Jury Theorem". Journal of the Royal Statistical Society, Series D (The Statistician). 38 (3): 181–189. doi:10.2307/2348873. ISSN 1467-9884. JSTOR 2348873.
  21. ^ Boland, Philip J.; Proschan, Frank; Tong, Y. L. (March 1989). "Modelling dependence in simple and indirect majority systems". Journal of Applied Probability. 26 (1): 81–88. doi:10.2307/3214318. ISSN 0021-9002. JSTOR 3214318.
  22. ^ Berg, Sven; Paroush, Jacob (1998-05-01). "Collective decision making in hierarchies". Mathematical Social Sciences. 35 (3): 233–244. doi:10.1016/S0165-4896(97)00047-4. ISSN 0165-4896.
  23. ^ Christian List and Robert Goodin (September 2001). "Epistemic democracy : generalizing the Condorcet Jury Theorem" (PDF). Journal of Political Philosophy. 9 (3): 277–306. CiteSeerX 10.1.1.105.9476. doi:10.1111/1467-9760.00128.
  24. ^ Patricia Everaere, Sébastien Konieczny and Pierre Marquis (August 2010). "The Epistemic View of Belief Merging: Can We Track the Truth?" (PDF). Proceedings of the 19th European Conference on Artificial Intelligence (ECAI'10): 621–626. CiteSeerX 10.1.1.298.3965. doi:10.3233/978-1-60750-606-5-621.
  25. ^ Austen-Smith, David; Banks, Jeffrey S. (1996). "Information aggregation, rationality, and the Condorcet Jury Theorem" (PDF). American Political Science Review. 90 (1): 34–45. doi:10.2307/2082796. JSTOR 2082796.
  26. ^ Boland, Philip J. (1989). "Majority Systems and the Condorcet Jury Theorem". Journal of the Royal Statistical Society, Series D (The Statistician). 38 (3): 181–189. doi:10.2307/2348873. ISSN 1467-9884. JSTOR 2348873.
  27. ^ "Evolution in collective decision making". Understanding Collective Decision Making: 167–192. 2017. doi:10.4337/9781783473151.00011. ISBN 9781783473151.
  28. ^ Pivato, Marcus (2019), Laslier, Jean-François; Moulin, Hervé; Sanver, M. Remzi; Zwicker, William S. (eds.), "Realizing Epistemic Democracy", The Future of Economic Design: The Continuing Development of a Field as Envisioned by Its Researchers, Studies in Economic Design, Cham: Springer International Publishing, pp. 103–112, doi:10.1007/978-3-030-18050-8_16, ISBN 978-3-030-18050-8, retrieved 2021-05-27
  29. ^ Anderson, Elizabeth (2006). "The Epistemology of Democracy". Episteme: A Journal of Social Epistemology. 3 (1): 8–22. doi:10.1353/epi.0.0000. ISSN 1750-0117.