Jury theorem: Difference between revisions
Erel Segal (talk | contribs) No edit summary |
GeorgmentO (talk | contribs) m →Weighted majority rule: fixed typo "iff"->"if" |
(28 intermediate revisions by 12 users not shown) | |||
Line 1: | Line 1: | ||
{{Short description|Mathematical theory of majority voting}}
A '''jury theorem''' is a [[mathematical theorem]] proving that, under certain assumptions, a decision attained using [[majority voting]] in a large group is more likely to be correct than a decision attained by a single expert. It serves as a formal argument for the idea of [[wisdom of the crowd]], for deciding [[Question of fact|questions of fact]] by [[jury trial]], and for [[democracy]] in general.<ref name=":2">{{SEP|jury-theorems|Jury Theorems|Franz Dietrich & Kai Spiekermann|November 17, 2021}}</ref>
The first and most famous jury theorem is [[Condorcet's jury theorem]]. It assumes that voters vote independently, that each voter's probability of voting for the correct alternative is larger than 1/2, and that this probability is the same for all voters. Under these assumptions, the probability that the majority decision is correct is strictly larger when the group is larger; and when the group size tends to infinity, the probability that the majority decision is correct tends to 1.
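The two claims of Condorcet's theorem can be checked numerically. The following is a minimal sketch, assuming an odd number of independent voters who share the same competence ''p''; the function name <code>majority_correct_probability</code> is illustrative and does not come from the cited sources. It sums the binomial probabilities of all outcomes in which more than half of the voters are correct.

```python
from math import comb


def majority_correct_probability(n: int, p: float) -> float:
    """Probability that a strict majority of n independent voters is correct.

    Assumes every voter is correct with the same probability p and that n is odd,
    so that ties cannot occur.
    """
    if n % 2 == 0:
        raise ValueError("n must be odd so that ties cannot occur")
    # Sum the binomial probabilities of k correct votes for every k above n/2.
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(n // 2 + 1, n + 1)
    )


if __name__ == "__main__":
    # With p = 0.6 the printed values increase with n and approach 1.
    for n in (1, 3, 11, 101):
        print(n, round(majority_correct_probability(n, 0.6), 4))
```

For ''p'' = 0.6, a single voter is correct with probability 0.6 and a group of three with probability 0.648; the value keeps increasing as further pairs of voters are added.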
Line 6: | Line 7: | ||
== Setting ==
The premise of all jury theorems is that there is an ''[[objective truth]]'', which is unknown to the voters. Most theorems focus on ''binary issues'' (issues with two possible states), for example, whether a certain [[defendant]] is guilty or innocent, whether a certain [[stock]] is going to rise or fall, etc. There are <math>n</math> voters (or jurors), and their goal is to reveal the truth. Each voter has an ''[[opinion]]'' about which of the two options is correct. The opinion of each voter is either correct (i.e., equals the true state), or wrong (i.e., differs from the true state). This is in contrast to other settings of [[voting]], in which the opinion of each voter represents that voter's subjective preferences and is thus always "correct" for this specific voter. The opinion of a voter can be considered a [[random variable]]: for each voter, there is a positive probability that the voter's opinion equals the true state.
The group decision is determined by the ''[[majority rule]]''. For example, if a majority of voters says "guilty" then the decision is "guilty", while if a majority says "innocent" then the decision is "innocent". To avoid ties, it is often assumed that the number of voters <math>n</math> is odd. Alternatively, if <math>n</math> is even, then ties are broken by tossing a [[fair coin]].
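The coin-toss convention for even <math>n</math> can be illustrated with a short calculation. This is a sketch under the assumption of independent voters with a common competence ''p''; the function name is hypothetical. A tie contributes half of its probability to the correct outcome.

```python
from math import comb


def majority_correct_with_coin(n: int, p: float) -> float:
    """P(group decision is correct) for n independent voters of competence p.

    A strict majority decides; an exact tie (possible only for even n)
    is broken by a fair coin, so it counts with weight 1/2.
    """
    # pmf[k] is the probability that exactly k of the n voters are correct.
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    strict_majority = sum(pmf[k] for k in range(n + 1) if 2 * k > n)
    tie = pmf[n // 2] if n % 2 == 0 else 0.0
    return strict_majority + 0.5 * tie
```

Under these assumptions, an even group of size 2''m'' is exactly as reliable as an odd group of size 2''m''−1. For example, with ''p'' = 0.6 both three and four voters give 0.648.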
Jury theorems concern the ''probability of correctness'': the probability that the majority decision coincides with the objective truth. Typical jury theorems make two kinds of claims about this probability:<ref name=":2" />
Line 28: | Line 29: | ||
== Correlated votes: weakening the independence assumption ==
The opinions of different voters are often correlated, so Unconditional Independence may not hold. In this case, the Growing Reliability claim might fail.
=== Example ===
Line 55: | Line 56: | ||
Growing Reliability and Crowd Infallibility continue to hold under these weaker assumptions.<ref name=":2" />
One criticism of Conditional Competence is that it depends on the way the decision question is formulated. For example, instead of asking whether the defendant is guilty or innocent, one can ask whether the defendant is guilty of exactly 10 charges (option A), or guilty of another number of charges (0 to 9, or more than 10). This changes the conditions, and hence, the conditional probability. Moreover, if the state is very specific, then the probability of voting correctly might be below 1/2, so Conditional Competence might not hold.<ref>{{Cite book|last=Estlund|first=David|url=https://books.google.com/books?id=YlHGF166Kd4C&dq=estlund+2008+framework&pg=PP1|title=Democratic Authority: A Philosophical Framework|date=2009-08-03|publisher=Princeton University Press|isbn=978-1-4008-3154-8|language=en}}</ref>
=== Effect of an opinion leader ===
Line 64: | Line 65: | ||
* [[Deliberation]] among voters;
* [[Peer pressure]];
* False evidence (e.g. a guilty defendant who excels at pretending to be innocent);
* External conditions (e.g. poor weather affecting their judgement).
* Any other common cause of votes.
It is possible to weaken the Conditional Independence assumption, and conditionalize on ''all'' common causes of the votes (rather than just the state). In other words, the votes are now independent ''conditioned on the specific decision problem''. However, in a specific problem, the Conditional Competence assumption may not be valid. For example, in a specific problem with false evidence, it is likely that most voters will have a wrong opinion. Thus, the two assumptions - conditional independence and conditional competence - are not justifiable simultaneously (under the same conditionalization).<ref>{{Cite journal|last=Dietrich|first=Franz|date=2008|title=The Premises of Condorcet's Jury Theorem Are Not Simultaneously Justified|url=https://muse.jhu.edu/article/240352|journal=Episteme: A Journal of Social Epistemology|volume=5|issue=1|pages=56–73|doi=10.1353/epi.0.0023|s2cid=9214091|issn=1750-0117}}</ref> |
A possible solution is to weaken Conditional Competence as follows. For each voter and each problem ''x'', there is a probability ''p''(''x'') that the voter's opinion is correct in this specific problem. Since ''x'' is a random variable, ''p''(''x'') is a random variable too. Conditional Competence requires that ''p''(''x'') > 1/2 with probability 1. The weakened assumption is:
Line 73: | Line 74: | ||
* ''Tendency to Competence'': for each voter, and for each ''r''>0, the probability that ''p''(''x'') = 1/2+''r'' is at least as large as the probability that ''p''(''x'') = 1/2-''r''.
A jury theorem by Dietrich and Spiekermann<ref>{{Cite journal|last1=Dietrich|first1=Franz|last2=Spiekermann|first2=Kai|date=2013-03-01|title=Epistemic democracy with defensible premises|url=http://journals.cambridge.org/action/displayJournal?jid=EAP|journal=Economics and Philosophy|language=en|volume=29|issue=1|pages=87–120|doi=10.1017/S0266267113000096|s2cid=55692104|issn=0266-2671}}</ref> says that Conditional Independence, Tendency to Competence, and Conditional Uniformity together imply Growing Reliability. Note that Crowd Infallibility is not implied. In fact, the probability of correctness tends to a value below 1 if and only if Conditional Competence does not hold.
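A small numerical illustration may help to see why the limit can be below 1. The sketch below assumes a hypothetical two-type model that is not taken from the cited paper: with probability 0.8 the problem is "easy" and every voter has ''p''(''x'') = 0.7, and with probability 0.2 the evidence is misleading and every voter has ''p''(''x'') = 0.3. This satisfies Tendency to Competence (with ''r'' = 0.2) but not Conditional Competence. Votes are independent given the problem type.

```python
from math import comb


def majority_correct(n: int, p: float) -> float:
    """P(strict majority of n independent voters is correct), n odd, common competence p."""
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(n // 2 + 1, n + 1)
    )


def group_correctness(n: int, problem_types) -> float:
    """Average the majority's correctness over the problem types.

    problem_types is a list of (probability of the type, voter competence in that type).
    """
    return sum(weight * majority_correct(n, p) for weight, p in problem_types)


# Hypothetical model: 80% easy problems (p = 0.7), 20% misleading problems (p = 0.3).
PROBLEM_TYPES = [(0.8, 0.7), (0.2, 0.3)]

if __name__ == "__main__":
    for n in (1, 3, 11, 51):
        print(n, round(group_correctness(n, PROBLEM_TYPES), 4))
```

In this model the probability of correctness starts at 0.62 for a single voter and grows with the group size, but it stays below 0.8, the probability of an easy problem: in misleading problems a large majority is almost surely wrong.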
=== Bounded correlation ===
A jury theorem by Pivato<ref>{{Cite journal|date=2017-10-01|title=Epistemic democracy with correlated voters|url=https://www.sciencedirect.com/science/article/abs/pii/S0304406816301094|journal=Journal of Mathematical Economics|language=en|volume=72|pages=51–69|doi=10.1016/j.jmateco.2017.06.001|issn=0304-4068|last1=Pivato|first1=Marcus}}</ref> shows that, if the average covariance between voters becomes small as the population becomes large, then Crowd Infallibility holds (for some voting rule). There are other jury theorems that take into account the degree to which votes may be correlated.<ref>{{cite web|author=James Hawthorne|title=Voting In Search of the Public Good: the Probabilistic Logic of Majority Judgments|url=http://faculty-staff.ou.edu/H/James.A.Hawthorne-1/Hawthorne--Jury-Theorems.pdf|url-status=dead|archive-url=https://web.archive.org/web/20160323044630/http://faculty-staff.ou.edu/H/James.A.Hawthorne-1/Hawthorne--Jury-Theorems.pdf|archive-date=2016-03-23|accessdate=2009-04-20}}</ref><ref>see for example: {{cite journal|author=Krishna K. Ladha|date=August 1992|title=The Condorcet Jury Theorem, Free Speech, and Correlated Votes|journal=American Journal of Political Science|volume=36|issue=3|pages=617–634|doi=10.2307/2111584|jstor=2111584}}</ref> |
=== Other solutions ===
Other ways to cope with voter correlation include [[causal network]]s, dependence structures, and interchangeability.<ref name=":2" />{{Rp||location=2.2}}
== Diverse capabilities: weakening the uniformity assumption ==
Different voters often have different competence levels, so the Uniformity assumption does not hold. In this case, both Growing Reliability and Crowd Infallibility may fail. This may happen if new voters have much lower competence than existing voters, so that adding new voters decreases the group's probability of correctness. In some cases, the probability of correctness might converge to 1/2 (i.e., a random decision) rather than to 1.<ref name=":3">{{Cite journal|last=Paroush|first=Jacob|date=1998|title=Stay away from fair coins: A Condorcet jury theorem|url=https://www.jstor.org/stable/41106237|journal=Social Choice and Welfare|volume=15|issue=1|pages=15–20|doi=10.1007/s003550050088|jstor=41106237|s2cid=153646874|issn=0176-1714}}</ref>
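The failure of Growing Reliability with unequal competence can be seen in a three-voter example. The sketch below assumes independent voters and uses illustrative competence values (0.9 and 0.55) that are not taken from the cited sources; it computes the distribution of the number of correct votes one voter at a time.

```python
def majority_correct_heterogeneous(competences) -> float:
    """P(strict majority is correct) for independent voters with individual competences.

    competences[i] is the probability that voter i is correct; the number of
    voters must be odd so that ties cannot occur.
    """
    n = len(competences)
    if n % 2 == 0:
        raise ValueError("the number of voters must be odd")
    # dist[k] = probability that exactly k of the voters processed so far are correct.
    dist = [1.0]
    for p in competences:
        updated = [0.0] * (len(dist) + 1)
        for k, prob in enumerate(dist):
            updated[k] += prob * (1 - p)   # this voter is wrong
            updated[k + 1] += prob * p     # this voter is correct
        dist = updated
    return sum(dist[n // 2 + 1:])


if __name__ == "__main__":
    print(majority_correct_heterogeneous([0.9]))
    print(majority_correct_heterogeneous([0.9, 0.55, 0.55]))
```

A single voter with competence 0.9 is correct with probability 0.9, whereas adding two voters of competence 0.55 lowers the majority's probability of correctness to 0.748, even though every voter is better than a fair coin.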
=== Stronger competence requirements ===
Line 88: | Line 89: | ||
* Strong Competence: for each voter ''i'', the probability of correctness ''p<sub>i</sub>'' is at least 1/2+''e'', where ''e''>0 is fixed for all voters. In other words: the competence is bounded away from a fair coin toss. A jury theorem by Paroush<ref name=":3" /> shows that Strong Competence and Conditional Independence together imply Crowd Infallibility (but not Growing Reliability). |
* Average Competence: the ''average'' of the individual competence levels of the voters (i.e. the average of their individual probabilities of deciding correctly) is slightly greater than half, or converges to a value above 1/2. Jury theorems by Grofman, Owen and Feld,<ref>{{cite journal|author1=Bernard Grofman|author2=Guillermo Owen|author3=Scott L. Feld|year=1983|title=Thirteen theorems in search of the truth.|url=http://www.socsci.uci.edu/~bgrofman/69%20Grofman-Owen-Feld-13%20theorems%20in%20search%20of%20truth.pdf|journal=Theory and Decision|volume=15|issue=3|pages=261–78|doi=10.1007/BF00125672|s2cid=50576036}}</ref> and Berend and Paroush,<ref>{{Cite journal|last1=Berend|first1=Daniel|last2=Paroush|first2=Jacob|date=1998|title=When is Condorcet's Jury Theorem valid?|url=https://www.jstor.org/stable/41106274|journal=Social Choice and Welfare|volume=15|issue=4|pages=481–488|doi=10.1007/s003550050118|jstor=41106274|s2cid=120012958|issn=0176-1714}}</ref> show that Average Competence and Conditional Independence together imply Crowd Infallibility (but not Growing Reliability). |
=== Random voter selection ===
Instead of assuming that the voter identity is fixed, one can assume that there is a large pool of potential voters with different competence levels, and the actual voters are selected at random from this pool (as in [[sortition]]).
A jury theorem by Ben-Yashar and Paroush<ref>{{Cite journal|last1=Ben-Yashar|first1=Ruth|last2=Paroush|first2=Jacob|date=2000-03-01|title=A nonasymptotic Condorcet jury theorem|journal=Social Choice and Welfare|language=en|volume=17|issue=2|pages=189–199|doi=10.1007/s003550050014|issn=1432-217X|s2cid=32072741}}</ref> shows that, under certain conditions, the correctness probability of a jury, or of a subset of it chosen at random, is larger than the correctness probability of a single juror selected at random. A more general jury theorem by Berend and Sapir<ref>{{Cite journal|last1=Berend|first1=Daniel|last2=Sapir|first2=Luba|date=2005|title=Monotonicity in Condorcet Jury Theorem|url=https://www.jstor.org/stable/41106652|journal=Social Choice and Welfare|volume=24|issue=1|pages=83–92|doi=10.1007/s00355-003-0293-z|jstor=41106652|s2cid=5617331|issn=0176-1714}}</ref> proves that Growing Reliability holds in this setting: the correctness probability of a random committee increases with the committee size. The theorem holds, under certain conditions, even with correlated votes.<ref>{{Cite journal|last1=Berend|first1=Daniel|last2=Sapir|first2=Luba|date=2007|title=Monotonicity in Condorcet's Jury Theorem with dependent voters|url=https://www.jstor.org/stable/41106830|journal=Social Choice and Welfare|volume=28|issue=3|pages=507–528|doi=10.1007/s00355-006-0179-y|jstor=41106830|s2cid=41180424|issn=0176-1714}}</ref>
A jury theorem by Owen, Grofman and Feld<ref>{{Cite journal|last1=Owen|first1=Guillermo|last2=Grofman|first2=Bernard|last3=Feld|first3=Scott L.|date=1989-02-01|title=Proving a distribution-free generalization of the Condorcet Jury Theorem|url=http://dx.doi.org/10.1016/0165-4896(89)90012-7|journal=Mathematical Social Sciences|volume=17|issue=1|pages=1–16|doi=10.1016/0165-4896(89)90012-7|issn=0165-4896}}</ref> analyzes a setting where the competence level is random. They show what distribution of individual competence maximizes or minimizes the probability of correctness. |
=== Weighted majority rule ===
When the competence levels of the voters are known, the simple majority rule may not be the best decision rule. There are various works on identifying the ''optimal decision rule'' - the rule maximizing the group correctness probability. Nitzan and Paroush<ref>{{Cite journal|last1=Nitzan|first1=Shmuel|last2=Paroush|first2=Jacob|date=1982|title=Optimal Decision Rules in Uncertain Dichotomous Choice Situations|url=https://www.jstor.org/stable/2526438|journal=International Economic Review|volume=23|issue=2|pages=289–297|doi=10.2307/2526438|jstor=2526438|issn=0020-6598}}</ref> show that, under Unconditional Independence, the optimal decision rule is a ''weighted'' majority rule, where the weight of each voter with correctness probability ''p<sub>i</sub>'' is log(''p<sub>i</sub>''/(1-''p<sub>i</sub>'')), and an alternative is selected if the sum of weights of its supporters is above some threshold. Grofman and Shapley<ref>{{Cite journal|last1=Shapley|first1=Lloyd|last2=Grofman|first2=Bernard|date=1984-01-01|title=Optimizing group judgmental accuracy in the presence of interdependencies|url=https://doi.org/10.1007/BF00118940|journal=Public Choice|language=en|volume=43|issue=3|pages=329–343|doi=10.1007/BF00118940|s2cid=14858639|issn=1573-7101}}</ref> analyze the effect of interdependencies between voters on the optimal decision rule. Ben-Yashar and Nitzan<ref>{{Cite journal|last1=Ben-Yashar|first1=Ruth C.|last2=Nitzan|first2=Shmuel I.|date=1997|title=The Optimal Decision Rule for Fixed-Size Committees in Dichotomous Choice Situations: The General Result|url=https://www.jstor.org/stable/2527413|journal=International Economic Review|volume=38|issue=1|pages=175–186|doi=10.2307/2527413|jstor=2527413|issn=0020-6598}}</ref> prove a more general result. |
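The log-odds weighting can be sketched in a few lines. The code below is an illustration, not the procedure of the cited papers: it encodes the two alternatives as +1 and −1, adds the weight of each voter to the side that voter supports, and uses a threshold of 0, which is an assumption corresponding to equal prior probabilities and symmetric payoffs for the two alternatives. Comparing the signed sum with a threshold is equivalent to comparing the total weight of one alternative's supporters with a suitably shifted threshold.

```python
from math import log


def log_odds_weight(p: float) -> float:
    """Weight log(p / (1 - p)) of a voter whose probability of being correct is p."""
    return log(p / (1 - p))


def weighted_majority(votes, competences, threshold: float = 0.0) -> int:
    """Return +1 or -1 according to the weighted majority of the votes.

    votes[i] is +1 or -1; competences[i] is voter i's probability of being correct.
    threshold = 0 is an assumption (equal priors, symmetric payoffs).
    """
    score = sum(v * log_odds_weight(p) for v, p in zip(votes, competences))
    return 1 if score > threshold else -1


if __name__ == "__main__":
    # One highly competent voter outweighs two mildly competent ones.
    print(weighted_majority([+1, -1, -1], [0.9, 0.6, 0.6]))
    # With equal competences the rule reduces to simple majority.
    print(weighted_majority([+1, -1, -1], [0.6, 0.6, 0.6]))
```

In the first call the weights are log 9 ≈ 2.20 for the expert and log 1.5 ≈ 0.41 for each of the other voters, so the expert's alternative is selected although a simple majority opposes it.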
Dietrich<ref>{{Cite journal|last=Dietrich|first=Franz|date=2006|title=General representation of epistemically optimal procedures|url=https://www.jstor.org/stable/41106734|journal=Social Choice and Welfare|volume=26|issue=2|pages=263–283|doi=10.1007/s00355-006-0094-2|jstor=41106734|s2cid=12716206|issn=0176-1714}}</ref> generalizes this result to a setting that does not require prior probabilities of the 'correctness' of the two alternatives. The only required assumption is Epistemic Monotonicity, which says that, if under a certain profile alternative ''x'' is selected, and the profile changes such that ''x'' becomes more probable, then ''x'' is still selected. Dietrich shows that Epistemic Monotonicity implies that the optimal decision rule is weighted majority with a threshold. In the same paper, he generalizes the optimal decision rule to a setting that does not require the input to be a vote for one of the alternatives. It can be, for example, a subjective degree of belief. Moreover, competence parameters do not need to be known. For example, if the inputs are subjective beliefs ''x''<sub>1</sub>,...,''x<sub>n</sub>'', then the optimal decision rule sums log(''x<sub>i</sub>''/(1-''x<sub>i</sub>'')) and checks whether the sum is above some threshold. Epistemic Monotonicity is not sufficient for computing the threshold itself; the threshold can be computed by assuming [[Expected utility hypothesis|expected-utility maximization]] and prior probabilities.
A general problem with weighted majority rules is that they require knowing the competence levels of the different voters, which are usually hard to determine in an objective way. Baharad, Goldberger, [[Moshe Koppel|Koppel]] and Nitzan<ref>{{Cite journal|last1=Baharad|first1=Eyal|last2=Goldberger|first2=Jacob|last3=Koppel|first3=Moshe|last4=Nitzan|first4=Shmuel|date=2012-01-01|title=Beyond Condorcet: optimal aggregation rules using voting records|url=https://doi.org/10.1007/s11238-010-9240-5|journal=Theory and Decision|language=en|volume=72|issue=1|pages=113–130|doi=10.1007/s11238-010-9240-5|s2cid=189822673|issn=1573-7187|hdl=10419/46518|hdl-access=free}}</ref> present an algorithm that solves this problem using [[Statistical learning theory|statistical machine learning]]. It requires as input only a list of past votes; it does not need to know whether these votes were correct or not. If the list is sufficiently large, then the algorithm's probability of correctness converges to 1 even if the individual voters' competence levels are close to 1/2.
== More than two options ==
Line 111: | Line 114: | ||
* Multioption Conditional Competence: for any two options ''x'' and ''y'', if ''x'' is correct and ''y'' is not, then any voter is more likely to vote for ''x'' than for ''y''.
A jury theorem by List and Goodin shows that Multioption Conditional Competence and Conditional Independence together imply Crowd Infallibility.<ref>{{cite journal|author=Christian List and Robert Goodin|date=September 2001|title=Epistemic democracy : generalizing the Condorcet Jury Theorem|url=http://personal.lse.ac.uk/LIST/PDF-files/listgoodin.pdf|journal=Journal of Political Philosophy|volume=9|issue=3|pages=277–306|citeseerx=10.1.1.105.9476|doi=10.1111/1467-9760.00128}}</ref> Dietrich and Spiekermann conjecture that they imply Growing Reliability too.<ref name=":2" /> Another related jury theorem is by Everaere, Konieczny and Marquis.<ref>{{cite journal|author=Patricia Everaere, Sébastien Konieczny and Pierre Marquis|date=August 2010|title=The Epistemic View of Belief Merging: Can We Track the Truth?|url=http://www.cril.univ-artois.fr/~marquis/everaere-konieczny-marquis-ecai10.pdf|journal=Proceedings of the 19th European Conference on Artificial Intelligence (ECAI'10)|volume=215|issue=ECAI 2010|pages=621–626|citeseerx=10.1.1.298.3965|doi=10.3233/978-1-60750-606-5-621}}</ref> |
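The multi-option setting can be explored with a small simulation. The sketch below is an assumption-laden illustration rather than the construction used in the cited papers: it uses plurality voting, treats option 0 as the correct one, counts ties as failures, and uses hypothetical vote probabilities (0.4, 0.3, 0.3), in which the correct option is the most likely vote of each voter without having an absolute majority.

```python
import random
from collections import Counter


def plurality_accuracy(n_voters: int, vote_probs, trials: int = 2000, seed: int = 0) -> float:
    """Estimate how often plurality voting selects option 0 (assumed correct).

    vote_probs[j] is the probability that an independent voter votes for option j.
    A trial counts as a success only if option 0 is the unique plurality winner.
    """
    rng = random.Random(seed)  # fixed seed so that the estimate is reproducible
    options = range(len(vote_probs))
    hits = 0
    for _ in range(trials):
        votes = rng.choices(options, weights=vote_probs, k=n_voters)
        counts = Counter(votes)
        top = max(counts.values())
        winners = [option for option, count in counts.items() if count == top]
        if winners == [0]:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    for n in (1, 21, 201):
        print(n, plurality_accuracy(n, [0.4, 0.3, 0.3]))
```

With these probabilities a single voter picks the correct option about 40% of the time, while the estimated accuracy of the plurality winner rises with the number of voters, in line with the Crowd Infallibility claim.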
When there are more than two options, there are various [[Electoral system|voting rules]] that can be used instead of simple majority. The statistical and utilitarian properties of such rules are analyzed, e.g., by Pivato.<ref>{{Cite journal|last=Pivato|first=Marcus|date=2013|title=Voting rules as statistical estimators|url=https://econpapers.repec.org/article/sprsochwe/v_3a40_3ay_3a2013_3ai_3a2_3ap_3a581-630.htm|journal=Social Choice and Welfare|volume=40|issue=2|pages=581–630|doi=10.1007/s00355-011-0619-1|s2cid=22310477|issn=1432-217X}}</ref><ref>{{Cite journal|last=Pivato|first=Marcus|date=2016-08-01|title=Asymptotic utilitarianism in scoring rules|url=https://doi.org/10.1007/s00355-016-0971-2|journal=Social Choice and Welfare|language=en|volume=47|issue=2|pages=431–458|doi=10.1007/s00355-016-0971-2|s2cid=34482765|issn=1432-217X}}</ref>
== Indirect majority systems ==
Condorcet's theorem considers a ''direct majority system'', in which all votes are counted directly towards the final outcome. Many countries use an ''indirect majority system'', in which the voters are divided into groups. The voters in each group decide on an outcome by an internal majority vote; then, the groups decide on the final outcome by a majority vote among them. For example,<ref name=":1">{{Cite journal|last=Boland|first=Philip J.|date=1989|title=Majority Systems and the Condorcet Jury Theorem|journal=Journal of the Royal Statistical Society, Series D (The Statistician)|language=en|volume=38|issue=3|pages=181–189|doi=10.2307/2348873|issn=1467-9884|jstor=2348873}}</ref> suppose there are 15 voters. In a direct majority system, a decision is accepted whenever at least 8 votes support it. Suppose now that the voters are grouped into 3 groups of size 5 each. A decision is accepted whenever at least 2 groups support it, and in each group, a decision is accepted whenever at least 3 voters support it. Therefore, a decision may be accepted even if only 6 voters support it. |
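The 15-voter example can be reproduced directly. In the sketch below, the function names and the particular arrangement of supporters are illustrative: six supporters are placed so that they form a majority of three in each of two groups.

```python
def majority(votes) -> bool:
    """True if strictly more than half of the boolean votes are True."""
    return sum(votes) > len(votes) / 2


def indirect_majority(groups) -> bool:
    """Each group decides by internal majority; the groups' decisions are then put to a majority vote."""
    return majority([majority(group) for group in groups])


# 15 voters in 3 groups of 5; True marks a supporter of the decision.
GROUPS = [
    [True, True, True, False, False],
    [True, True, True, False, False],
    [False, False, False, False, False],
]
ALL_VOTES = [vote for group in GROUPS for vote in group]

if __name__ == "__main__":
    print("supporters:", sum(ALL_VOTES))
    print("direct majority accepts:", majority(ALL_VOTES))
    print("indirect majority accepts:", indirect_majority(GROUPS))
```

With this arrangement only 6 of the 15 voters support the decision, so the direct system rejects it, while the indirect system accepts it because two of the three groups have an internal majority in favour.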
||
Boland, Proschan and Tong<ref name=":4">{{Cite journal|last1=Boland|first1=Philip J.|last2=Proschan|first2=Frank|last3=Tong|first3=Y. L.|date=March 1989|title=Modelling dependence in simple and indirect majority systems|url=https://www.cambridge.org/core/journals/journal-of-applied-probability/article/modelling-dependence-in-simple-and-indirect-majority-systems/070D6335BDDDDC7AF4D70BC9B21B0B7B|journal=Journal of Applied Probability|language=en|volume=26|issue=1|pages=81–88|doi=10.2307/3214318|issn=0021-9002|jstor=3214318}}</ref> prove that, when the voters are independent and p>1/2, a direct majority system - as in Condorcet's theorem - always has a higher chance of accepting the correct decision than any indirect majority system. |
Boland, Proschan and Tong<ref name=":4">{{Cite journal|last1=Boland|first1=Philip J.|last2=Proschan|first2=Frank|last3=Tong|first3=Y. L.|date=March 1989|title=Modelling dependence in simple and indirect majority systems|url=https://www.cambridge.org/core/journals/journal-of-applied-probability/article/modelling-dependence-in-simple-and-indirect-majority-systems/070D6335BDDDDC7AF4D70BC9B21B0B7B|journal=Journal of Applied Probability|language=en|volume=26|issue=1|pages=81–88|doi=10.2307/3214318|issn=0021-9002|jstor=3214318|s2cid=123605673 }}</ref> prove that, when the voters are independent and p>1/2, a direct majority system - as in Condorcet's theorem - always has a higher chance of accepting the correct decision than any indirect majority system. |
||
Berg and Paroush<ref>{{Cite journal|last1=Berg|first1=Sven|last2=Paroush|first2=Jacob|date=1998-05-01|title=Collective decision making in hierarchies|url=http://www.sciencedirect.com/science/article/pii/S0165489697000474|journal=Mathematical Social Sciences|language=en|volume=35|issue=3|pages=233–244|doi=10.1016/S0165-4896(97)00047-4|issn=0165-4896}}</ref> consider multi-tier voting hierarchies, which may have several levels with different decision-making rules in each level. They study the optimal voting structure, and compares the competence against the benefit of time-saving and other expenses. |
Berg and Paroush<ref>{{Cite journal|last1=Berg|first1=Sven|last2=Paroush|first2=Jacob|date=1998-05-01|title=Collective decision making in hierarchies|url=http://www.sciencedirect.com/science/article/pii/S0165489697000474|journal=Mathematical Social Sciences|language=en|volume=35|issue=3|pages=233–244|doi=10.1016/S0165-4896(97)00047-4|issn=0165-4896}}</ref> consider multi-tier voting hierarchies, which may have several levels with different decision-making rules in each level. They study the optimal voting structure, and compares the competence against the benefit of time-saving and other expenses. |
||
Goodin and Spiekermann<ref>{{Cite journal| |
Goodin and Spiekermann<ref>{{Cite journal|last1=Goodin|first1=Robert E.|last2=Spiekermann|first2=Kai|date=2012-11-01|title=Epistemic aspects of representative government|url=http://journals.cambridge.org/action/displayJournal?jid=EPR|journal=European Political Science Review|language=en|volume=4|issue=3|pages=303–325|doi=10.1017/S1755773911000245|s2cid=85556702|issn=1755-7739}}</ref> compute the amount by which a small group of experts should be better than the average voters, in order for them to accept better decisions. |
||
== Strategic voting == |
== Strategic voting == |
||
It is well-known that, when there are three or more alternatives, and voters have different preferences, they may engage in [[Tactical voting|strategic voting]], for example, vote for the second-best option in order to prevent the worst option from being elected. Surprisingly, strategic voting might occur even with two alternatives and when all voters have the same preference, which is to reveal the truth. For example, suppose the question is whether a defendant is guilty or innocent, and suppose a certain juror thinks the true answer is "guilty". However, he also knows that his vote is effective only if the other votes are tied. But, if other votes are tied, it means that the probability that the defendant is guilty is close to 1/2. Taking this into account, our juror might decide that this probability is not sufficient for deciding "guilty", and thus will vote "innocent". But if all other voters do the same, the wrong answer is derived. In game-theoretic terms, truthful voting might not be a [[Nash equilibrium]].<ref>{{cite journal|last1=Austen-Smith|first1=David|last2=Banks|first2=Jeffrey S.|year=1996|title=Information aggregation, rationality, and the Condorcet Jury Theorem|url=https://authors.library.caltech.edu/67312/1/2082796.pdf|journal=American Political Science Review|volume=90|issue=1|pages=34–45|doi=10.2307/2082796|jstor=2082796|s2cid=8495814 }}</ref> This problem has been termed ''the swing voter's curse'',<ref>{{Cite journal|last1=Feddersen|first1=Timothy J.|last2=Pesendorfer|first2=Wolfgang|date=1996|title=The Swing Voter's Curse|url=https://www.jstor.org/stable/2118204|journal=The American Economic Review|volume=86|issue=3|pages=408–424|jstor=2118204|issn=0002-8282}}</ref> as it is analogous to the [[winner's curse]] in auction theory. |
|||
A jury theorem by Peleg and Zamir<ref>{{Cite journal|last1=Peleg|first1=Bezalel|last2=Zamir|first2=Shmuel|date=2012|title=Extending the Condorcet Jury Theorem to a general dependent jury|url=https://www.jstor.org/stable/41485510|journal=Social Choice and Welfare|volume=39|issue=1|pages=91–125|doi=10.1007/s00355-011-0546-1|jstor=41485510|s2cid=5685386|issn=0176-1714}}</ref> shows sufficient and necessary conditions for the existence of a [[Bayesian Nash equilibrium|Bayesian-Nash equilibrium]] that satisfies Condorcet's jury theorem. Bozbay, Dietrich and Peters<ref>{{Cite journal|date=2014-09-01|title=Judgment aggregation in search for the truth|journal=Games and Economic Behavior|language=en|volume=87|pages=571–590|doi=10.1016/j.geb.2014.02.007|issn=0899-8256|last1=Bozbay|first1=İrem|last2=Dietrich|first2=Franz|last3=Peters|first3=Hans|doi-access=free}}</ref> show voting rules that lead to efficient aggregation of the voters' private information even with strategic voting. |
|||
In practice, this problem may not be very severe, since most voters care not only about the final outcome, but also about voting correctly by their conscience. Moreover, most voters are not sophisticated enough to vote strategically.<ref name=":2" />{{Rp||location=4.7}} |
|||
== Subjective opinions == |
|||
The notion of "correctness" may not be meaningful when making policy decisions, which are based on values or preferences, rather than just on facts. |
|||
⚫ | Some defenders of the theorem hold that it is applicable when voting is aimed at determining which policy best promotes the public good, rather than at merely expressing individual preferences. On this reading, what the theorem says is that although each member of the electorate may only have a vague perception of which of two policies is better, majority voting has an amplifying effect. The "group competence level", as represented by the probability that the majority chooses the better alternative, increases towards 1 as the size of the electorate grows assuming that each voter is more often right than wrong. |
||
Several papers show that, under reasonable conditions, large groups are better trackers of the majority preference.<ref>{{Cite journal|last=Goldman|first=Alvin|date=2002|title=Knowledge in a Social World|url=https://philpapers.org/rec/GOLKIA-3|journal=Philosophy and Phenomenological Research|volume=64|issue=1|pages=185–190|doi=10.1111/j.1933-1592.2002.tb00151.x}}</ref>{{Rp|323}}<ref>{{Cite journal|last1=Goodin|first1=Robert E.|last2=Spiekermann|first2=Kai|date=December 2015|title=Epistemic solidarity as a political strategy|url=http://journals.cambridge.org/action/displayJournal?jid=EPI|journal=Episteme|language=en|volume=12|issue=4|pages=439–457|doi=10.1017/epi.2015.29|s2cid=142927949|issn=1742-3600}}</ref><ref>{{Citation|last1=List|first1=Christian|title=The Condorcet Jury Theorem and Voter-Specific Truth|date=2016|url=https://onlinelibrary.wiley.com/doi/abs/10.1002/9781118609378.ch10|work=Goldman and His Critics|pages=219–233|publisher=John Wiley & Sons, Ltd|language=en|doi=10.1002/9781118609378.ch10|isbn=978-1-118-60937-8|access-date=2021-05-27|last2=Spiekermann|first2=Kai}}</ref> |
|||
== Limitations == |
|||
⚫ | |||
== Applicability == |
|||
Despite these objections, Condorcet's jury theorem provides a theoretical basis for [[democracy]], even if somewhat idealized, as well as a basis of the decision of [[Question of fact|questions of fact]] by [[jury trial]], and as such continues to be studied by political scientists. |
|||
{{Main|Condorcet's jury theorem#Applicability to democratic processes}} |
|||
The applicability of jury theorems, in particular, Condorcet's Jury Theorem (CJT) to democratic processes is debated, as it can prove majority rule to be a perfect mechanism or a disaster depending on individual competence. Recent studies show that, in a non-homogeneous case, the theorem's thesis does not hold almost surely (unless weighted majority rule is used with stochastic weights that are correlated with epistemic rationality but such that every voter has a minimal weight of one).<ref>{{Cite journal |last=Romaniega Sancho |first=Álvaro |date=2022-09-01 |title=On the probability of the Condorcet Jury Theorem or the Miracle of Aggregation |url=https://www.sciencedirect.com/science/article/pii/S0165489622000543 |journal=Mathematical Social Sciences |language=en |volume=119 |pages=41–55 |doi=10.1016/j.mathsocsci.2022.06.002 |arxiv=2108.00733 |s2cid=249921504 |issn=0165-4896}}</ref> |
|||
== Further reading == |
== Further reading == |
||
⚫ | |||
* Majority systems and the Condorcet jury theorem:<ref name=":12">{{Cite journal|last=Boland|first=Philip J.|date=1989|title=Majority Systems and the Condorcet Jury Theorem|journal=Journal of the Royal Statistical Society, Series D (The Statistician)|language=en|volume=38|issue=3|pages=181–189|doi=10.2307/2348873|issn=1467-9884|jstor=2348873}}</ref> discusses non-homogeneous and correlated voters, and indirect majority systems. |
|||
* |
*Evolution in collective decision making.<ref>{{Cite journal|year=2017|title=Evolution in collective decision making|journal=Understanding Collective Decision Making|pages=167–192|doi=10.4337/9781783473151.00011|isbn=9781783473151}}</ref> |
||
⚫ | *Realizing Epistemic Democracy: a criticism on the assumptions of jury theorems.<ref>{{Citation|last=Pivato|first=Marcus|title=Realizing Epistemic Democracy|date=2019|url=https://doi.org/10.1007/978-3-030-18050-8_16|work=The Future of Economic Design: The Continuing Development of a Field as Envisioned by Its Researchers|pages=103–112|editor-last=Laslier|editor-first=Jean-François|series=Studies in Economic Design|place=Cham|publisher=Springer International Publishing|language=en|doi=10.1007/978-3-030-18050-8_16|isbn=978-3-030-18050-8|s2cid=211399419|access-date=2021-05-27|editor2-last=Moulin|editor2-first=Hervé|editor3-last=Sanver|editor3-first=M. Remzi|editor4-last=Zwicker|editor4-first=William S.}}</ref> |
||
⚫ | |||
⚫ | *The Epistemology of Democracy: a comparison of jury theorems to two other epistemic models of democracy: [[experimentalism]] and [[Diversity trumps ability]].<ref>{{Cite journal|last=Anderson|first=Elizabeth|date=2006|title=The Epistemology of Democracy|url=https://muse.jhu.edu/article/209431|journal=Episteme: A Journal of Social Epistemology|volume=3|issue=1|pages=8–22|doi=10.1353/epi.0.0000|issn=1750-0117|doi-access=free}}</ref> |
||
⚫ | *Realizing Epistemic Democracy: a criticism on the assumptions of jury theorems.<ref>{{Citation|last=Pivato|first=Marcus|title=Realizing Epistemic Democracy|date=2019|url=https://doi.org/10.1007/978-3-030-18050-8_16|work=The Future of Economic Design: The Continuing Development of a Field as Envisioned by Its Researchers|pages=103–112|editor-last=Laslier|editor-first=Jean-François|series=Studies in Economic Design|place=Cham|publisher=Springer International Publishing|language=en|doi=10.1007/978-3-030-18050-8_16|isbn=978-3-030-18050-8|access-date=2021-05-27|editor2-last=Moulin|editor2-first=Hervé|editor3-last=Sanver|editor3-first=M. Remzi|editor4-last=Zwicker|editor4-first=William S.}}</ref> |
||
⚫ | *The Epistemology of Democracy: a comparison of jury theorems to two other epistemic models of democracy: [[experimentalism]] and [[Diversity trumps ability]].<ref>{{Cite journal|last=Anderson|first=Elizabeth|date=2006|title=The Epistemology of Democracy|url=https://muse.jhu.edu/article/209431|journal=Episteme: A Journal of Social Epistemology|volume=3|issue=1|pages=8–22|doi=10.1353/epi.0.0000|issn=1750-0117}}</ref> |
||
== References == |
== References == |
||
{{Reflist}} |
|||
<references /> |
|||
[[Category:Probability theorems]] |
[[Category:Probability theorems]] |
||
[[Category:Voting theory]] |
[[Category:Voting theory]] |
Latest revision as of 23:21, 20 August 2024
A jury theorem is a mathematical theorem proving that, under certain assumptions, a decision attained using majority voting in a large group is more likely to be correct than a decision attained by a single expert. It serves as a formal argument for the idea of wisdom of the crowd, for decision of questions of fact by jury trial, and for democracy in general.[1]
The first and most famous jury theorem is Condorcet's jury theorem. It assumes that all voters have independent probabilities to vote for the correct alternative, these probabilities are larger than 1/2, and are the same for all voters. Under these assumptions, the probability that the majority decision is correct is strictly larger when the group is larger; and when the group size tends to infinity, the probability that the majority decision is correct tends to 1.
There are many other jury theorems, relaxing some or all of these assumptions.
Setting
The premise of all jury theorems is that there is an objective truth, which is unknown to the voters. Most theorems focus on binary issues (issues with two possible states), for example, whether a certain defendant is guilty or innocent, whether a certain stock is going to rise or fall, etc. There are n voters (or jurors), and their goal is to reveal the truth. Each voter has an opinion about which of the two options is correct. The opinion of each voter is either correct (i.e., equals the true state), or wrong (i.e., differs from the true state). This is in contrast to other settings of voting, in which the opinion of each voter represents his/her subjective preferences and is thus always "correct" for this specific voter. The opinion of a voter can be considered a random variable: for each voter, there is a positive probability that his/her opinion equals the true state.
The group decision is determined by the majority rule. For example, if a majority of voters says "guilty" then the decision is "guilty", while if a majority says "innocent" then the decision is "innocent". To avoid ties, it is often assumed that the number of voters n is odd. Alternatively, if n is even, then ties are broken by tossing a fair coin.
Jury theorems are interested in the probability of correctness - the probability that the majority decision coincides with the objective truth. Typical jury theorems make two kinds of claims on this probability:[1]
1. Growing Reliability: the probability of correctness is larger when the group is larger.
2. Crowd Infallibility: the probability of correctness goes to 1 when the group size goes to infinity.
Claim 1 is often called the non-asymptotic part and claim 2 is often called the asymptotic part of the jury theorem.
Obviously, these claims are not always true, but they are true under certain assumptions on the voters. Different jury theorems make different assumptions.
Independence, competence, and uniformity
Condorcet's jury theorem makes the following three assumptions:
- Unconditional Independence: the voters make up their minds independently. In other words, their opinions are independent random variables.
- Unconditional Competence: the probability that the opinion of a single voter coincides with the objective truth is larger than 1/2 (i.e., the voter is smarter than a random coin-toss).
- Uniformity: all voters have the same probability of being correct.
The jury theorem of Condorcet says that these three assumptions imply Growing Reliability and Crowd Infallibility.
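The two claims can be checked numerically for small juries. The following is a minimal sketch, assuming an odd number of voters and a common competence `p`; the function name `majority_correct_probability` and the value `p = 0.6` are illustrative choices, not taken from the sources cited here.

```python
from math import comb

def majority_correct_probability(n: int, p: float) -> float:
    """Probability that a strict majority of n independent voters is correct,
    when every voter is correct with the same probability p (n must be odd)."""
    if n % 2 == 0:
        raise ValueError("n must be odd to avoid ties")
    k_min = n // 2 + 1  # smallest number of correct votes that forms a majority
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(k_min, n + 1))

# Illustrative competence level (an assumption): each voter is right 60% of the time.
for n in (1, 3, 5, 11, 101):
    print(n, round(majority_correct_probability(n, 0.6), 4))
```

With `p` above 1/2 the printed values increase with `n` and approach 1, which is what Growing Reliability and Crowd Infallibility assert under the three assumptions.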
Correlated votes: weakening the independence assumption
The opinions of different voters are often correlated, so Unconditional Independence may not hold. In this case, the Growing Reliability claim might fail.
Example
Let be the probability of a juror voting for the correct alternative and be the (second-order) correlation coefficient between any two correct votes. If all higher-order correlation coefficients in the Bahadur representation[2] of the joint probability distribution of votes are equal to zero, and is an admissible pair, then the probability of the jury collectively reaching the correct decision under simple majority is given by:
where is the regularized incomplete beta function.
Example: Take a jury of three jurors , with individual competence and second-order correlation . Then . The competence of the jury is lower than the competence of a single juror, which equals to . Moreover, enlarging the jury by two jurors decreases the jury competence even further, . Note that and is an admissible pair of parameters. For and , the maximum admissible second-order correlation coefficient equals .
The above example shows that when the individual competence is low but the correlation is high:
- The collective competence under simple majority may fall below that of a single juror;
- Enlarging the jury may decrease its collective competence.
The above result is due to Kaniovski and Zaigraev. They also discuss optimal jury design for homogeneous juries with correlated votes.[3]
There are several jury theorems that weaken the Independence assumption in various ways.
Truth-sensitive independence and competence
In binary decision problems, there is often one option that is easier to detect than the other one. For example, it may be easier to detect that a defendant is guilty (as there is clear evidence for guilt) than to detect that he is innocent. In this case, the probability that the opinion of a single voter is correct is represented by two different numbers: the probability given that option #1 is correct, and the probability given that option #2 is correct. This also implies that opinions of different voters are correlated. This motivates the following relaxations of the above assumptions:
- Conditional Independence: for each of the two options, the voters' opinions given that this option is the true one are independent random variables.
- Conditional Competence: for each of the two options, the probability that a single voter's opinion is correct given that this option is true is larger than 1/2.
- Conditional Uniformity: for each of the two options, all voters have the same probability of being correct given that this option is true.
Growing Reliability and Crowd Infallibility continue to hold under these weaker assumptions.[1]
One criticism of Conditional Competence is that it depends on the way the decision question is formulated. For example, instead of asking whether the defendant is guilty or innocent, one can ask whether the defendant is guilty of exactly 10 charges (option A), or guilty of another number of charges (0 to 9, or 11 or more). This changes the conditions, and hence, the conditional probability. Moreover, if the state is very specific, then the probability of voting correctly might be below 1/2, so Conditional Competence might not hold.[4]
Effect of an opinion leader
Another cause of correlation between voters is the existence of an opinion leader. Suppose each voter makes an independent decision, but then each voter, with some fixed probability, changes his opinion to match that of the opinion leader. Jury theorems by Boland[5] and Boland, Proschan and Tong[6] show that, if (and only if) the probability of following the opinion leader is less than 1 - 1/(2p) (where p is the competence level of all voters), then Crowd Infallibility holds.
Problem-sensitive independence and competence
In addition to the dependence on the true option, there are many other reasons for which voters' opinions may be correlated. For example:
- Deliberation among voters;
- Peer pressure;
- False evidence (e.g. a guilty defendant who excels at pretending to be innocent);
- External conditions (e.g. poor weather affecting their judgement);
- Any other common cause of votes.
It is possible to weaken the Conditional Independence assumption, and conditionalize on all common causes of the votes (rather than just the state). In other words, the votes are now independent conditioned on the specific decision problem. However, in a specific problem, the Conditional Competence assumption may not be valid. For example, in a specific problem with false evidence, it is likely that most voters will have a wrong opinion. Thus, the two assumptions - conditional independence and conditional competence - are not justifiable simultaneously (under the same conditionalization).[7]
A possible solution is to weaken Conditional Competence as follows. For each voter and each problem x, there is a probability p(x) that the voter's opinion is correct in this specific problem. Since x is a random variable, p(x) is a random variable too. Conditional Competence requires that p(x) > 1/2 with probability 1. The weakened assumption is:
- Tendency to Competence: for each voter, and for each r>0, the probability that p(x) = 1/2+r is at least as large as the probability that p(x) = 1/2-r.
A jury theorem by Dietrich and Spiekermann[8] says that Conditional Independence, Tendency to Competence, and Conditional Uniformity together imply Growing Reliability. Note that Crowd Infallibility is not implied. In fact, the probability of correctness tends to a value which is below 1 if and only if Conditional Competence does not hold.
Bounded correlation
A jury theorem by Pivato[9] shows that, if the average covariance between voters becomes small as the population becomes large, then Crowd Infallibility holds (for some voting rule). There are other jury theorems that take into account the degree to which votes may be correlated.[10][11]
Other solutions
Other ways to cope with voter correlation include causal networks, dependence structures, and interchangeability.[1]: 2.2
Diverse capabilities: weakening the uniformity assumption
Different voters often have different competence levels, so the Uniformity assumption does not hold. In this case, both Growing Reliability and Crowd Infallibility may fail. This may happen if new voters have much lower competence than existing voters, so that adding new voters decreases the group's probability of correctness. In some cases, the probability of correctness might converge to 1/2 (a random decision) rather than to 1.[12]
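A small calculation illustrates how adding weaker voters can lower the probability of correctness. This is a sketch only: the competence values 0.9 and 0.6 are made-up numbers, and the votes are assumed to be independent.

```python
def majority_correct_probability(competences):
    """Exact probability that a strict majority of independent voters is correct.
    competences[i] is the probability that voter i is correct (odd length assumed)."""
    n = len(competences)
    if n % 2 == 0:
        raise ValueError("use an odd number of voters to avoid ties")
    # dist[k] = probability that exactly k of the voters processed so far are correct
    dist = [1.0]
    for p in competences:
        nxt = [0.0] * (len(dist) + 1)
        for k, prob in enumerate(dist):
            nxt[k] += prob * (1 - p)      # this voter is wrong
            nxt[k + 1] += prob * p        # this voter is correct
        dist = nxt
    return sum(dist[n // 2 + 1:])

# Hypothetical example: one strong voter alone, then joined by two weaker voters.
expert_alone = majority_correct_probability([0.9])
with_two_weaker = majority_correct_probability([0.9, 0.6, 0.6])
print(expert_alone, with_two_weaker)
```

In this example the three-member group is correct less often than the strong voter alone, even though every voter is individually better than a coin toss.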
Stronger competence requirements
Uniformity can be dropped if the Competence assumption is strengthened. There are several ways to strengthen it:
- Strong Competence: for each voter i, the probability of correctness p_i is at least 1/2+e, where e>0 is fixed for all voters. In other words: the competence is bounded away from a fair coin toss. A jury theorem by Paroush[12] shows that Strong Competence and Conditional Independence together imply Crowd Infallibility (but not Growing Reliability).
- Average Competence: the average of the individual competence levels of the voters (i.e. the average of their individual probabilities of deciding correctly) is slightly greater than half, or converges to a value above 1/2. Jury theorems by Grofman, Owen and Feld,[13] and Berend and Paroush,[14] show that Average Competence and Conditional Independence together imply Crowd Infallibility (but not Growing Reliability).
Random voter selection
Instead of assuming that the voter identity is fixed, one can assume that there is a large pool of potential voters with different competence levels, and the actual voters are selected at random from this pool (as in sortition).
A jury theorem by Ben-Yashar and Paroush[15] shows that, under certain conditions, the correctness probability of a jury, or of a subset of it chosen at random, is larger than the correctness probability of a single juror selected at random. A more general jury theorem by Berend and Sapir[16] proves that Growing Reliability holds in this setting: the correctness probability of a random committee increases with the committee size. The theorem holds, under certain conditions, even with correlated votes.[17]
A jury theorem by Owen, Grofman and Feld[18] analyzes a setting where the competence level is random. They show what distribution of individual competence maximizes or minimizes the probability of correctness.
Weighted majority rule
When the competence levels of the voters are known, the simple majority rule may not be the best decision rule. There are various works on identifying the optimal decision rule - the rule maximizing the group correctness probability. Nitzan and Paroush[19] show that, under Unconditional Independence, the optimal decision rule is a weighted majority rule, where the weight of each voter with correctness probability p_i is log(p_i/(1-p_i)), and an alternative is selected if the sum of weights of its supporters is above some threshold. Grofman and Shapley[20] analyze the effect of interdependencies between voters on the optimal decision rule. Ben-Yashar and Nitzan[21] prove a more general result.
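The log-odds weighting can be sketched in a few lines. The encoding of votes as +1 and -1, the labels "A" and "B", and the default threshold of 0 on the signed sum (which corresponds to treating both alternatives symmetrically) are assumptions made for this illustration; the competence values are invented.

```python
from math import log

def log_odds_weight(p: float) -> float:
    """Weight log(p / (1 - p)) of a voter who is correct with probability p (0 < p < 1)."""
    return log(p / (1 - p))

def weighted_majority(votes, competences, threshold=0.0):
    """votes[i] is +1 (voter i supports A) or -1 (voter i supports B).
    Returns "A" if the signed sum of log-odds weights exceeds the threshold, else "B"."""
    score = sum(v * log_odds_weight(p) for v, p in zip(votes, competences))
    return "A" if score > threshold else "B"

# Hypothetical profile: one highly competent voter for A, two weaker voters for B.
votes = [+1, -1, -1]
competences = [0.9, 0.6, 0.6]
print(weighted_majority(votes, competences))
```

Here the single voter with competence 0.9 outweighs the two voters with competence 0.6, so the weighted rule selects A although a simple majority would select B.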
Dietrich[22] generalizes this result to a setting that does not require prior probabilities of the 'correctness' of the two alternatives. The only required assumption is Epistemic Monotonicity, which says that, if under a certain profile alternative x is selected, and the profile changes such that x becomes more probable, then x is still selected. Dietrich shows that Epistemic Monotonicity implies that the optimal decision rule is weighted majority with a threshold. In the same paper, he generalizes the optimal decision rule to a setting that does not require the input to be a vote for one of the alternatives. It can be, for example, a subjective degree of belief. Moreover, competence parameters do not need to be known. For example, if the inputs are subjective beliefs x_1,...,x_n, then the optimal decision rule sums log(x_i/(1-x_i)) and checks whether the sum is above some threshold. Epistemic Monotonicity is not sufficient for computing the threshold itself; the threshold can be computed by assuming expected-utility maximization and prior probabilities.
A general problem with weighted majority rules is that they require knowing the competence levels of the different voters, which are usually hard to compute in an objective way. Baharad, Goldberger, Koppel and Nitzan[23] present an algorithm that solves this problem using statistical machine learning. It requires as input only a list of past votes; it does not need to know whether these votes were correct or not. If the list is sufficiently large, then the algorithm's probability of correctness converges to 1 even if the individual voters' competence levels are close to 1/2.
More than two options
Often, decision problems involve three or more options. This critical limitation was in fact recognized by Condorcet (see Condorcet's paradox), and in general it is very difficult to reconcile individual decisions between three or more outcomes (see Arrow's theorem).
This limitation may also be overcome by means of a sequence of votes on pairs of alternatives, as is commonly realized via the legislative amendment process. (However, as per Arrow's theorem, this creates a "path dependence" on the exact sequence of pairs of alternatives; e.g., which amendment is proposed first can make a difference in what amendment is ultimately passed, or if the law—with or without amendments—is passed at all.)
With three or more options, Conditional Competence can be generalized as follows:
- Multioption Conditional Competence: for any two options x and y, if x is correct and y is not, then any voter is more likely to vote for x than for y.
A jury theorem by List and Goodin shows that Multioption Conditional Competence and Conditional Independence together imply Crowd Infallibility.[24] Dietrich and Spiekermann conjecture that they imply Growing Reliability too.[1] Another related jury theorem is by Everaere, Konieczny and Marquis.[25]
When there are more than two options, there are various voting rules that can be used instead of simple majority. The statistical and utilitarian properties of such rules are analyzed e.g. by Pivato.[26][27]
Indirect majority systems
Condorcet's theorem considers a direct majority system, in which all votes are counted directly towards the final outcome. Many countries use an indirect majority system, in which the voters are divided into groups. The voters in each group decide on an outcome by an internal majority vote; then, the groups decide on the final outcome by a majority vote among them. For example,[5] suppose there are 15 voters. In a direct majority system, a decision is accepted whenever at least 8 votes support it. Suppose now that the voters are grouped into 3 groups of size 5 each. A decision is accepted whenever at least 2 groups support it, and in each group, a decision is accepted whenever at least 3 voters support it. Therefore, a decision may be accepted even if only 6 voters support it.
Boland, Proschan and Tong[6] prove that, when the voters are independent and p>1/2, a direct majority system - as in Condorcet's theorem - always has a higher chance of accepting the correct decision than any indirect majority system.
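The comparison can be illustrated on the 15-voter example. This is a sketch under stated assumptions: voters are independent with a common competence `p`, the value `p = 0.6` is chosen only for illustration, and the helper names are hypothetical. It checks one configuration and is not a proof of the general result.

```python
from math import comb

def majority_prob(n: int, p: float) -> float:
    """Probability that a strict majority of n independent units is correct,
    each being correct with probability p (n assumed odd)."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n // 2 + 1, n + 1))

def indirect_majority_prob(groups: int, group_size: int, p: float) -> float:
    """Two-tier system: each group decides by internal majority,
    then the groups decide by majority among themselves."""
    group_correct = majority_prob(group_size, p)  # competence of one group
    return majority_prob(groups, group_correct)

p = 0.6  # illustrative competence level (an assumption)
direct = majority_prob(15, p)                # 15 voters, at least 8 must be correct
indirect = indirect_majority_prob(3, 5, p)   # 3 groups of 5 voters each
print(direct, indirect)
```

For this choice of `p`, the direct system is correct more often than the 3-by-5 indirect system, in line with the theorem.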
Berg and Paroush[28] consider multi-tier voting hierarchies, which may have several levels with different decision-making rules in each level. They study the optimal voting structure, and compare the competence against the benefit of time-saving and other expenses.
Goodin and Spiekermann[29] compute the amount by which a small group of experts should be better than the average voters, in order for them to reach better decisions.
Strategic voting
It is well known that, when there are three or more alternatives, and voters have different preferences, they may engage in strategic voting, for example by voting for the second-best option in order to prevent the worst option from being elected. Surprisingly, strategic voting might occur even with two alternatives and when all voters have the same preference, which is to reveal the truth. For example, suppose the question is whether a defendant is guilty or innocent, and suppose a certain juror thinks the true answer is "guilty". However, he also knows that his vote is effective only if the other votes are tied. But, if other votes are tied, it means that the probability that the defendant is guilty is close to 1/2. Taking this into account, our juror might decide that this probability is not sufficient for deciding "guilty", and thus will vote "innocent". But if all other voters do the same, the wrong answer is derived. In game-theoretic terms, truthful voting might not be a Nash equilibrium.[30] This problem has been termed the swing voter's curse,[31] as it is analogous to the winner's curse in auction theory.
A jury theorem by Peleg and Zamir[32] shows sufficient and necessary conditions for the existence of a Bayesian-Nash equilibrium that satisfies Condorcet's jury theorem. Bozbay, Dietrich and Peters[33] show voting rules that lead to efficient aggregation of the voters' private information even with strategic voting.
In practice, this problem may not be very severe, since most voters care not only about the final outcome, but also about voting correctly by their conscience. Moreover, most voters are not sophisticated enough to vote strategically.[1]: 4.7
Subjective opinions
The notion of "correctness" may not be meaningful when making policy decisions, which are based on values or preferences, rather than just on facts.
Some defenders of the theorem hold that it is applicable when voting is aimed at determining which policy best promotes the public good, rather than at merely expressing individual preferences. On this reading, what the theorem says is that although each member of the electorate may only have a vague perception of which of two policies is better, majority voting has an amplifying effect. The "group competence level", as represented by the probability that the majority chooses the better alternative, increases towards 1 as the size of the electorate grows, assuming that each voter is more often right than wrong.
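The amplifying effect can be checked numerically. The following minimal sketch assumes the setting of Condorcet's jury theorem (independent voters with the same competence p > 1/2, an odd number of voters); the value p = 0.55 and the electorate sizes are chosen only for illustration.

```python
from math import comb


def group_competence(n: int, p: float) -> float:
    """Probability that a strict majority of n independent voters, each
    correct with probability p, chooses the better alternative (n odd)."""
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(n // 2 + 1, n + 1)
    )


p = 0.55                   # each voter is only slightly better than a coin
sizes = [1, 11, 101, 501]  # illustrative electorate sizes
values = [group_competence(n, p) for n in sizes]

for n, value in zip(sizes, values):
    print(f"n = {n:4d}: group competence = {value:.4f}")
```

Even with individual competence barely above 1/2, the computed group competence rises steadily with the size of the electorate.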
Several papers show that, under reasonable conditions, large groups are better trackers of the majority preference.[34]: 323 [35][36]
Applicability
The applicability of jury theorems, in particular Condorcet's Jury Theorem (CJT), to democratic processes is debated, as the theorem can show majority rule to be either a perfect mechanism or a disaster, depending on individual competence. Recent studies show that, in the non-homogeneous case, the theorem's thesis almost surely fails to hold (unless weighted majority rule is used with stochastic weights that are correlated with epistemic rationality but such that every voter has a minimal weight of one).[37]
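The dependence on individual competence can be illustrated for voters with unequal competences. The sketch below is only an illustration under assumed numbers; it does not reproduce the measure-theoretic argument of the cited study, and it uses simple (unweighted) majority rule. It assumes independent voters and computes the exact distribution of the number of correct votes by dynamic programming.

```python
def majority_prob_heterogeneous(competences: list[float]) -> float:
    """Exact probability that a strict majority of independent voters is
    correct, where voter i is correct with probability competences[i]."""
    # dist[k] = probability that exactly k of the voters seen so far are correct
    dist = [1.0]
    for p in competences:
        new = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            new[k] += mass * (1 - p)   # this voter is wrong
            new[k + 1] += mass * p     # this voter is correct
        dist = new
    n = len(competences)
    return sum(dist[n // 2 + 1:])


# Two hypothetical electorates of 99 voters with mixed competences.
mostly_competent = [0.55, 0.60, 0.65] * 33    # average competence 0.60
mostly_incompetent = [0.35, 0.40, 0.45] * 33  # average competence 0.40

good = majority_prob_heterogeneous(mostly_competent)
bad = majority_prob_heterogeneous(mostly_incompetent)

print(f"average competence 0.60: majority correct with probability {good:.4f}")
print(f"average competence 0.40: majority correct with probability {bad:.4f}")
```

With these numbers the same rule is very reliable for the first electorate and very unreliable for the second, which is the sense in which its performance depends on individual competence.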
Further reading
- Law of large numbers: a mathematical generalization of jury theorems.
- Evolution in collective decision making.[38]
- Realizing Epistemic Democracy: a criticism of the assumptions of jury theorems.[39]
- The Epistemology of Democracy: a comparison of jury theorems to two other epistemic models of democracy: experimentalism and Diversity trumps ability.[40]
References
- ^ a b c d e f "Jury Theorems" entry by Franz Dietrich & Kai Spiekermann in the Stanford Encyclopedia of Philosophy, November 17, 2021
- ^ Bahadur, R.R. (1961). "A representation of the joint distribution of responses to n dichotomous items". H. Solomon (Ed.), Studies in Item Analysis and Prediction: 158–168.
- ^ Kaniovski, Serguei; Zaigraev, Alexander (2011). "Optimal Jury Design for Homogeneous Juries with Correlated Votes" (PDF). Theory and Decision. 71 (4): 439–459. CiteSeerX 10.1.1.225.5613. doi:10.1007/s11238-009-9170-2. S2CID 9189720.
- ^ Estlund, David (2009-08-03). Democratic Authority: A Philosophical Framework. Princeton University Press. ISBN 978-1-4008-3154-8.
- ^ a b Boland, Philip J. (1989). "Majority Systems and the Condorcet Jury Theorem". Journal of the Royal Statistical Society, Series D (The Statistician). 38 (3): 181–189. doi:10.2307/2348873. ISSN 1467-9884. JSTOR 2348873.
- ^ a b Boland, Philip J.; Proschan, Frank; Tong, Y. L. (March 1989). "Modelling dependence in simple and indirect majority systems". Journal of Applied Probability. 26 (1): 81–88. doi:10.2307/3214318. ISSN 0021-9002. JSTOR 3214318. S2CID 123605673.
- ^ Dietrich, Franz (2008). "The Premises of Condorcet's Jury Theorem Are Not Simultaneously Justified". Episteme: A Journal of Social Epistemology. 5 (1): 56–73. doi:10.1353/epi.0.0023. ISSN 1750-0117. S2CID 9214091.
- ^ Dietrich, Franz; Spiekermann, Kai (2013-03-01). "Epistemic democracy with defensible premises". Economics and Philosophy. 29 (1): 87–120. doi:10.1017/S0266267113000096. ISSN 0266-2671. S2CID 55692104.
- ^ Pivato, Marcus (2017-10-01). "Epistemic democracy with correlated voters". Journal of Mathematical Economics. 72: 51–69. doi:10.1016/j.jmateco.2017.06.001. ISSN 0304-4068.
- ^ James Hawthorne. "Voting In Search of the Public Good: the Probabilistic Logic of Majority Judgments" (PDF). Archived from the original (PDF) on 2016-03-23. Retrieved 2009-04-20.
- ^ see for example: Krishna K. Ladha (August 1992). "The Condorcet Jury Theorem, Free Speech, and Correlated Votes". American Journal of Political Science. 36 (3): 617–634. doi:10.2307/2111584. JSTOR 2111584.
- ^ a b Paroush, Jacob (1998). "Stay away from fair coins: A Condorcet jury theorem". Social Choice and Welfare. 15 (1): 15–20. doi:10.1007/s003550050088. ISSN 0176-1714. JSTOR 41106237. S2CID 153646874.
- ^ Bernard Grofman; Guillermo Owen; Scott L. Feld (1983). "Thirteen theorems in search of the truth" (PDF). Theory and Decision. 15 (3): 261–78. doi:10.1007/BF00125672. S2CID 50576036.
- ^ Berend, Daniel; Paroush, Jacob (1998). "When is Condorcet's Jury Theorem valid?". Social Choice and Welfare. 15 (4): 481–488. doi:10.1007/s003550050118. ISSN 0176-1714. JSTOR 41106274. S2CID 120012958.
- ^ Ben-Yashar, Ruth; Paroush, Jacob (2000-03-01). "A nonasymptotic Condorcet jury theorem". Social Choice and Welfare. 17 (2): 189–199. doi:10.1007/s003550050014. ISSN 1432-217X. S2CID 32072741.
- ^ Berend, Daniel; Sapir, Luba (2005). "Monotonicity in Condorcet Jury Theorem". Social Choice and Welfare. 24 (1): 83–92. doi:10.1007/s00355-003-0293-z. ISSN 0176-1714. JSTOR 41106652. S2CID 5617331.
- ^ Berend, Daniel; Sapir, Luba (2007). "Monotonicity in Condorcet's Jury Theorem with dependent voters". Social Choice and Welfare. 28 (3): 507–528. doi:10.1007/s00355-006-0179-y. ISSN 0176-1714. JSTOR 41106830. S2CID 41180424.
- ^ Owen, Guillermo; Grofman, Bernard; Feld, Scott L. (1989-02-01). "Proving a distribution-free generalization of the Condorcet Jury Theorem". Mathematical Social Sciences. 17 (1): 1–16. doi:10.1016/0165-4896(89)90012-7. ISSN 0165-4896.
- ^ Nitzan, Shmuel; Paroush, Jacob (1982). "Optimal Decision Rules in Uncertain Dichotomous Choice Situations". International Economic Review. 23 (2): 289–297. doi:10.2307/2526438. ISSN 0020-6598. JSTOR 2526438.
- ^ Shapley, Lloyd; Grofman, Bernard (1984-01-01). "Optimizing group judgmental accuracy in the presence of interdependencies". Public Choice. 43 (3): 329–343. doi:10.1007/BF00118940. ISSN 1573-7101. S2CID 14858639.
- ^ Ben-Yashar, Ruth C.; Nitzan, Shmuel I. (1997). "The Optimal Decision Rule for Fixed-Size Committees in Dichotomous Choice Situations: The General Result". International Economic Review. 38 (1): 175–186. doi:10.2307/2527413. ISSN 0020-6598. JSTOR 2527413.
- ^ Dietrich, Franz (2006). "General representation of epistemically optimal procedures". Social Choice and Welfare. 26 (2): 263–283. doi:10.1007/s00355-006-0094-2. ISSN 0176-1714. JSTOR 41106734. S2CID 12716206.
- ^ Baharad, Eyal; Goldberger, Jacob; Koppel, Moshe; Nitzan, Shmuel (2012-01-01). "Beyond Condorcet: optimal aggregation rules using voting records". Theory and Decision. 72 (1): 113–130. doi:10.1007/s11238-010-9240-5. hdl:10419/46518. ISSN 1573-7187. S2CID 189822673.
- ^ Christian List and Robert Goodin (September 2001). "Epistemic democracy : generalizing the Condorcet Jury Theorem" (PDF). Journal of Political Philosophy. 9 (3): 277–306. CiteSeerX 10.1.1.105.9476. doi:10.1111/1467-9760.00128.
- ^ Patricia Everaere, Sébastien Konieczny and Pierre Marquis (August 2010). "The Epistemic View of Belief Merging: Can We Track the Truth?" (PDF). Proceedings of the 19th European Conference on Artificial Intelligence (ECAI'10). 215 (ECAI 2010): 621–626. CiteSeerX 10.1.1.298.3965. doi:10.3233/978-1-60750-606-5-621.
- ^ Pivato, Marcus (2013). "Voting rules as statistical estimators". Social Choice and Welfare. 40 (2): 581–630. doi:10.1007/s00355-011-0619-1. ISSN 1432-217X. S2CID 22310477.
- ^ Pivato, Marcus (2016-08-01). "Asymptotic utilitarianism in scoring rules". Social Choice and Welfare. 47 (2): 431–458. doi:10.1007/s00355-016-0971-2. ISSN 1432-217X. S2CID 34482765.
- ^ Berg, Sven; Paroush, Jacob (1998-05-01). "Collective decision making in hierarchies". Mathematical Social Sciences. 35 (3): 233–244. doi:10.1016/S0165-4896(97)00047-4. ISSN 0165-4896.
- ^ Goodin, Robert E.; Spiekermann, Kai (2012-11-01). "Epistemic aspects of representative government". European Political Science Review. 4 (3): 303–325. doi:10.1017/S1755773911000245. ISSN 1755-7739. S2CID 85556702.
- ^ Austen-Smith, David; Banks, Jeffrey S. (1996). "Information aggregation, rationality, and the Condorcet Jury Theorem" (PDF). American Political Science Review. 90 (1): 34–45. doi:10.2307/2082796. JSTOR 2082796. S2CID 8495814.
- ^ Feddersen, Timothy J.; Pesendorfer, Wolfgang (1996). "The Swing Voter's Curse". The American Economic Review. 86 (3): 408–424. ISSN 0002-8282. JSTOR 2118204.
- ^ Peleg, Bezalel; Zamir, Shmuel (2012). "Extending the Condorcet Jury Theorem to a general dependent jury". Social Choice and Welfare. 39 (1): 91–125. doi:10.1007/s00355-011-0546-1. ISSN 0176-1714. JSTOR 41485510. S2CID 5685386.
- ^ Bozbay, İrem; Dietrich, Franz; Peters, Hans (2014-09-01). "Judgment aggregation in search for the truth". Games and Economic Behavior. 87: 571–590. doi:10.1016/j.geb.2014.02.007. ISSN 0899-8256.
- ^ Goldman, Alvin (2002). "Knowledge in a Social World". Philosophy and Phenomenological Research. 64 (1): 185–190. doi:10.1111/j.1933-1592.2002.tb00151.x.
- ^ Goodin, Robert E.; Spiekermann, Kai (December 2015). "Epistemic solidarity as a political strategy". Episteme. 12 (4): 439–457. doi:10.1017/epi.2015.29. ISSN 1742-3600. S2CID 142927949.
- ^ List, Christian; Spiekermann, Kai (2016), "The Condorcet Jury Theorem and Voter-Specific Truth", Goldman and His Critics, John Wiley & Sons, Ltd, pp. 219–233, doi:10.1002/9781118609378.ch10, ISBN 978-1-118-60937-8, retrieved 2021-05-27
- ^ Romaniega Sancho, Álvaro (2022-09-01). "On the probability of the Condorcet Jury Theorem or the Miracle of Aggregation". Mathematical Social Sciences. 119: 41–55. arXiv:2108.00733. doi:10.1016/j.mathsocsci.2022.06.002. ISSN 0165-4896. S2CID 249921504.
- ^ "Evolution in collective decision making". Understanding Collective Decision Making: 167–192. 2017. doi:10.4337/9781783473151.00011. ISBN 9781783473151.
- ^ Pivato, Marcus (2019), Laslier, Jean-François; Moulin, Hervé; Sanver, M. Remzi; Zwicker, William S. (eds.), "Realizing Epistemic Democracy", The Future of Economic Design: The Continuing Development of a Field as Envisioned by Its Researchers, Studies in Economic Design, Cham: Springer International Publishing, pp. 103–112, doi:10.1007/978-3-030-18050-8_16, ISBN 978-3-030-18050-8, S2CID 211399419, retrieved 2021-05-27
- ^ Anderson, Elizabeth (2006). "The Epistemology of Democracy". Episteme: A Journal of Social Epistemology. 3 (1): 8–22. doi:10.1353/epi.0.0000. ISSN 1750-0117.