Mean reciprocal rank: Difference between revisions

Content deleted Content added

Inline

Revision as of 05:48, 9 February 2011

Mean reciprocal rank is a statistic for evaluating any process that produces a list of possible responses to a query, ordered by probability of correctness. The reciprocal rank of a query response is the multiplicative inverse of the rank of the first correct answer. The mean reciprocal rank is the average of the reciprocal ranks of results for a sample of queries Q^[1]:

{\text{MRR}}={\frac {1}{|Q|}}\sum _{i=1}^{|Q|}{\frac {1}{{\text{rank}}_{i}}}.\!

Example

For example, suppose we have the following three sample queries for a system that tries to translate English words to their plurals. In each case, the system makes three guesses, with the first one being the one it thinks is most likely correct:

Query	Results	Correct response	Rank	Reciprocal rank
cat	catten, cati, cats	cats	3	1/3
torus	torii, tori, toruses	tori	2	1/2
virus	viruses, virii, viri	viruses	1	1

Given those three samples, we could calculate the mean reciprocal rank as (1/3 + 1/2 + 1)/3 = 11/18 or about 0.61.

This basic definition does not specify what to do if...

none of the proposed results are correct (use mean reciprocal rank 0), or if
there are multiple correct answers in the list. Consider using mean average precision (MAP).

References

^ E.M. Voorhees (1999). "Proceedings of the 8th Text Retrieval Conference". TREC-8 Question Answering Track Report. pp. 77–82. {{cite conference}}: Unknown parameter |booktitle= ignored (|book-title= suggested) (help)
^ D. R. Radev, H. Qi, H. Wu, W. Fan (2002). "Evaluating web-based question answering systems". Proceedings of LREC. {{cite conference}}: Unknown parameter |booktitle= ignored (|book-title= suggested) (help)CS1 maint: multiple names: authors list (link)

[1] E.M. Voorhees (1999). "Proceedings of the 8th Text Retrieval Conference". TREC-8 Question Answering Track Report. pp. 77–82. {{cite conference}}: Unknown parameter |booktitle= ignored (|book-title= suggested) (help)

[2] D. R. Radev, H. Qi, H. Wu, W. Fan (2002). "Evaluating web-based question answering systems". Proceedings of LREC. {{cite conference}}: Unknown parameter |booktitle= ignored (|book-title= suggested) (help)CS1 maint: multiple names: authors list (link)

[1]

[2]

@@ Line 38: / Line 38: @@
 This basic definition does not specify what to do if...
 # none of the proposed results are correct (use mean reciprocal rank 0), or if
-# there are multiple correct answers in the list. Consider using [[Information_retrieval#Mean_Average_precision|mean average precision (MAP)]].
+# there are multiple correct answers in the list. Consider using [[Information_retrieval#Mean_average_precision|mean average precision (MAP)]].
 See also [[Information retrieval]] and [[Question answering]]<ref>{{cite conference | title=Evaluating web-based question answering systems | booktitle=Proceedings of LREC | author=D. R. Radev, H. Qi, H. Wu, W. Fan |year=2002 }}</ref>.