Jump to content

Item-total correlation: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Amead (talk | contribs)
m wikified "trait" because it's an unusual word and is used here in a technical manner
m v2.05b - Bot T20 CW#61 - Fix errors for CW project (Reference before punctuation)
 
(10 intermediate revisions by 2 users not shown)
Line 1: Line 1:
The '''item–total correlation''' is the [[Pearson_correlation_coefficient|correlation]] between a scored item and the total [[test score]]. It is an item [[statistic]] used in [[psychometric]] analysis to diagnose assessment items that fail to indicate the underlying psychological [[Trait_theory|trait]] so that they can be removed or revised <ref>{{Cite journal|last=Henrysson|first=Sten|date=1963-06-01|title=Correction of item-total correlations in item analysis|journal=Psychometrika|language=en|volume=28|issue=2|pages=211–218|doi=10.1007/BF02289618|s2cid=120534016 |issn=1860-0980}}</ref>.
The '''item–total correlation''' is the [[Pearson_correlation_coefficient|correlation]] between a scored item and the total [[test score]]. It is an item [[statistic]] used in [[psychometric]] analysis to diagnose assessment items that fail to indicate the underlying psychological [[Trait_theory|trait]] so that they can be removed or revised.<ref name="Henrysson1963">{{Cite journal|last=Henrysson|first=Sten|date=1963-06-01|title=Correction of item-total correlations in item analysis|journal=Psychometrika|language=en|volume=28|issue=2|pages=211–218|doi=10.1007/BF02289618|s2cid=120534016 |issn=1860-0980}}</ref>


==The item-total correlation in item analysis==
==The item-total correlation in item analysis==
In [[item analysis]], an item–total correlation is usually calculated for each item of a scale or test to diagnose the degree to which assessment items indicate the underlying trait. Assuming that most of the items of an assessment do indicate the underlying trait, each item should have a reasonably strong positive correlation with the total score on that assessment. An important goal of item analysis is to identify and remove or revise items that are not good indicators of the underlying trait<ref>Churchill, G.A., (1979). "A paradigm for developing better measures of marketing constructs", ''[[Journal of Marketing Research]]'', 16(1) pp 64–73, {{doi|10.1177/002224377901600110}}, {{JSTOR|3150876}}</ref>.
In [[item analysis]], an item–total correlation is usually calculated for each item of a scale or test to diagnose the degree to which assessment items indicate the underlying trait. Assuming that most of the items of an assessment do indicate the underlying trait, each item should have a reasonably strong positive correlation with the total score on that assessment. An important goal of item analysis is to identify and remove or revise items that are not good indicators of the underlying trait.<ref>Churchill, G.A., (1979). "A paradigm for developing better measures of marketing constructs", ''[[Journal of Marketing Research]]'', 16(1) pp 64–73, {{doi|10.1177/002224377901600110}}, {{JSTOR|3150876}}</ref>


A small or negative item-correlation provides [[Empirical research|empirical evidence]] that the item is not measuring the same [[Construct_(psychology)|construct]] measured by the assessment. Exact values depend on the type of measure, but as a heuristic, a correlation value less than 0.2 indicates that the corresponding item does not correlate very well with the scale overall and, thus, it may be dropped. A negative value indicates that the item may be damaging the overall [[Reliability_(statistics)|psychometric reliability]] of the measure. <ref>Everitt, B.S. (2002) ''The Cambridge Dictionary of Statistics'', 2nd Edition, CUP. {{ISBN|0-521-81099-X}}</ref><ref>Field, A., (2005). ''Discovering Statistics Using SPSS''. 2nd ed. London: Sage</ref> Identifying and removing (or revising) poorly-performing items is a critical way that psychometric analysis can improve the quality of a measure.
A small or negative item-correlation provides [[Empirical research|empirical evidence]] that the item is not measuring the same [[Construct_(psychology)|construct]] measured by the assessment. Exact values depend on the type of measure, but as a heuristic, a correlation value less than 0.2 indicates that the corresponding item does not correlate very well with the scale overall and, thus, it may be dropped. A negative value indicates that the item may be damaging the overall [[Reliability_(statistics)|psychometric reliability]] of the measure. <ref>Everitt, B.S. (2002) ''The Cambridge Dictionary of Statistics'', 2nd Edition, CUP. {{ISBN|0-521-81099-X}}</ref><ref>Field, A., (2005). ''Discovering Statistics Using SPSS''. 2nd ed. London: Sage</ref> Identifying and removing (or revising) poorly-performing items is a critical way that psychometric analysis can improve the quality of a measure.


When items are scored dichotomously, as in [[Educational_assessment|exams with correct and incorrect answers]], the item-total correlation may be calculated as either a [[point-biserial correlation]] or a [[biserial correlation]]. This is considered important because items vary in difficulty and the point-biserial correlation cannot attain its theoretical maxima [+1,-1] unless the proportion correct is 0.50 (50% answering the item correctly). The biserial correlation is a correction that, in theory, avoids this issue. In practice, analysts should choose either the point-biserial or biserial and not try to compare, because the correction of the biserial will always produce a slightly larger magnitude as compared to the the point-biserial <ref>Allen, M.J., & Yen, W. M. (1979) ''Introduction to Measurement Theory'', Wadsworth. {{ISBN|0-8185-0283-5}}</ref>.
When items are scored dichotomously, as in [[Educational_assessment|exams with correct and incorrect answers]], the item-total correlation may be calculated as either a [[point-biserial correlation]] or a [[biserial correlation]]. This is considered important because items vary in difficulty and the point-biserial correlation cannot attain its theoretical maxima [+1,-1] unless the proportion correct is 0.50 (50% answering the item correctly). The biserial correlation has a correction that, in theory, avoids this issue.<ref name="Henrysson1963" /> In practice, analysts should choose either the point-biserial or biserial and not try to compare, because the correction of the biserial will always produce a slightly larger magnitude as compared to the point-biserial.<ref name="AllenYen1979">Allen, M.J., & Yen, W. M. (1979) ''Introduction to Measurement Theory'', Wadsworth. {{ISBN|0-8185-0283-5}}</ref>


The [[item-reliability index]] (IRI) is defined as the product of the point-biserial item-total correlation and the item standard deviation. In [[classical test theory]], the IRI indexes the degree to which an item contributes true score variance to the exam score. In practice, a negative IRI indicates the relative degree which an item damages the reliability estimate and a positive value indicates the relative degree which it contributes towards a high reliability estimate<ref>Allen, M.J., & Yen, W. M. (1979) ''Introduction to Measurement Theory'', Wadsworth. {{ISBN|0-8185-0283-5}}</ref>.
The [[item-reliability index]] (IRI) is defined as the product of the point-biserial item-total correlation and the item standard deviation. In [[classical test theory]], the IRI indexes the degree to which an item contributes true score variance to the exam observed score variance. In practice, a negative IRI indicates the relative degree which an item damages the reliability estimate and a positive value indicates the relative degree which it contributes towards a high reliability estimate.<ref name="AllenYen1979"/>


==See also==
==See also==
*[[Scale analysis (statistics)]]
*[[Scale analysis (statistics)|Scale analysis]]
*[[Item analysis]]
*[[Classical test theory]]
*[[Likert scaling]]


== References ==
== References ==

Latest revision as of 06:13, 9 May 2024

The item–total correlation is the correlation between a scored item and the total test score. It is an item statistic used in psychometric analysis to diagnose assessment items that fail to indicate the underlying psychological trait so that they can be removed or revised.[1]

The item-total correlation in item analysis

[edit]

In item analysis, an item–total correlation is usually calculated for each item of a scale or test to diagnose the degree to which assessment items indicate the underlying trait. Assuming that most of the items of an assessment do indicate the underlying trait, each item should have a reasonably strong positive correlation with the total score on that assessment. An important goal of item analysis is to identify and remove or revise items that are not good indicators of the underlying trait.[2]

A small or negative item-correlation provides empirical evidence that the item is not measuring the same construct measured by the assessment. Exact values depend on the type of measure, but as a heuristic, a correlation value less than 0.2 indicates that the corresponding item does not correlate very well with the scale overall and, thus, it may be dropped. A negative value indicates that the item may be damaging the overall psychometric reliability of the measure. [3][4] Identifying and removing (or revising) poorly-performing items is a critical way that psychometric analysis can improve the quality of a measure.

When items are scored dichotomously, as in exams with correct and incorrect answers, the item-total correlation may be calculated as either a point-biserial correlation or a biserial correlation. This is considered important because items vary in difficulty and the point-biserial correlation cannot attain its theoretical maxima [+1,-1] unless the proportion correct is 0.50 (50% answering the item correctly). The biserial correlation has a correction that, in theory, avoids this issue.[1] In practice, analysts should choose either the point-biserial or biserial and not try to compare, because the correction of the biserial will always produce a slightly larger magnitude as compared to the point-biserial.[5]

The item-reliability index (IRI) is defined as the product of the point-biserial item-total correlation and the item standard deviation. In classical test theory, the IRI indexes the degree to which an item contributes true score variance to the exam observed score variance. In practice, a negative IRI indicates the relative degree which an item damages the reliability estimate and a positive value indicates the relative degree which it contributes towards a high reliability estimate.[5]

See also

[edit]

References

[edit]
  1. ^ a b Henrysson, Sten (1963-06-01). "Correction of item-total correlations in item analysis". Psychometrika. 28 (2): 211–218. doi:10.1007/BF02289618. ISSN 1860-0980. S2CID 120534016.
  2. ^ Churchill, G.A., (1979). "A paradigm for developing better measures of marketing constructs", Journal of Marketing Research, 16(1) pp 64–73, doi:10.1177/002224377901600110, JSTOR 3150876
  3. ^ Everitt, B.S. (2002) The Cambridge Dictionary of Statistics, 2nd Edition, CUP. ISBN 0-521-81099-X
  4. ^ Field, A., (2005). Discovering Statistics Using SPSS. 2nd ed. London: Sage
  5. ^ a b Allen, M.J., & Yen, W. M. (1979) Introduction to Measurement Theory, Wadsworth. ISBN 0-8185-0283-5