Jump to content

Correlation coefficient: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
AnomieBOT (talk | contribs)
m Dating maintenance tags: {{Cn}}
Line 1: Line 1:
A '''correlation coefficient''' is a [[numerical measure]] of some type of [[correlation and dependence|correlation]], meaning a statistical relationship between two [[variable (mathematics)|variables]].<ref>{{cite web |url=http://www.ncme.org/ncme/NCME/Resource_Center/Glossary/NCME/Resource_Center/Glossary1.aspx?hkey=4bb87415-44dc-4088-9ed9-e8515326a061#anchorC |title=correlation coefficient |author=<!--Not stated--> |date= |website=NCME.org |publisher=[[National Council on Measurement in Education]] |access-date=April 17, 2014 |quote=correlation coefficient: A statistic used to show how the scores from one measure relate to scores on a second measure for the same group of individuals. A high value (approaching +1.00) is a strong direct relationship, values near 0.50 are considered moderate and values below 0.30 are considered to show weak relationship. A low negative value (approaching -1.00) is similarly a strong inverse relationship, and values near 0.00 indicate little, if any, relationship. |archive-url=https://web.archive.org/web/20170722194028/http://www.ncme.org/ncme/NCME/Resource_Center/Glossary/NCME/Resource_Center/Glossary1.aspx?hkey=4bb87415-44dc-4088-9ed9-e8515326a061#anchorC |archive-date=July 22, 2017 |dead-url=yes }}</ref> The variables may be two [[column (database)|column]]s of a given [[data set]] of observations, often called a [[sample (statistics)|sample]], or two components of a [[multivariate random variable]] with a known [[distribution (statistics)|distribution]].{{cn}}
A '''correlation coefficient''' is a [[numerical measure]] of some type of [[correlation and dependence|correlation]], meaning a statistical relationship between two [[variable (mathematics)|variables]].<ref>{{cite web |url=http://www.ncme.org/ncme/NCME/Resource_Center/Glossary/NCME/Resource_Center/Glossary1.aspx?hkey=4bb87415-44dc-4088-9ed9-e8515326a061#anchorC |title=correlation coefficient |author=<!--Not stated--> |date= |website=NCME.org |publisher=[[National Council on Measurement in Education]] |access-date=April 17, 2014 |quote=correlation coefficient: A statistic used to show how the scores from one measure relate to scores on a second measure for the same group of individuals. A high value (approaching +1.00) is a strong direct relationship, values near 0.50 are considered moderate and values below 0.30 are considered to show weak relationship. A low negative value (approaching -1.00) is similarly a strong inverse relationship, and values near 0.00 indicate little, if any, relationship. |archive-url=https://web.archive.org/web/20170722194028/http://www.ncme.org/ncme/NCME/Resource_Center/Glossary/NCME/Resource_Center/Glossary1.aspx?hkey=4bb87415-44dc-4088-9ed9-e8515326a061#anchorC |archive-date=July 22, 2017 |dead-url=yes }}</ref> The variables may be two [[column (database)|column]]s of a given [[data set]] of observations, often called a [[sample (statistics)|sample]], or two components of a [[multivariate random variable]] with a known [[distribution (statistics)|distribution]].{{cn|date=July 2019}}


Several types of correlation coefficient exist, each with their own definition and own range of usability and characteristics. They all assume values in the range from −1 to +1, where ±1 indicates the strongest possible agreement and 0 the strongest possible disagreement.<ref>{{cite book |last1=Taylor |first1=John R. |title=An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements |date=1997 |publisher=University Science Books |location=Sausalito, CA |isbn=0-935702-75-X |page=217 |edition=2nd |url=http://faculty.kfupm.edu.sa/phys/aanaqvi/Taylor-An%20Introduction%20to%20Error%20Analysis.pdf |accessdate=14 February 2019}}</ref> As tools of analysis, correlation coefficients present certain problems, including the propensity of some types to be distorted by [[outliers]] and the possibility of incorrectly being used to infer a [[causal relationship]] between the variables.<ref name="Boddy">{{cite book|last1=Boddy|first1=Richard |last2=Smith|first2=Gordon |title=Statistical methods in practice: for scientists and technologists |date=2009|publisher=Wiley|location=Chichester, U.K.|isbn=978-0-470-74664-6|pages=95-96}}</ref>
Several types of correlation coefficient exist, each with their own definition and own range of usability and characteristics. They all assume values in the range from −1 to +1, where ±1 indicates the strongest possible agreement and 0 the strongest possible disagreement.<ref>{{cite book |last1=Taylor |first1=John R. |title=An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements |date=1997 |publisher=University Science Books |location=Sausalito, CA |isbn=0-935702-75-X |page=217 |edition=2nd |url=http://faculty.kfupm.edu.sa/phys/aanaqvi/Taylor-An%20Introduction%20to%20Error%20Analysis.pdf |accessdate=14 February 2019}}</ref> As tools of analysis, correlation coefficients present certain problems, including the propensity of some types to be distorted by [[outliers]] and the possibility of incorrectly being used to infer a [[causal relationship]] between the variables.<ref name="Boddy">{{cite book|last1=Boddy|first1=Richard |last2=Smith|first2=Gordon |title=Statistical methods in practice: for scientists and technologists |date=2009|publisher=Wiley|location=Chichester, U.K.|isbn=978-0-470-74664-6|pages=95-96}}</ref>

Revision as of 18:25, 31 July 2019

A correlation coefficient is a numerical measure of some type of correlation, meaning a statistical relationship between two variables.[1] The variables may be two columns of a given data set of observations, often called a sample, or two components of a multivariate random variable with a known distribution.[citation needed]

Several types of correlation coefficient exist, each with their own definition and own range of usability and characteristics. They all assume values in the range from −1 to +1, where ±1 indicates the strongest possible agreement and 0 the strongest possible disagreement.[2] As tools of analysis, correlation coefficients present certain problems, including the propensity of some types to be distorted by outliers and the possibility of incorrectly being used to infer a causal relationship between the variables.[3]

Types

Pearson

The Pearson product-moment correlation coefficient, also known as r, R, or Pearson's r, is a measure of the strength and direction of the linear relationship between two variables that is defined as the covariance of the variables divided by the product of their standard deviations. This is the best known and most commonly used type of correlation coefficient; when the term "correlation coefficient" is used without further qualification, it usually refers to the Pearson product-moment correlation coefficient

Intra-class

Intraclass correlation (ICC) is a descriptive statistic that can be used when quantitative measurements are made on units that are organized into groups; it describes how strongly units in the same group resemble each other.

Rank

Rank correlation is a measure of the relationship between the rankings of two variables or two rankings of the same variable:

See also

References

  1. ^ "correlation coefficient". NCME.org. National Council on Measurement in Education. Archived from the original on July 22, 2017. Retrieved April 17, 2014. correlation coefficient: A statistic used to show how the scores from one measure relate to scores on a second measure for the same group of individuals. A high value (approaching +1.00) is a strong direct relationship, values near 0.50 are considered moderate and values below 0.30 are considered to show weak relationship. A low negative value (approaching -1.00) is similarly a strong inverse relationship, and values near 0.00 indicate little, if any, relationship. {{cite web}}: Unknown parameter |dead-url= ignored (|url-status= suggested) (help)
  2. ^ Taylor, John R. (1997). An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements (PDF) (2nd ed.). Sausalito, CA: University Science Books. p. 217. ISBN 0-935702-75-X. Retrieved 14 February 2019.
  3. ^ Boddy, Richard; Smith, Gordon (2009). Statistical methods in practice: for scientists and technologists. Chichester, U.K.: Wiley. pp. 95–96. ISBN 978-0-470-74664-6.