Jump to content

H-index: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
Line 10: Line 10:
In other words, a scholar with an index of ''h'' has published ''h'' papers with at least ''h'' citations each.[http://pda.physorg.com/lofi-news-hindex-says-papers_7971.html] Thus, the ''h''-index is the result of the balance between the number of publications and the number of citations per publication. The index is designed to improve upon simpler measures such as the total number of citations or publications, to distinguish truly influential scientists from those who simply publish many papers. The index is also not affected by single papers that have many citations. The index works properly only for comparing scientists working in the same field; citation conventions differ widely among different fields.
In other words, a scholar with an index of ''h'' has published ''h'' papers with at least ''h'' citations each.[http://pda.physorg.com/lofi-news-hindex-says-papers_7971.html] Thus, the ''h''-index is the result of the balance between the number of publications and the number of citations per publication. The index is designed to improve upon simpler measures such as the total number of citations or publications, to distinguish truly influential scientists from those who simply publish many papers. The index is also not affected by single papers that have many citations. The index works properly only for comparing scientists working in the same field; citation conventions differ widely among different fields.


Online web programs are available to directly calculate a scientist's ''h''-index: [http://quadsearch.csd.auth.gr/index.php?lan=1&s=2 h-Index Calculation and Charts on QuadSearch: Metasearch Engine], [http://www.brics.dk/~mis/hnumber.html h-index calculator,]and [http://www.epidemiologic.org/2006/12/h-index-calculator-of-scientist-impact.html h-index calculator mirror]. The ''h''-index can also be manually determined using free Internet databases, such as [[Google Scholar]], and serves as an alternative to more traditional journal [[impact factor]] metrics, which are not freely available. Because only the most highly cited articles contribute to the ''h''-index, its determination is a relatively simpler process. Hirsch has demonstrated that <math>h</math> has high predictive value for whether a scientist has won honors like [[National academy|National Academy]] membership or the [[Nobel Prize]]. In [[physics]], a moderately productive scientist should have an <math>h</math> equal to the number of years of service while biomedical scientists tend to have higher values.
Online web programs are available to directly calculate a scientist's ''h''-index: [http://quadsearch.csd.auth.gr/index.php?lan=1&s=2 h-Index Calculation and Charts on QuadSearch: Metasearch Engine], [http://www.brics.dk/~mis/hnumber.html h-index calculator,]and [http://www.epidemiologic.org/2006/12/h-index-calculator-of-scientist-impact.html h-index calculator mirror]. The ''h''-index is calculated when citation overview is generatted with [[Scopus]]. The ''h''-index can also be manually determined using free Internet databases, such as [[Google Scholar]], and serves as an alternative to more traditional journal [[impact factor]] metrics, which are not freely available. Because only the most highly cited articles contribute to the ''h''-index, its determination is a relatively simpler process. Hirsch has demonstrated that <math>h</math> has high predictive value for whether a scientist has won honors like [[National academy|National Academy]] membership or the [[Nobel Prize]]. In [[physics]], a moderately productive scientist should have an <math>h</math> equal to the number of years of service while biomedical scientists tend to have higher values.


===Advantages===
===Advantages===

Revision as of 18:37, 4 September 2007

The h-index is an index that quantifies scientific productivity of a scientist based on the number of papers published by the scientist and on how often these papers are cited in papers written by other scientists. It can also apply to the productivity of a group of scientists, such as a department or university or country. The index was suggested in 2005 by Jorge E. Hirsch as a tool for determining theoretical physicists' productivity and is sometimes called the Hirsch index or Hirsch number. The h-index is currently not a widely accepted measure of scientific productivity, but only one of a number of possible indices.[citation needed]

Definition and purpose

The index is calculated based on the distribution of citations received by a given researcher's publications. Hirsch writes:

A scientist has index h if h of his Np papers have at least h citations each, and the other (Np - h) papers have at most h citations each.

In other words, a scholar with an index of h has published h papers with at least h citations each.[1] Thus, the h-index is the result of the balance between the number of publications and the number of citations per publication. The index is designed to improve upon simpler measures such as the total number of citations or publications, to distinguish truly influential scientists from those who simply publish many papers. The index is also not affected by single papers that have many citations. The index works properly only for comparing scientists working in the same field; citation conventions differ widely among different fields.

Online web programs are available to directly calculate a scientist's h-index: h-Index Calculation and Charts on QuadSearch: Metasearch Engine, h-index calculator,and h-index calculator mirror. The h-index is calculated when citation overview is generatted with Scopus. The h-index can also be manually determined using free Internet databases, such as Google Scholar, and serves as an alternative to more traditional journal impact factor metrics, which are not freely available. Because only the most highly cited articles contribute to the h-index, its determination is a relatively simpler process. Hirsch has demonstrated that has high predictive value for whether a scientist has won honors like National Academy membership or the Nobel Prize. In physics, a moderately productive scientist should have an equal to the number of years of service while biomedical scientists tend to have higher values.

Advantages

The main disadvantages of the old bibliometric indicators, such as total number of papers or total number of citations are that the former does not account for the quality of scientific publications, while the latter is disproportionately affected by participation in a single publication of major influence. The h-index is intended to measure simultaneously the quality and sustainability of scientific output, as well as, to some extent, the diversity of scientific research. For instance, the h-index is much less affected by methodological papers proposing successful new techniques, methods or approximations. For example, one of the most cited condensed matter theorists, John P. Perdew, has been very successful in devising new approximations within the widely used density functional theory. He has published 3 papers cited more than 5000 times and 2 cited more than 4000 times. Several thousand papers utilizing the density functional theory are published every year, most of them citing at least one paper of J.P. Perdew. His total citation index is close to 39 000, while his h-index is large, 51, but not unique. In contrast, the condensed matter theorist with the highest h-index (94), Marvin L. Cohen, has a lower citation index of 35 000. One can argue that in this case the h-index reflects the broader impact of Cohen's paper in solid state physics.

The h-index can also be calculated as a function of time, in two different ways. It was originally proposed by Hirsch that h depends linearly on the age of a researcher; in this case the time derivative allows to compare scientists of different age. Another possibility is to calculate h using papers published within a particular time period, for instance, within the last 10 years, thus measuring the current productivity as opposed to the lifetime achievement.

Criticism

It is not difficult to come up with situations in which may provide misleading information about a scientist's output. Most importantly the fact that is bounded by the total number of publications means that scientists with a short career are at an inherent disadvantage, regardless of the importance of their discoveries. For example, Évariste Galois' h-index is 2, and will remain so forever. Had Albert Einstein died in early 1906, his h-index would be stuck at 4 or 5, despite his being widely acknowledged as one of the most important physicists, even considering only his publications to that date.

Additionally, some potential drawbacks of the impact factor apply equally to the h-index. For example, review articles are usually more cited than original articles, so a hypothetic author who would only write review articles would have a higher h-index than authors who would actually contribute original research.

Furthermore, it was pointed out that while the h-index de-emphasizes singular successful publications in favor of sustained productivity, it does so too strongly. Indeed, two scientists may have the same h-index, say, , but one has 10 papers cited more than 200 times, and the other has none. Clearly scientific output of the former is more valuable. Several recipes to correct for that have been proposed, but none has gained a universal recognition.

The h-index is also affected by implementation issues. Some automated searching processes find citations to papers going back many years, while others will only yield recent papers or citations. This issue is less important for more people whose publication record started after automated indexing begun around 1990. A manual search of a citation source may identify a substantial number of citations that are not quite correct, and therefore not automatically matched to the correct paper.

General problems associated with any bibliometric index, namely the necessity to measure scientific impact by one number, apply here as well. For instance, comparing two condensed matter theorists with the highest h-index, Marvin Cohen and Philip Anderson, we observe that they have the same h-index within 3%, although the latter is a Nobel Prize winner and a founder of entire new fields in condensed matter theory.

While the h-index is one 'measure' of scientific productivity, some object that a human activity as complex as the formal acquisition of knowledge should be condensed down to a single numeric metric. Two potential dangers of this have been expressed:[citation needed]

  • career progression and other aspects of a human's life may be damaged by the use of a simple metric in a decision-making process by someone who has neither the time nor the intelligence to consider more appropriate decision metrics
  • Scientists may respond to this by maximising their h-index to the detriment of doing more justifiable work. This effect of using simple metrics for making management decisions has often been found to be an unintended consequence of metric-based decision taking; for instance, governments routinely operate policies designed to minimise not crime, but crime figures.

Modifications of h-index and m value

Various proposals to modify the h-index in order to emphasize different features have been made [1] [2] [3] [4].

Examples of ranking with h index

Some physicists with high h-indices

Based on the SPIRES HEP Database (Particle and High energy Physics, As of August 2005,):

Based on the ISI Web of Science, according to the original paper (all physics):

Based on the ISI Web of Knowledge (all fields):

Physical chemists from Berkeley and Stanford with high h-indices

The following is a list of some US physical chemists with high h-indices. It has been compiled from "The Everyday Scientist" and ISI Web of Science, using data for only professors at Stanford and U.C. Berkeley:

Biologists with high h-indices

A starting attempt was made at using the physics-oriented h-index for the life sciences (a blanket term for biology, botany, medicine, and so forth). Only ten names were listed in the reference, but if a more solid attempt is made this list will certainly grow. The following list is based on publications from 1983-2002.

Computer scientists with high h-indices

The following are five computer scientists with high h-indices (from http://www.cs.ucla.edu/~palsberg/h-number.html).

Economics researchers with high h-indices

The following are five economics researchers with high h-indices as measured by the University of Connecticut's RePEc Author Service (as of December 2006)[2]:

Scientists in other fields with high h-indices

See also

References

  • Hirsch, Jorge E., (2005), "An index to quantify an individual's scientific research output," PNAS 102(46):16569-16572, November 15 2005 (Free copy available from arXiv).
  • Sidiropoulos A., Katsaros D. and Manolopoulos Y., (2006), Generalized h-index for disclosing latent facts in citation networks.
  • Kelly C. D. and Jennions M.D. (2006), The h index and career assessment by numbers: a paper expounding certain problems of the h-index.
  • "Impact factor," Science 309:1181, 19 August 2005.
  • "An index to quantify an individual's scientific research output," PNAS 102(46):16569-16572, November 15 2005.
  • H values for Stanford p-chem professors from "The Everyday Scientist"
  • Lehmann S. L., Lautrup B. E., and Jackson A. D. (2006). "Measures for measures". Nature. 404 (7122): 1003–1004. doi:10.1038/4441003a. {{cite journal}}: Unknown parameter |month= ignored (help)CS1 maint: multiple names: authors list (link)

Notes

  1. ^ Sidiropoulos A., Katsaros D. and Manolopoulos Y., (2006), Generalized h-index for disclosing latent facts in citation networks
  2. ^ g-index
  3. ^ Jayant S Vaidya (2005). "V-index: A fairer index to quantify an individual 's research output capacity". BMJ. 331: 339-c – 1340-c. {{cite journal}}: Unknown parameter |month= ignored (help)
  4. ^ Katsaros D., Sidiropoulos A., Manolopous Y., (2007), Age Decaying H-Index for Social Network of Citations in Proceedings of Workshop on Social Aspects of the Web Poznan, Poland, April 27, 2007