Jump to content

K-median problem: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
distinguish from k-means ... definition was the same
Line 1: Line 1:
The '''''k''-median problem''' is the problem of finding ''k'' centers such that the clusters formed by them are the most compact.
The '''''k''-median problem''' is the problem of finding ''k'' centers such that the clusters formed by them are the most compact.


Formally, given a set of data points ''x'', the ''k'' centers ''c''<sub>''i''</sub> are to be chosen so as to minimize the sum of the squares of the distances from ''x'' to the nearest&nbsp;''c''<sub>''i''</sub>.
Formally, given a set of data points ''x'', the ''k'' centers ''c''<sub>''i''</sub> are to be chosen so as to minimize the sum of the absolute values of the distances from each ''x'' to the nearest&nbsp;''c''<sub>''i''</sub>.


The problem constitutes a better measure for the [[k-means clustering|''k''-means clustering]] algorithm, and is widely used in applications such as [[facility location]]<ref>http://www.aladdin.cs.cmu.edu/reu/mini_probes/papers/facilitylocation.ppt</ref>.
The problem constitutes a better measure for the [[k-means clustering|''k''-means clustering]] algorithm, and is widely used in applications such as [[facility location]]<ref>http://www.aladdin.cs.cmu.edu/reu/mini_probes/papers/facilitylocation.ppt</ref>.
Line 8: Line 8:
{{reflist}}
{{reflist}}


{{math-stub}}
{{statistics-stub}}


[[Category:Statistics]]
[[Category:Statistics]]

Revision as of 13:15, 22 April 2010

The k-median problem is the problem of finding k centers such that the clusters formed by them are the most compact.

Formally, given a set of data points x, the k centers ci are to be chosen so as to minimize the sum of the absolute values of the distances from each x to the nearest ci.

The problem constitutes a better measure for the k-means clustering algorithm, and is widely used in applications such as facility location[1].

References