Evaluation
Contents
17.8. Evaluation#
There are two possible approaches for evaluation of a clustering algorithm.
Full reference
No reference
In full reference evaluation, a ground truth about an ideal clustering of the data is given. E.g., suppose our goal is to cluster a set of facial images of different persons. The ground truth in this case is the name/id of each person associated with each image.
17.8.1. Full Reference Evaluation#
Let
Assume that the dataset
can be divided into clusters.Let these clusters be named
.Assume that it is known which point belongs to which cluster.
In general a clustering
The clustering process may make a number of mistakes.
It may identify incorrect number of clusters and
may not be equal to .More-over even if
, the data points may be placed in wrong clusters.
Ideally, we want
All the labels can be put in a label vector
Following [85], we will quickly establish the commonly used measures for clustering performance.
We have a reference clustering of vectors in
given by which is known to us in advance (either by construction in synthetic experiments or as ground truth with real life data-sets).The clustering obtained by the algorithm is given by
.
For two arbitrary points
they belong to same cluster in both
and (true positive),they are in same cluster in
but different cluster in (false negative)they are in different clusters in
but in same cluster inthey are in different clusters in both
and (true negative).
Consider some cluster
The elements common to
and are given by .We define
We define the overall precision for
asWe define
.We define the overall recall for
asWe define the
score asWe define the overall
-score for asWe note that cluster
for which the maximum is achieved is best matching cluster for .Finally, we define the overall
-score for the clustering
where
We also define a clustering ratio given by the factor
There are different ways to define clustering error. For the special case where the number of clusters is known in advance, and we ensure that the data-set is divided into exactly those many clusters, it is possible to define subspace clustering error as follows:
The definition is adopted from [32] for comparing
the results in this paper with their results. This definition can be
used after a proper one-one mapping between original labels
and cluster labels assigned by the clustering algorithms has been
identified. We can compute this mapping by comparing