next up previous contents
Next: Representing Sets of Clusterings Up: Cluster Ensembles Previous: Objective Function for Cluster   Contents


Efficient Consensus Functions

In this section, we introduce three efficient heuristics to solve the cluster ensemble problem. All algorithms approach the problem by first transforming the set of clusterings into a hypergraph representation.

Cluster-based Similarity Partitioning Algorithm (CSPA).
A clustering signifies a relationship between objects in the same cluster and can thus be used to establish a measure of pairwise similarity. This induced similarity measure is then used to recluster the objects, yielding a combined clustering.
HyperGraph Partitioning Algorithm (HGPA).
In this algorithm, we approximate the maximum mutual information objective with a constrained minimum cut objective. Essentially, the cluster ensemble problem is posed as a partitioning problem of a suitably defined hypergraph where hyperedges represent clusters.
Meta-CLustering Algorithm (MCLA).
Here, the objective of integration is viewed as a cluster correspondence problem. Essentially, groups of clusters (meta-clusters) have to be identified and consolidated.
The following four subsections describe the common hypergraph representation, CSPA, HGPA, and MCLA. Section 5.3.5 discusses differences in the algorithms and evaluates their performance in a controlled experiment.



Subsections
next up previous contents
Next: Representing Sets of Clusterings Up: Cluster Ensembles Previous: Objective Function for Cluster   Contents
Alexander Strehl 2002-05-03