next up previous contents
Next: Self-Organizing Map (SOM) Up: Algorithms Previous: Weighted Graph Partitioning (GP)   Contents

Hypergraph Partitioning (HGP)

A hypergraph is a graph whose edges can connect more than two vertices (hyperedges). The clustering problem is then formulated as a finding the minimum-cut of a hypergraph. A minimum-cut is the removal of the set of hyperedges (with minimum edge weight) that separates the hypergraph into $ k$ unconnected components. Again, an object $ \mathbf{x}_j$ maps to a vertex $ v_j$. Each word (feature) maps to a hyperedge connecting all vertices with non-zero frequency count of this word. The weight of this hyperedge is chosen to be the total number of occurrences in the data-set. Hence, the importance of a hyperedge during partitioning is proportional to the occurrence of the corresponding word. The minimum-cut of this hypergraph into $ k$ unconnected components gives the desired clustering. We employ the HMETIS package for partitioning. An advantage of this approach is that the clustering problem can be mapped to a graph problem without the explicit computation of similarity, which makes this approach computationally efficient with $ O(n \cdot d \cdot k)$ assuming a (close to) linear performing hypergraph partitioner. Note that, sample-wise frequency information gets lost in this formulation since there is only a single weight associated with a hyperedge.


next up previous contents
Next: Self-Organizing Map (SOM) Up: Algorithms Previous: Weighted Graph Partitioning (GP)   Contents
Alexander Strehl 2002-05-03