Hierarchical Clustering: A Nearly-Optimal Construction for Well-Clustered Graphs

Steinar Laenen, Bogdan Manghiuc, He Sun

July 2023

Abstract

This paper studies efficient algorithms for constructing hierarchical clustering (HC) with respect to Dasgupta’s cost function. For any input graph G with a clear cluster-structure, our presented algorithm runs in nearly-linear time in the input size of G, and returns an O(1)-approximate HC tree with respect to Dasgupta’s cost function; hence both the runtime and approximation ratio are optimal up to some poly-logarithmic factors. We further compare the performance of our algorithm against the previous state-of-the-art on different datasets, and report the experimental results.

Type

Conference paper

Publication

In International Conference on Machine Learning 2023

Hierarchical Clustering: A Nearly-Optimal Construction for Well-Clustered Graphs

Abstract

Steinar Laenen

PhD Candidate Computer Science