High Dimensional Clustering with r-nets

Authors

  • Georgia Avarikioti ETH Zurich
  • Alain Ryser ETH Zurich
  • Yuyi Wang ETH Zurich
  • Roger Wattenhofer ETH Zurich

DOI:

https://doi.org/10.1609/aaai.v33i01.33013207

Abstract

Clustering, a fundamental task in data science and machine learning, groups a set of objects in such a way that objects in the same cluster are closer to each other than to those in other clusters. In this paper, we consider a well-known structure, so-called r-nets, which rigorously captures the properties of clustering. We devise algorithms that improve the runtime of approximating r-nets in high-dimensional spaces with1 and `2 metrics from, where . These algorithms are also used to improve a framework that provides approximate solutions to other high dimensional distance problems. Using this framework, several important related problems can also be solved efficiently, e.g.,pproximate kth-nearest neighbor distance-approximate Min-Max clustering,-approximate k-center clustering. In addition, we build an algorithm that-approximates greedy permutations in time O˜((dn+n2−α)·logΦ) where Φ is the spread of the input. This algorithm is used to -approximate k-center with the same time complexity.

Downloads

Published

2019-07-17

How to Cite

Avarikioti, G., Ryser, A., Wang, Y., & Wattenhofer, R. (2019). High Dimensional Clustering with r-nets. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3207-3214. https://doi.org/10.1609/aaai.v33i01.33013207

Issue

Section

AAAI Technical Track: Machine Learning