2507.02774


Connected k-Median with Disjoint and Non-disjoint Clusters

Authors: Jan Eube, Kelin Luo, Dorian Reineccius, Heiko Röglin, Melanie Schmidt

The connected k-median problem is a constrained clustering problem that combines distance-based k-clustering with connectivity information. The input consists of a metric space and an unweighted, undirected connectivity graph G that may be completely unrelated to the metric space. The goal is to compute k centers and corresponding clusters such that each cluster forms a connected subgraph of G and the k-median cost is minimized. The problem has applications in very different fields such as geodesy (particularly districting), social network analysis (especially community detection), and bioinformatics. We study a version with overlapping clusters, in which points can be part of multiple clusters, which is natural for the use case of community detection. This problem variant is $\Omega(\log n)$-hard to approximate, and our main result is an $O(k^2 \log n)$-approximation algorithm for the problem. We complement it with an $\Omega(n^{1-\epsilon})$-hardness result for the case of disjoint clusters without overlap with general connectivity graphs, as well as an exact algorithm in this setting if the connectivity graph is a tree.
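To make the problem statement concrete, the following minimal Python sketch checks feasibility and evaluates the cost of a given solution in the disjoint setting. It is an illustration only, not the paper's algorithm. The function names, the dictionary-based input format, the requirement that each center lies in its own cluster, and the convention that each point pays its distance to the center of its cluster are all assumptions made for this example.

```python
from collections import deque


def is_connected(cluster, adj):
    """Return True if `cluster` induces a connected subgraph of graph `adj`."""
    cluster = set(cluster)
    if not cluster:
        return False
    start = next(iter(cluster))
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            # Only walk along edges that stay inside the cluster.
            if v in cluster and v not in seen:
                seen.add(v)
                queue.append(v)
    return seen == cluster


def connected_k_median_cost(dist, adj, clusters):
    """Cost of a disjoint solution, or None if it is infeasible.

    dist:     dist[p][q] is the metric distance between points p and q.
    adj:      adjacency lists of the connectivity graph (keys are all points).
    clusters: hypothetical format, a dict mapping each center to its cluster.
    """
    points = set(adj)
    assigned = [p for cluster in clusters.values() for p in cluster]
    # Disjoint setting: every point lies in exactly one cluster.
    if len(assigned) != len(set(assigned)) or set(assigned) != points:
        return None
    total = 0
    for center, cluster in clusters.items():
        # Assumption: a center belongs to its own cluster.
        if center not in cluster or not is_connected(cluster, adj):
            return None
        total += sum(dist[center][p] for p in cluster)
    return total


# Four points on a line; the connectivity graph is the path 0 - 1 - 2 - 3,
# chosen so that it disagrees with the metric.
pos = [0, 10, 1, 11]
dist = [[abs(a - b) for b in pos] for a in pos]
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}

# Cheap in the metric, but neither cluster is connected in the graph.
metric_only = connected_k_median_cost(dist, adj, {0: {0, 2}, 1: {1, 3}})
# Connected in the graph, at a higher k-median cost.
connected = connected_k_median_cost(dist, adj, {0: {0, 1}, 2: {2, 3}})
```

In this toy instance the clustering that groups metrically close points is rejected because its clusters are not connected in the graph, while the feasible clustering pairs points that are far apart in the metric. This is the tension between the metric and the connectivity graph that the abstract describes.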

Subject: Data Structures and Algorithms

Publish: 2025-07-03 16:35:35 UTC