Carissimo, A, Cutillo, L orcid.org/0000-0002-2205-0338 and Feis, ID (2018) Validation of community robustness. Computational Statistics & Data Analysis, 120. pp. 1-24. ISSN 0167-9473
Abstract
The large amount of work on community detection and its applications leaves unaddressed one important question: the statistical validation of the results. A methodology is presented that is able to clearly detect if the community structure found by some algorithms is statistically significant or is a result of chance, merely due to edge positions in the network. Given a community detection method and a network of interest, the proposal examines the stability of the partition recovered against random perturbations of the original graph structure. To address this issue, a perturbation strategy and a null model graph, which matches the original in some of its structural properties, but is otherwise a random graph, is specified. A set of procedures is built based on a special measure of clustering distance, namely Variation of Information, using tools set up for functional data analysis. The procedures determine whether the obtained clustering departs significantly from the null model. This strongly supports the robustness against perturbation of the algorithm used to identify the community structure. Results obtained with the proposed technique on simulated and real datasets are shown and discussed.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Keywords: | Community; Network; Variation of information; Multiple testing |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Mathematics (Leeds) > Statistics (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 20 Mar 2019 13:57 |
Last Modified: | 07 Feb 2020 14:48 |
Status: | Published |
Publisher: | Elsevier |
Identification Number: | 10.1016/j.csda.2017.10.006 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:143860 |