DAC-HPP: deep attributed clustering with high-order proximity preserve

, , & (2023) DAC-HPP: deep attributed clustering with high-order proximity preserve. Neural Computing and Applications, 35(34), pp. 24493-24511.

Open access copy at publisher website

Description

Attributed graph clustering, the task of grouping nodes into communities using both graph structure and node attributes, is a fundamental problem in graph analysis. Recent approaches have utilized deep learning for node embedding followed by conventional clustering methods. However, these methods often suffer from the limitations of relying on the original network structure, which may be inadequate for clustering due to sparsity and noise, and using separate approaches that yield suboptimal embeddings for clustering. To address these limitations, we propose a novel method called Deep Attributed Clustering with High-order Proximity Preserve (DAC-HPP) for attributed graph clustering. DAC-HPP leverages an end-to-end deep clustering framework that integrates high-order proximities and fosters structural cohesiveness and attribute homogeneity. We introduce a modified Random Walk with Restart that captures k-order structural and attribute information, enabling the modelling of interactions between network structure and high-order proximities. A consensus matrix representation is constructed by combining diverse proximity measures, and a deep joint clustering approach is employed to leverage the complementary strengths of embedding and clustering. In summary, DAC-HPP offers a unique solution for attributed graph clustering by incorporating high-order proximities and employing an end-to-end deep clustering framework. Extensive experiments demonstrate its effectiveness, showcasing its superiority over existing methods. Evaluation on synthetic and real networks demonstrates that DAC-HPP outperforms seven state-of-the-art approaches, confirming its potential for advancing attributed graph clustering research.

Impact and interest:

10 citations in Scopus
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 244679
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
ORCID iD:
Li, Yuefengorcid.org/0000-0002-3594-8980
Xu, Yueorcid.org/0000-0002-1137-0272
Additional Information: Funding: Open Access funding enabled and organized by CAUL and its Member Institutions. The authors have not disclosed any funding.
Measurements or Duration: 19 pages
Keywords: Attributed graph clustering, Attributed networks, Deep clustering, Random walk with restart
DOI: 10.1007/s00521-023-09052-4
ISSN: 0941-0643
Pure ID: 150863807
Divisions: Current > QUT Faculties and Divisions > Faculty of Science
Current > Schools > School of Computer Science
Copyright Owner: 2023 The Authors
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 24 Nov 2023 02:17
Last Modified: 27 Jul 2024 20:21