Two-Stage Sparse Representation Clustering for Dynamic Data Streams

Chen, Jie, Wang, Zhu, Yang, Shengxiang and Mao, Hua (2022) Two-Stage Sparse Representation Clustering for Dynamic Data Streams. IEEE Transactions on Cybernetics. ISSN 2168-2267 (In Press)

[img]
Preview
Text
IEEETCYB22.pdf - Accepted Version

Download (2MB) | Preview
Official URL: https://doi.org/10.1109/tcyb.2022.3204894

Abstract

Data streams are a potentially unbounded sequence of data objects, and the clustering of such data is an effective way of identifying their underlying patterns. Existing data stream clustering algorithms face two critical issues: 1) evaluating the relationship among data objects with individual landmark windows of fixed size and 2) passing useful knowledge from previous landmark windows to the current landmark window. Based on sparse representation techniques, this article proposes a two-stage sparse representation clustering (TSSRC) method. The novelty of the proposed TSSRC algorithm comes from evaluating the effective relationship among data objects in the landmark windows with an accurate number of clusters. First, the proposed algorithm evaluates the relationship among data objects using sparse representation techniques. The dictionary and sparse representations are iteratively updated by solving a convex optimization problem. Second, the proposed TSSRC algorithm presents a dictionary initialization strategy that seeks representative data objects by making full use of the sparse representation results. This efficiently passes previously learned knowledge to the current landmark window over time. Moreover, the convergence and sparse stability of TSSRC can be theoretically guaranteed in continuous landmark windows under certain conditions. Experimental results on benchmark datasets demonstrate the effectiveness and robustness of TSSRC.

Item Type: Article
Uncontrolled Keywords: Electrical and Electronic Engineering, Computer Science Applications, Human-Computer Interaction, Information Systems, Control and Systems Engineering, Software
Subjects: G400 Computer Science
G500 Information Systems
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Rachel Branson
Date Deposited: 14 Oct 2022 13:04
Last Modified: 14 Oct 2022 13:15
URI: https://nrl.northumbria.ac.uk/id/eprint/50392

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics