A similarity-based approach for data stream classification

Aguilar-Ruiz,Jesus; Mena Torres, Dayrelis; Aguilar-Ruiz, Jesús Salvador

Publication:
A similarity-based approach for data stream classification

Files

ESWA9106.pdf (979.8 KB)

Identifiers

URI: https://hdl.handle.net/10433/26331

DOI: 10.1016/j.eswa.2013.12.041

Publication date

2014-07-01

Authors

Aguilar-Ruiz,Jesus

Mena Torres, Dayrelis

Aguilar-Ruiz, Jesús Salvador

Publisher

Elsevier

Export

Abstract

Incremental learning techniques have been used extensively to address the data stream classification problem. The most important issue is to maintain a balance between accuracy and efficiency, i.e., the algorithm should provide good classification performance with a reasonable time response. This work introduces a new technique, named Similarity-based Data Stream Classifier (SimC), which achieves good performance by introducing a novel insertion/removal policy that adapts quickly to the data tendency and maintains a representative, small set of examples and estimators that guarantees good classification rates. The methodology is also able to detect novel classes/labels, during the running phase, and to remove useless ones that do not add any value to the classification process. Statistical tests were used to evaluate the model performance, from two points of view: efficacy (classification rate) and efficiency (online response time). Five well-known techniques and sixteen data streams were compared, using the Friedman’s test. Also, to find out which schemes were significantly different, the Nemenyi’s, Holm’s and Shaffer’s tests were considered. The results show that SimC is very competitive in terms of (absolute and streaming) accuracy, and classification/updating time, in comparison to several of the most popular methods in the literature.

Keywords

Data streams
Classification
Similarity

Bibliographic reference

Expert Systems with Applications Volume 41, Issue 9, July 2014, Pages 4224-4234

Collections

DDI - Artículos de revistas

Full item page

Publication:
A similarity-based approach for data stream classification

Files

Identifiers

Publication date

Reading date

Event date

Start date of the public exhibition period

End date of the public exhibition period

Authors

Advisors

Authors of photography

Person who provides the photography

Journal Title

Journal ISSN

Volume Title

Publisher

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Doctoral program

Related publication

Research projects

Description

Keywords

Bibliographic reference

Photography rights

Collections

Publication: A similarity-based approach for data stream classification

Files

Identifiers

Publication date

Reading date

Event date

Start date of the public exhibition period

End date of the public exhibition period

Authors

Advisors

Authors of photography

Person who provides the photography

Journal Title

Journal ISSN

Volume Title

Publisher

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Doctoral program

Related publication

Research projects

Description

Keywords

Bibliographic reference

Photography rights

Collections

Publication:
A similarity-based approach for data stream classification