Publication:
A similarity-based approach for data stream classification

dc.contributor.authorAguilar-Ruiz,Jesus
dc.contributor.authorMena Torres, Dayrelis
dc.contributor.authorAguilar-Ruiz, Jesús Salvador
dc.date.accessioned2026-03-02T10:57:29Z
dc.date.available2026-03-02T10:57:29Z
dc.date.issued2014-07-01
dc.description.abstractIncremental learning techniques have been used extensively to address the data stream classification problem. The most important issue is to maintain a balance between accuracy and efficiency, i.e., the algorithm should provide good classification performance with a reasonable time response. This work introduces a new technique, named Similarity-based Data Stream Classifier (SimC), which achieves good performance by introducing a novel insertion/removal policy that adapts quickly to the data tendency and maintains a representative, small set of examples and estimators that guarantees good classification rates. The methodology is also able to detect novel classes/labels, during the running phase, and to remove useless ones that do not add any value to the classification process. Statistical tests were used to evaluate the model performance, from two points of view: efficacy (classification rate) and efficiency (online response time). Five well-known techniques and sixteen data streams were compared, using the Friedman’s test. Also, to find out which schemes were significantly different, the Nemenyi’s, Holm’s and Shaffer’s tests were considered. The results show that SimC is very competitive in terms of (absolute and streaming) accuracy, and classification/updating time, in comparison to several of the most popular methods in the literature.
dc.description.sponsorshipDeporte e Informática
dc.format.mimetypeapplication/pdf
dc.identifier.citationExpert Systems with Applications Volume 41, Issue 9, July 2014, Pages 4224-4234
dc.identifier.doi10.1016/j.eswa.2013.12.041
dc.identifier.urihttps://hdl.handle.net/10433/26331
dc.language.isoen
dc.publisherElsevier
dc.rights.accessRightsrestricted access
dc.subjectData streams
dc.subjectClassification
dc.subjectSimilarity
dc.titleA similarity-based approach for data stream classification
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication5d0a50b3-624e-4b73-914e-0464ccd668dc
relation.isAuthorOfPublication5ca8a962-86a4-4465-aad6-508a8e70adc7
relation.isAuthorOfPublication.latestForDiscovery5d0a50b3-624e-4b73-914e-0464ccd668dc

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ESWA9106.pdf
Size:
979.8 KB
Format:
Adobe Portable Document Format