Publication:
Pattern sequence-based algorithm for multivariate big data time series forecasting: Application to electricity consumption

dc.contributor.author: Pérez Chacón, Rubén
dc.contributor.author: Asencio Cortés, Gualberto
dc.contributor.author: Martínez-Álvarez, Francisco
dc.contributor.author: Troncoso, Alicia
dc.date.accessioned: 2024-02-06T13:42:28Z
dc.date.available: 2024-02-06T13:42:28Z
dc.date.issued: 2024-01-22
dc.description.abstract: Several interrelated variables typically characterize real-world processes, and a time series cannot be predicted without considering the influence that other time series might have on the target time series. This work proposes a novel algorithm to forecast multivariate big data time series. This new general-purpose approach consists first of a previous pattern recognition performed jointly using all time series that form the multivariate time series and then predicts the target time series by searching for similarities between pattern sequences. The proposed algorithm is designed to tackle multivariate time series forecasting problems within the context of big data. In particular, the algorithm has been developed with a distributed nature to enhance its efficiency in analyzing and processing large volumes of data. Moreover, the algorithm is straightforward to use, with only two parameters needing adjustment. Another advantage of the MV-bigPSF algorithm is its ability to perform multi-step forecasting, which is particularly useful in many practical applications. To evaluate the algorithm’s performance, real-world data from Uruguay’s power consumption has been utilized. Specifically, MV-bigPSF has been compared with both univariate and multivariate methods. Regarding the univariate ones, MV-bigPSF improved 12.8% in MAPE compared to the second-best method. Regarding the multivariate comparison, MV-bigPSF improved 44.8% in MAPE with respect to the second most accurate method. Regarding efficiency, the execution time of MV-bigPSF was 1.83 times faster than the second-fastest multivariate method, both in a single-core environment. Therefore, the proposed algorithm can be a valuable tool for practitioners and researchers working in multivariate time series forecasting, particularly in big data applications.
dc.description.sponsorship: Data Science & Big Data Lab
dc.format.mimetype: application/pdf
dc.identifier.doi: 10.1016/j.future.2023.12.021
dc.identifier.uri: https://hdl.handle.net/10433/19791
dc.language.iso: en
dc.publisher: Elsevier
dc.rights: Attribution-ShareAlike 4.0 International (en)
dc.rights.accessRights: open access
dc.rights.uri: http://creativecommons.org/licenses/by-sa/4.0/
dc.subject: Multivariate analysis
dc.subject: Big Data
dc.subject: Time Series Forecasting
dc.subject: Pattern Sequence Forecasting
dc.subject: Electricity Consumption
dc.title: Pattern sequence-based algorithm for multivariate big data time series forecasting: Application to electricity consumption
dc.type: journal article
dc.type.hasVersion: AM
dspace.entity.type: Publication
relation.isAuthorOfPublication: 6ced30e5-fdea-43dc-9ad8-acb8c9d18fa0
relation.isAuthorOfPublication: 81e98c02-1e64-490c-8131-df9e19722d6f
relation.isAuthorOfPublication: 26bf4f66-a7bd-460f-aba1-234cab99b9e0
relation.isAuthorOfPublication: 5dfece1b-990d-4744-b597-0bdc0fd52e2b
relation.isAuthorOfPublication.latestForDiscovery: 81e98c02-1e64-490c-8131-df9e19722d6f

Files

Original bundle

Name: 1.pdf
Size: 5.42 MB
Format: Adobe Portable Document Format