Pérez Chacón, RubénAsencio Cortés, GualbertoMartínez-Álvarez, FranciscoTroncoso, Alicia2024-02-062024-02-062020-07-07Information Sciences, vol 540, p. 160–17410.1016/j.ins.2020.06.014https://hdl.handle.net/10433/19787This work proposes a novel algorithm to forecast big data time series. Based on the well-established Pattern Sequence-based Forecasting algorithm, this new approach has two major contributions to the literature. First, the improvement of the original algorithm with respect to the accuracy of predictions, and second, its transformation into the big data context, having reached meaningful results in terms of scalability. The algorithm uses the Apache Spark distributed computation framework and it is a ready-to-use application with few parameters to adjust. Physical and cloud clusters have been used to carry out the experimentation, which consisted in applying the algorithm to real-world data from Uruguay electricity demand.application/pdfenBig dataTime seriesForecastingElectricityBig data time series forecasting based on pattern sequence similarity and its application to the electricity demandjournal articlerestricted access