Aggregate selection, individual selection, and cluster selection: an empirical evaluation and implications for systems research

Vangumalli, Dinesh Reddy, Nikolopoulos, Konstantinos and Litsiou, Konstantia (2021) Aggregate selection, individual selection, and cluster selection: an empirical evaluation and implications for systems research. Cybernetics and Systems: An International Journal, 52 (7). pp. 553-578. ISSN 0196-9722

Accepted Version
Available under License Creative Commons Attribution Non-commercial.

Official URL: https://www.tandfonline.com/doi/full/10.1080/01969...

Abstract

When forecasting a large number of time series, data analysts regularly employ one of two methodological approaches: either select a single forecasting method for the entire dataset (aggregate selection), or use the best forecasting method for each time series (individual selection). There is evidence in the predictive analytics literature that the former is more robust than the latter, as individual selection tends to overfit models to the data. A third approach is to first identify homogeneous clusters within the dataset, and then select a single forecasting method for each cluster (cluster selection). To that end, we examine three machine learning clustering methods: k-medoids, k-NN and random forests. The evaluation is performed on the 645 yearly series of the M3 competition. The empirical evidence suggests that: a) random forests provide the best clusters for the sequential forecasting task, and b) cluster selection has the potential to outperform aggregate selection.
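The three selection strategies named in the abstract can be contrasted in a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the article: the three candidate methods (naive, drift, historical mean), the holdout mean absolute error used to rank them, the toy series, and the hand-supplied cluster labels (the article obtains clusters with k-medoids, k-NN and random forests, which this sketch does not implement).

```python
from statistics import mean

# Hypothetical candidate methods; the article's actual method pool is not given here.
def naive(history, h):
    return [history[-1]] * h

def drift(history, h):
    slope = (history[-1] - history[0]) / (len(history) - 1)
    return [history[-1] + slope * (i + 1) for i in range(h)]

def hist_mean(history, h):
    return [mean(history)] * h

METHODS = {"naive": naive, "drift": drift, "mean": hist_mean}

def validation_errors(series, h):
    """Mean absolute error of each method on the last h points of one series."""
    train, valid = series[:-h], series[-h:]
    return {
        name: mean(abs(f - a) for f, a in zip(fn(train, h), valid))
        for name, fn in METHODS.items()
    }

def best_method(error_rows):
    """Method with the lowest average validation error over the given series."""
    return min(METHODS, key=lambda m: mean(row[m] for row in error_rows))

def aggregate_selection(dataset, h):
    """One method for the whole dataset."""
    errors = [validation_errors(s, h) for s in dataset]
    return [best_method(errors)] * len(dataset)

def individual_selection(dataset, h):
    """The best method for each series separately."""
    return [best_method([validation_errors(s, h)]) for s in dataset]

def cluster_selection(dataset, labels, h):
    """One method per cluster; labels are assumed to be given."""
    errors = [validation_errors(s, h) for s in dataset]
    winners = {}
    for label in set(labels):
        rows = [e for e, l in zip(errors, labels) if l == label]
        winners[label] = best_method(rows)
    return [winners[l] for l in labels]

# Toy data: two trending series and one oscillating series.
dataset = [
    [1, 2, 3, 4, 5, 6, 7, 8],
    [2, 4, 6, 8, 10, 12, 14, 16],
    [10, 12, 10, 12, 10, 12, 11, 11],
]
labels = [0, 0, 1]  # assumed cluster assignment, supplied by hand

print(aggregate_selection(dataset, 2))
print(individual_selection(dataset, 2))
print(cluster_selection(dataset, labels, 2))
```

On this toy data, aggregate selection applies one method to all three series, while individual and cluster selection treat the oscillating series differently from the trending ones. With many short, noisy series, the per-series choice is the one most exposed to overfitting, which is the concern the abstract raises.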

Item Type:	Article
Peer-reviewed:	Yes
Date Deposited:	20 May 2021 11:57
Publisher:	Taylor & Francis
Additional Information:	This is an Accepted Manuscript of an article published by Taylor & Francis in Cybernetics and Systems: An International Journal on 14th June 2021, available at: http://www.tandfonline.com/10.1080/01969722.2021.1902049. It is deposited under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Divisions:	Organisation > Business and Law
Subject terms:	0801 Artificial Intelligence and Image Processing, 1702 Cognitive Sciences, Artificial Intelligence & Image Processing
URI:	https://e-space.mmu.ac.uk/id/eprint/627754
DOI:	https://doi.org/10.1080/01969722.2021.1902049
ISSN:	0196-9722
e-ISSN:	1087-6553
