Prediction Performance of Separate Collection of Packaging Waste Yields Using Genetic Algorithm Optimized Support Vector Machines
authors Sousa, V; Meireles, I; Oliveira, V; Dias-Ferreira, C
nationality International
author keywords Household packaging waste; Regression; Separate collection; SVM; Predictive model
abstract Understanding the drivers underlying waste production in general, and source-segregated waste in particular, is of utmost importance for waste managers. This work aims at evaluating the performance of support vector machines (SVM) models in the prediction of separate collection yields for packaging waste at municipal level. Two SVM models were developed for a case study of 42 municipalities simultaneously serviced by separate collection of packaging waste and by unsorted waste collection. The SVM-fixed model used a fixed set of 5 variables to predict collection yields, whereas the SVM-optimal model chose from a pool of 14 variables those that optimized performance, using a genetic algorithm. These SVM models were compared with 3 traditional regression models: the ordinary least square linear (OLS-L), the ordinary least square non-linear (OLS-NL) and robust regression. The robust regression model was further compared against the other regression models in order to assess the influence of the dataset outliers on the model performance. The coefficient of determination, R-2, was used to evaluate the performance of these models. The highest performance was attained by the SVM-optimal model (R-2 = 0.918), compared to the SVM-fixed model (R-2 = 0.670). The performance of the SVM-optimal model was 42% higher than the best performing regression model, the OLS-NL model (R-2 = 0.646). The differences in performance among the 3 regression models are small (circa 3%), whereas the exclusion of outliers improved their performance by 13%, indicating that outliers impacted more on performance than the type of traditional regression technique used. The results demonstrate that SVM model can be a viable alternative for prediction of separate collection of packaging waste yields and that there are nine important drivers that all together explain roughly 92% (R-2 = 0.918) of the variability in the separate collection yields data.
publisher SPRINGER
issn 1877-2641
isbn 1877-265X
year published 2019
volume 10
issue 12
beginning page 3603
ending page 3612
digital object identifier (doi) 10.1007/s12649-019-00656-3
web of science category Environmental Sciences
subject category Environmental Sciences & Ecology
unique article identifier WOS:000496664800007
  ciceco authors
  impact metrics
journal analysis (jcr 2019):
journal impact factor 2.851
5 year journal impact factor 2.608
category normalized journal impact factor percentile 60.943
dimensions (citation analysis):
altmetrics (social interaction):