Technical review : performance of existing imputation methods for missing data in SVM ensemble creation

Loading...
Thumbnail Image
Other Title
Authors
Ali, Shahid
Dacey, Simon
Author ORCID Profiles (clickable)
Degree
Grantor
Date
2017
Supervisors
Type
Journal Article
Ngā Upoko Tukutuku (Māori subject headings)
Keyword
missing data problem
ensemble learning
imputation methods
series mean (SM) method
support vector machine (SVM)
bootstrap aggregating (meta-algorithm)
bagging (meta-algorithm)
boosting (meta-algorithm)
aggregation (machine learning)
air pollution analysis
SVM
ANZSRC Field of Research Code (2020)
Citation
Ali, S., & Dacey, S. (2017). Technical Review: Performance of Existing Imputation Methods for Missing Data in SVM Ensemble Creation. International Journal of Data Mining & Knowledge Management Process (IJDKP), 7(6), 75-91. doi:10.5121/ijdkp.2017.7606
Abstract
Incomplete data is present in many study contents. This incomplete or uncollected data information is named as missing data (values), and considered as vital problem for various researchers. Even this missing data problem is faced more in air pollution monitoring stations, where data is collected from multiple monitoring stations widespread across various locations. In literature, various imputation methods for missing data are proposed, however, in this research we considered only existing imputation methods for missing data and recorded their performance in ensemble creation. The five existing imputation methods for missing data deployed in this research are series mean method, mean of nearby points, median of nearby points, linear trend at a point and linear interpolation respectively. Series mean (SM) method demonstrated comparatively better to other imputation methods with least mean absolute error and better performance accuracy for SVM ensemble creation on CO data set using bagging and boosting algorithms.
Publisher
AIRCC Publishing Corporation
Link to ePress publication
DOI
doi:10.5121/ijdkp.2017.7606
Copyright holder
Authors
Copyright notice
All rights reserved
Copyright license
This item appears in: