TY - JOUR ID - 8211 TI - Applying Regression Models on Subsets with High Correlations for a Better Numeric Missing Values Imputation JO - TABRIZ JOURNAL OF ELECTRICAL ENGINEERING JA - TJEE LA - en SN - 2008-7799 AU - Sefidian, A. M. AU - Daneshpour, N. AD - Faculty of Computer Engineering, Shahid Rajaee Teacher Training University, Tehran, Iran Y1 - 2018 PY - 2018 VL - 48 IS - 3 SP - 1187 EP - 1200 KW - Missing values imputation KW - Correlation KW - Regression DO - N2 - The presence of missing values in the real world data is a very prevalent and inevitable problem. So, it’s necessary to fill up these missing values accurately, before they are used for knowledge discovery process. This paper proposes three novel methods to fill numeric missing values. All of the proposed methods apply regression models on subsets of data which there are strong correlations among them. These subsets are selected using forward selection based approaches. In the selection of the desired subsets, it is tried to maximize the correlation between missing attribute and other attributes. The correlation coefficient is used to measure the relationships between attributes. The priority of each missing attribute for imputation purpose is also considered in the proposed methods. The performance of proposed methods is evaluated on five real world datasets with different missing ratios. The efficiency of the proposed methods is compared with five different estimation methods, namely, the mean imputation, the k nearest neighbours imputation, a fuzzy c-means based imputation, a decision tree based imputation, and a regression based imputation algorithm, called “Incremental Attribute Regression Imputation” (IARI) method. Two well-known evaluation criteria, namely, Root Mean Squared Error (RMSE) and Coefficient of Determination (CoD) are used to compare the performance of proposed methods with other imputation methods. Experimental results show that the proposed methods perform better than other compared methods, even when the missing ratio is high. UR - https://tjee.tabrizu.ac.ir/article_8211.html L1 - https://tjee.tabrizu.ac.ir/article_8211_0b81cdc096e41c64cdf50968958c1aee.pdf ER -