Title: | absent features or missing values? |
Author: | Zhang Wen
; Yang Ye
; Wang Qing
|
Source: | SEKE 2010 - Proceedings of the 22nd International Conference on Software Engineering and Knowledge Engineering
|
Conference Name: | 22nd International Conference on Software Engineering and Knowledge Engineering, SEKE 2010
|
Conference Date: | 44013
|
Issued Date: | 2010
|
Conference Place: | Redwood City, CA, United states
|
Keyword: | Knowledge engineering
; Software engineering
|
Publish Place: | United States
|
Indexed Type: | EI
|
ISBN: | 1891706268
|
Department: | (1) Laboratory for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
|
Sponsorship: | Knowledge Systems Institute Graduate School
|
English Abstract: | To clarify the essence of unobserved values of software effort dataset, we comparatively investigate the effectiveness of regarding the unobserved values as absent features and missing values with the task of predicting software effort. When regarding unobserved values as absent features, max-margin classification is used to classify the effort directly. While regarding unobserved values as missing values, we use different imputation methods, including MINI (mean imputation based k nearest neighbor hot-deck imputation), CMI (class mean imputation) and MI (mean imputation) to impute missing values firstly and then SVM (support vector machine) is used to classify software efforts. The experiments show that the treatment of regarding unobserved values in software effort dataset as missing values produces more desirable performance measured by accuracy in using historical data for software effort classification than regarding unobserved values as absent features. Moreover, among the mentioned three imputation methods, on CSBSG data set, CMI has better performance than MINI, and on ISBSG data set, MINI has better performance than CMI. We explain the outcome in this paper. |
Content Type: | 会议论文
|
URI: | http://ir.iscas.ac.cn/handle/311060/8638
|
Appears in Collections: | 互联网软件技术实验室 _会议论文
|
There are no files associated with this item.
|
Recommended Citation: |
Zhang Wen,Yang Ye,Wang Qing. absent features or missing values?[C]. 见:22nd International Conference on Software Engineering and Knowledge Engineering, SEKE 2010. Redwood City, CA, United states. 44013.
|
|
|