ISCAS OpenIR
on the predictability of software efforts using machine learning techniques
Zhang Wen; Yang Ye; Wang Qing
2011
Conference Name6th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2011
SourceENASE 2011 - Proceedings of the 6th International Conference on Evaluation of Novel Approaches to Software Engineering
Pages5-14
Conference DateJune 8, 2011 - June 11, 2011
Conference PlaceBeijing, China
Indexed TypeEI
ISBN9789898425577
Department(1) Laboratory for Internet Software Technologies Institute of Software Chinese Academy of Sciences Beijing 100190 China
English AbstractThis paper investigates the predictability of software effort using machine learning techniques. We employed unsupervised learning as k-medoids clustering with different similarity measures to extract natural clusters of projects from software effort data set, and supervised learning as J48 decision tree, back propagation neural network (BPNN) and na¨ive Bayes to classify the software projects. We also investigate the impact of imputing missing values of projects on the performances of both unsupervised and supervised learning techniques. Experiments on ISBSG and CSBSG data sets demonstrate that unsupervised learning as k-medoids clustering has produced a poor performance in software effort prediction and Kulzinsky coefficient has the best performance in software effort prediction in measuring the similarities of projects. Supervised learning techniques have produced superior performances in software effort prediction. Among the three supervised learning techniques, BPNN produces the best performance. Missing data imputation has improved the performances of both unsupervised and supervised learning techniques.; This paper investigates the predictability of software effort using machine learning techniques. We employed unsupervised learning as k-medoids clustering with different similarity measures to extract natural clusters of projects from software effort data set, and supervised learning as J48 decision tree, back propagation neural network (BPNN) and na¨ive Bayes to classify the software projects. We also investigate the impact of imputing missing values of projects on the performances of both unsupervised and supervised learning techniques. Experiments on ISBSG and CSBSG data sets demonstrate that unsupervised learning as k-medoids clustering has produced a poor performance in software effort prediction and Kulzinsky coefficient has the best performance in software effort prediction in measuring the similarities of projects. Supervised learning techniques have produced superior performances in software effort prediction. Among the three supervised learning techniques, BPNN produces the best performance. Missing data imputation has improved the performances of both unsupervised and supervised learning techniques.
KeywordDecision Trees Forecasting Neural Networks Software Engineering Supervised Learning Unsupervised Learning
Language英语
Content Type会议论文
URIhttp://ir.iscas.ac.cn/handle/311060/16307
Collection中国科学院软件研究所
Recommended Citation
GB/T 7714
Zhang Wen,Yang Ye,Wang Qing. on the predictability of software efforts using machine learning techniques[C],2011:5-14.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Zhang Wen]'s Articles
[Yang Ye]'s Articles
[Wang Qing]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhang Wen]'s Articles
[Yang Ye]'s Articles
[Wang Qing]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhang Wen]'s Articles
[Yang Ye]'s Articles
[Wang Qing]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.