Institutional Repository
| An investigation on the feasibility of cross-project defect prediction | |
| He Zhimin; Shu Fengdi; Yang Ye; Li Mingshu; Wang Qing | |
| 2011 | |
| Source | Automated Software Engineering |
| ISSN | 0928-8910 |
| Issue | 1 |
| Pages | 1-33 |
| English Abstract | Software defect prediction helps to optimize the allocation of testing resources by identifying defect-prone modules prior to testing. Most existing models build their prediction capability based on a set of historical data, presumably from the same or similar project settings as those under prediction. However, such historical data is not always available in practice. One potential way of predicting defects in projects without historical data is to learn predictors from data of other projects. This paper investigates defect predictions in the cross-project context, focusing on the selection of training data. We conduct three large-scale experiments on 34 data sets obtained from 10 open source projects. Major conclusions from our experiments include: (1) in the best cases, training data from other projects can provide better prediction results than training data from the same project; (2) the prediction results obtained using training data from other projects meet our criteria for acceptance at the average level: defects in 18 out of 34 cases were predicted at a Recall greater than 70% and a Precision greater than 50%; (3) results of cross-project defect predictions are related to the distributional characteristics of data sets, which are valuable for training data selection. We further propose an approach to automatically select suitable training data for projects without historical data. Prediction results provided by the training data selected by using our approach are comparable with those provided by training data from the same project. © 2011 Springer Science+Business Media, LLC. |
| Keyword | Data Reduction; Defects; Experiments; Software Testing |
| Department | (1) Laboratory for Internet Software Technologies, Institute of Software Chinese Academy of Sciences, Beijing, 100190, China; (2) Graduate University Chinese Academy of Sciences, Beijing, 100190, China; (3) State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing, 100190, China |
| WOS ID | WOS:000298798700003 |
| Content Type | Journal article |
| URI | http://ir.iscas.ac.cn/handle/311060/14067 |
| Collection | Laboratory for Internet Software Technologies |
| Recommended Citation GB/T 7714 | He Zhimin, Shu Fengdi, Yang Ye, et al. An investigation on the feasibility of cross-project defect prediction[J]. Automated Software Engineering, 2011(1): 1-33. |
| APA | He Zhimin, Shu Fengdi, Yang Ye, Li Mingshu, & Wang Qing. (2011). An investigation on the feasibility of cross-project defect prediction. Automated Software Engineering(1), 1-33. |
| MLA | He Zhimin, et al. "An investigation on the feasibility of cross-project defect prediction." Automated Software Engineering 1 (2011): 1-33. |
| Files in This Item: | ||||||
| File Name/Size | DocType | Version | Access | License | ||
| An investigation on (2130KB) | Open Access | -- | Application Full Text | |||
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.