Institutional Repository
| Measuring the Heterogeneity of Cross-company Dataset | |
| Chen J(陈嘉); Yang Y(杨叶); Zhang W(张文); Gregory Gay | |
| 2010-06 | |
| Conference Name | Profes 2010 |
| Conference Date | 2010-6-22 |
| Conference Place | 爱尔兰,Limerick大学 |
| English Abstract | As a standard practice, general effort estimate models are calibrated from large cross-company datasets. However, many of the records within such datasets are taken from companies that have calibrated the model to match their own local practices. Locally calibrated models are a double-edged sword; they often improve estimate accuracy for that particular organization, but they also encourage the growth of local biases. Such biases remain present when projects from that firm are used in a new cross-company dataset. Over time, such biases compound, and the reliability and accuracy of a general model derived from the data will be affected by the increased level of heterogeneity. In this paper, we propose a statistical measure of the exact level of heterogeneity of a cross-company dataset. In experimental tests, we measure the heterogeneity of two COCOMO-based datasets and demonstrate that one is more homogeneous than the other. Such a measure has potentially important implications for both model maintainers and model users. Furthermore, a heterogeneity measure can be used to inform users of the appropriate data handling techniques. |
| Keyword | Heterogeneous Datasets Software Effort Estimation Parameter Comparison Estimation Model Calibration |
| Subject | 软件工程 |
| Language | 英语 |
| Content Type | 会议论文 |
| URI | http://ir.iscas.ac.cn/handle/311060/14786 |
| Collection | 互联网软件技术实验室 |
| Recommended Citation GB/T 7714 | Chen J,Yang Y,Zhang W,et al. Measuring the Heterogeneity of Cross-company Dataset[C],2010. |
| Files in This Item: | ||||||
| File Name/Size | DocType | Version | Access | License | ||
| Compare Parameters_P(45KB) | 开放获取 | License | Application Full Text | |||
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment