Institutional Repository
| Measuring the Heterogeneity of Cross-company Dataset | |
| Chen J(陈嘉); Yang Y(杨叶); Zhang W(张文); Gregory Gay | |
| 2010-06 | |
| 会议名称 | Profes 2010 |
| 会议日期 | 2010-6-22 |
| 会议地点 | 爱尔兰,Limerick大学 |
| 摘要 | As a standard practice, general effort estimate models are calibrated from large cross-company datasets. However, many of the records within such datasets are taken from companies that have calibrated the model to match their own local practices. Locally calibrated models are a double-edged sword; they often improve estimate accuracy for that particular organization, but they also encourage the growth of local biases. Such biases remain present when projects from that firm are used in a new cross-company dataset. Over time, such biases compound, and the reliability and accuracy of a general model derived from the data will be affected by the increased level of heterogeneity. In this paper, we propose a statistical measure of the exact level of heterogeneity of a cross-company dataset. In experimental tests, we measure the heterogeneity of two COCOMO-based datasets and demonstrate that one is more homogeneous than the other. Such a measure has potentially important implications for both model maintainers and model users. Furthermore, a heterogeneity measure can be used to inform users of the appropriate data handling techniques. |
| 关键词 | Heterogeneous Datasets Software Effort Estimation Parameter Comparison Estimation Model Calibration |
| 学科领域 | 软件工程 |
| 语种 | 英语 |
| 内容类型 | 会议论文 |
| URI标识 | http://ir.iscas.ac.cn/handle/311060/14786 |
| 专题 | 互联网软件技术实验室 |
| 推荐引用方式 GB/T 7714 | Chen J,Yang Y,Zhang W,et al. Measuring the Heterogeneity of Cross-company Dataset[C],2010. |
| 条目包含的文件 | ||||||
| 文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
| Compare Parameters_P(45KB) | 开放获取 | 使用许可 | 请求全文 | |||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论