ISCAS OpenIR  > 互联网软件技术实验室
Measuring the Heterogeneity of Cross-company Dataset
Chen J(陈嘉); Yang Y(杨叶); Zhang W(张文); Gregory Gay
2010-06
会议名称Profes 2010
会议日期2010-6-22
会议地点爱尔兰,Limerick大学
摘要As a standard practice, general effort estimate models are calibrated from large cross-company datasets. However, many of the records within such datasets are taken from companies that have calibrated the model to match their own local practices. Locally calibrated models are a double-edged sword; they often improve estimate accuracy for that particular organization, but they also encourage the growth of local biases. Such biases remain present when projects from that firm are used in a new cross-company dataset. Over time, such biases compound, and the reliability and accuracy of a general model derived from the data will be affected by the increased level of heterogeneity. In this paper, we propose a statistical measure of the exact level of heterogeneity of a cross-company dataset. In experimental tests, we measure the heterogeneity of two COCOMO-based datasets and demonstrate that one is more homogeneous than the other. Such a measure has potentially important implications for both model maintainers and model users. Furthermore, a heterogeneity measure can be used to inform users of the appropriate data handling techniques.
关键词Heterogeneous Datasets Software Effort Estimation Parameter Comparison Estimation Model Calibration
学科领域软件工程
语种英语
内容类型会议论文
URI标识http://ir.iscas.ac.cn/handle/311060/14786
专题互联网软件技术实验室
推荐引用方式
GB/T 7714
Chen J,Yang Y,Zhang W,et al. Measuring the Heterogeneity of Cross-company Dataset[C],2010.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Compare Parameters_P(45KB) 开放获取使用许可请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Chen J(陈嘉)]的文章
[Yang Y(杨叶)]的文章
[Zhang W(张文)]的文章
百度学术
百度学术中相似的文章
[Chen J(陈嘉)]的文章
[Yang Y(杨叶)]的文章
[Zhang W(张文)]的文章
必应学术
必应学术中相似的文章
[Chen J(陈嘉)]的文章
[Yang Y(杨叶)]的文章
[Zhang W(张文)]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。