Institutional Repository
| multi-objective optimization integration of query interfaces for the deep web based on attribute constraints | |
| Li Yanni; Wang Yuping; Jiang Peng; Zhang Zhensong | |
| 2013 | |
| Journal | Data and Knowledge Engineering |
| ISSN | 0169-023X |
| Volume | 86; Pages: - |
| Abstract | In order to query and retrieve the rich and useful information hidden in the Deep Web efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integration of query interfaces of domain-specific Web Databases (WDBs) remains a serious challenge due to the scale of the problem and the great diversity of the WDBs' query interfaces. To address this challenge, in this paper, we first give a definition of the constraint matrix which can accurately describe three types of constraints (hierarchical constraints, group constraints and precedence constraints) and the strengths of attributes of a query interface, and then prove that the schema tree of the query interface corresponds to only one constraint matrix, and vice versa. Furthermore, we transform the problem of integrating domain-specific query interfaces into a problem of integrating the constraint matrices and set up a multi-objective optimization problem model. To effectively solve the optimization model, some strategies to extend and merge the constraint matrices are designed. A method for automatically detecting and filtering abnormal data (noises) in the query interfaces is also proposed. More importantly, a novel and efficient algorithm applicable to large-scale automatic integration of domain-specific query interfaces is developed. Finally, the proposed algorithm is evaluated by experiments on the real query interface data set. Our theoretical analysis and experimental results show that the proposed algorithm outperforms existing state-of-the-art integration algorithms of domain-specific query interfaces. © 2013 Elsevier B.V. All rights reserved. |
| Indexed By | EI |
| Keywords | Algorithms; Multiobjective Optimization; World Wide Web |
| Affiliation | (1) School of Computer Science and Technology, Xidian University, No. 2 South Taibai Road, Xi'an, Shaanxi 710071, PR China; (2) School of Software, Xidian University, No. 2 South Taibai Road, Xi'an, Shaanxi 710071, PR China; (3) Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Zhongguancun, Haidian District, Beijing 100190, PR China; (4) Institute of Software, Chinese Academy of Sciences, No. 4 Kexueyuan South Road, Zhongguancun, Haidian District, Beijing 100190, PR China |
| Language | English |
| WOS Accession Number | WOS:000320353700003 |
| Content Type | Journal article |
| URI | http://ir.iscas.ac.cn/handle/311060/15212 |
| Collection | Institute of Software, Chinese Academy of Sciences |
| Recommended Citation (GB/T 7714) | Li Yanni, Wang Yuping, Jiang Peng, et al. multi-objective optimization integration of query interfaces for the deep web based on attribute constraints[J]. Data and Knowledge Engineering, 2013, 86: -. |
| APA | Li Yanni, Wang Yuping, Jiang Peng, & Zhang Zhensong. (2013). multi-objective optimization integration of query interfaces for the deep web based on attribute constraints. Data and Knowledge Engineering, 86, -. |
| MLA | Li Yanni, et al. "multi-objective optimization integration of query interfaces for the deep web based on attribute constraints". Data and Knowledge Engineering 86 (2013): -. |
| Files in This Item | There are no files associated with this item. |