Institutional Repository
| multi-objective optimization integration of query interfaces for the deep web based on attribute constraints | |
| Li Yanni; Wang Yuping; Jiang Peng; Zhang Zhensong | |
| 2013 | |
| Source | Data and Knowledge Engineering
![]() |
| ISSN | 0169-023X |
| Volume | 86Pages:- |
| English Abstract | In order to query and retrieve the rich and useful information hidden in the Deep Web efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integration of query interfaces of domain-specific Web Databases (WDBs) remains a serious challenge due to the scale of the problem and the great diversity of the WDBs' query interfaces. To address this challenge, in this paper, we first give a definition of the constraint matrix which can accurately describe three types of constraints (hierarchical constraints, group constraints and precedence constraints) and the strengths of attributes of a query interface, and then prove that the schema tree of the query interface corresponds to only one constraint matrix, and vice versa. Furthermore, we transform the problem of integrating domain-specific query interfaces into a problem of integrating the constraint matrices and set up a multi-objective optimization problem model. To effectively solve the optimization model, some strategies to extend and merge the constraint matrices are designed. A method for automatically detecting and filtering abnormal data (noises) in the query interfaces is also proposed. More importantly, a novel and efficient algorithm applicable to large-scale automatic integration of domain-specific query interfaces is developed. Finally, the proposed algorithm is evaluated by experiments on the real query interface data set. Our theoretical analysis and experimental results show that the proposed algorithm outperforms existing state-of-the-art integration algorithms of domain-specific query interfaces. © 2013 Elsevier B.V. All rights reserved.; In order to query and retrieve the rich and useful information hidden in the Deep Web efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integration of query interfaces of domain-specific Web Databases (WDBs) remains a serious challenge due to the scale of the problem and the great diversity of the WDBs' query interfaces. To address this challenge, in this paper, we first give a definition of the constraint matrix which can accurately describe three types of constraints (hierarchical constraints, group constraints and precedence constraints) and the strengths of attributes of a query interface, and then prove that the schema tree of the query interface corresponds to only one constraint matrix, and vice versa. Furthermore, we transform the problem of integrating domain-specific query interfaces into a problem of integrating the constraint matrices and set up a multi-objective optimization problem model. To effectively solve the optimization model, some strategies to extend and merge the constraint matrices are designed. A method for automatically detecting and filtering abnormal data (noises) in the query interfaces is also proposed. More importantly, a novel and efficient algorithm applicable to large-scale automatic integration of domain-specific query interfaces is developed. Finally, the proposed algorithm is evaluated by experiments on the real query interface data set. Our theoretical analysis and experimental results show that the proposed algorithm outperforms existing state-of-the-art integration algorithms of domain-specific query interfaces. © 2013 Elsevier B.V. All rights reserved. |
| Indexed Type | EI |
| Keyword | Algorithms Multiobjective Optimization World Wide Web |
| Department | (1) School of Computer Science and Technology Xidian University No. 2 South Taibai Road Xi'an Shaanxi 710071 PR China; (2) School of Software Xidian University No. 2 South Taibai Road Xi'an Shaanxi 710071 PR China; (3) Institute of Computing Technology Chinese Academy of Sciences No. 6 Kexueyuan South Road Zhongguancun Haidian District Beijing 100190 PR China; (4) Institute of Software Chinese Academy of Sciences No. 4 Kexueyuan South Road Zhongguancun Haidian District Beijing 100190 PR China |
| Language | 英语 |
| WOS ID | WOS:000320353700003 |
| Citation statistics | |
| Content Type | 期刊论文 |
| URI | http://ir.iscas.ac.cn/handle/311060/15212 |
| Collection | 中国科学院软件研究所 |
| Recommended Citation GB/T 7714 | Li Yanni,Wang Yuping,Jiang Peng,et al. multi-objective optimization integration of query interfaces for the deep web based on attribute constraints[J]. Data and Knowledge Engineering,2013,86:-. |
| APA | Li Yanni,Wang Yuping,Jiang Peng,&Zhang Zhensong.(2013).multi-objective optimization integration of query interfaces for the deep web based on attribute constraints.Data and Knowledge Engineering,86,-. |
| MLA | Li Yanni,et al."multi-objective optimization integration of query interfaces for the deep web based on attribute constraints".Data and Knowledge Engineering 86(2013):-. |
| Files in This Item: | There are no files associated with this item. | |||||
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment