ISCAS OpenIR
multi-objective optimization integration of query interfaces for the deep web based on attribute constraints
Li Yanni; Wang Yuping; Jiang Peng; Zhang Zhensong
2013
SourceData and Knowledge Engineering
ISSN0169-023X
Volume86Pages:-
English AbstractIn order to query and retrieve the rich and useful information hidden in the Deep Web efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integration of query interfaces of domain-specific Web Databases (WDBs) remains a serious challenge due to the scale of the problem and the great diversity of the WDBs' query interfaces. To address this challenge, in this paper, we first give a definition of the constraint matrix which can accurately describe three types of constraints (hierarchical constraints, group constraints and precedence constraints) and the strengths of attributes of a query interface, and then prove that the schema tree of the query interface corresponds to only one constraint matrix, and vice versa. Furthermore, we transform the problem of integrating domain-specific query interfaces into a problem of integrating the constraint matrices and set up a multi-objective optimization problem model. To effectively solve the optimization model, some strategies to extend and merge the constraint matrices are designed. A method for automatically detecting and filtering abnormal data (noises) in the query interfaces is also proposed. More importantly, a novel and efficient algorithm applicable to large-scale automatic integration of domain-specific query interfaces is developed. Finally, the proposed algorithm is evaluated by experiments on the real query interface data set. Our theoretical analysis and experimental results show that the proposed algorithm outperforms existing state-of-the-art integration algorithms of domain-specific query interfaces. © 2013 Elsevier B.V. All rights reserved.; In order to query and retrieve the rich and useful information hidden in the Deep Web efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integration of query interfaces of domain-specific Web Databases (WDBs) remains a serious challenge due to the scale of the problem and the great diversity of the WDBs' query interfaces. To address this challenge, in this paper, we first give a definition of the constraint matrix which can accurately describe three types of constraints (hierarchical constraints, group constraints and precedence constraints) and the strengths of attributes of a query interface, and then prove that the schema tree of the query interface corresponds to only one constraint matrix, and vice versa. Furthermore, we transform the problem of integrating domain-specific query interfaces into a problem of integrating the constraint matrices and set up a multi-objective optimization problem model. To effectively solve the optimization model, some strategies to extend and merge the constraint matrices are designed. A method for automatically detecting and filtering abnormal data (noises) in the query interfaces is also proposed. More importantly, a novel and efficient algorithm applicable to large-scale automatic integration of domain-specific query interfaces is developed. Finally, the proposed algorithm is evaluated by experiments on the real query interface data set. Our theoretical analysis and experimental results show that the proposed algorithm outperforms existing state-of-the-art integration algorithms of domain-specific query interfaces. © 2013 Elsevier B.V. All rights reserved.
Indexed TypeEI
KeywordAlgorithms Multiobjective Optimization World Wide Web
Department(1) School of Computer Science and Technology Xidian University No. 2 South Taibai Road Xi'an Shaanxi 710071 PR China; (2) School of Software Xidian University No. 2 South Taibai Road Xi'an Shaanxi 710071 PR China; (3) Institute of Computing Technology Chinese Academy of Sciences No. 6 Kexueyuan South Road Zhongguancun Haidian District Beijing 100190 PR China; (4) Institute of Software Chinese Academy of Sciences No. 4 Kexueyuan South Road Zhongguancun Haidian District Beijing 100190 PR China
Language英语
WOS IDWOS:000320353700003
Citation statistics
Content Type期刊论文
URIhttp://ir.iscas.ac.cn/handle/311060/15212
Collection中国科学院软件研究所
Recommended Citation
GB/T 7714
Li Yanni,Wang Yuping,Jiang Peng,et al. multi-objective optimization integration of query interfaces for the deep web based on attribute constraints[J]. Data and Knowledge Engineering,2013,86:-.
APA Li Yanni,Wang Yuping,Jiang Peng,&Zhang Zhensong.(2013).multi-objective optimization integration of query interfaces for the deep web based on attribute constraints.Data and Knowledge Engineering,86,-.
MLA Li Yanni,et al."multi-objective optimization integration of query interfaces for the deep web based on attribute constraints".Data and Knowledge Engineering 86(2013):-.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Li Yanni]'s Articles
[Wang Yuping]'s Articles
[Jiang Peng]'s Articles
Baidu academic
Similar articles in Baidu academic
[Li Yanni]'s Articles
[Wang Yuping]'s Articles
[Jiang Peng]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Li Yanni]'s Articles
[Wang Yuping]'s Articles
[Jiang Peng]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.