Institutional Repository of the Institute of Software, Chinese Academy of Sciences (ISCAS OpenIR)
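Title:
自治异构数据源的集成查询处理 (Integrated Query Processing over Autonomous Heterogeneous Data Sources)
Author: 李效东 (Li Xiaodong)
Defense date: 2002
Major: Computer Application Technology
Degree-granting institution: Institute of Software, Chinese Academy of Sciences
Place of conferral: Institute of Software, Chinese Academy of Sciences
Degree: Doctoral
Keywords: data integration; XML query; data source description; XML algebra; declarative query language
Alternative title: Integrated Query Processing over Autonomous Heterogeneous Data Sources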
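Abstract: The need for data integration is long-standing, and data integration systems have long been a very active research topic in data management and related fields. As the Web becomes the dominant platform for information services, research on data integration systems in the Web environment is also flourishing. Building on XML technology, this thesis studies the problems that arise when integrating multiple autonomous, heterogeneous data sources in the Web environment. Its core is integrated query processing over autonomous heterogeneous data sources, and it covers the following five areas. (1) Based on a semi-structured data model, it proposes an XML data model representation, Xtree; on this model it gives a formal definition of path expressions and designs a new XML query language, AnXQL. (2) It describes the contents of data sources with view definitions over an XML mediated schema and, for the first time in XML integrated querying, applies the "answering queries using views" technique, which makes it possible to model subtle differences between the contents of data sources; on this basis it develops an efficient algorithm for finding query rewritings. (3) It designs an algebraic operator representation for the XML query language and, on top of it, studies the methods and feasibility of XML query optimization based on algebraic rewriting. (4) It designs a lightweight wrapper system for relational data sources, which translates XML queries into SQL queries and converts the returned tuple sets into the corresponding XML data format. (5) Based on DOM and inductive learning, it designs a declarative representation for expressing extraction rules, and automatically generates the components of wrappers for data-intensive Web sources from those rules. Finally, it briefly presents a declarative definition language for Web sites, which defines a site's content and structure declaratively and eases site construction and management.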
English abstract: The need to integrate multiple data sources has existed for a long time, and research on data integration systems has always been very active in the database community and related communities. As the Web becomes a major conduit for obtaining information, research on data integration in the Web environment has flourished in recent years. The thesis primarily contains five parts. First, we put forward an XML data model named XTree. Based on this model, the author designed a novel XML query language called AnXQL, which offers powerful expressions with simple constructs and precise semantics. Second, we used views on the mediated schema to describe the content of data sources. We then applied the 'answering queries using views' technique to XML query reformulation, a novel use of the technique. This mechanism can express fine-grained differences between data sources. The author also developed an algorithm that reformulates a user's query on the mediated schema into subqueries that refer directly to the schemas of the sources. Third, an algebra was proposed for XML queries, and the thesis addresses optimization techniques based on algebraic rewriting. In addition, the author quantified the degree of complexity of XML data to assist the optimization of queries with regular path expressions. Fourth, the thesis designed a novel, lightweight approach to translating XML queries into SQL and transforming the returned tuples into an XML representation; a wrapper for relational sources was developed based on this approach. Based on DOM and inductive learning, the thesis also presents a novel approach to semi-automatically generating Java classes that can form the dominant part of a wrapper for Web sources. Finally, the author proposed a declarative web page definition language, which can facilitate the maintenance, restructuring, and reuse of data-intensive Web sites.
Language: Chinese
Content type: Dissertation
URI: http://ir.iscas.ac.cn/handle/311060/6602
Appears in Collections: Institute of Software, Chinese Academy of Sciences (中科院软件所)

Files in This Item:
File Name/ File Size Content Type Version Access License
LW008671.pdf (3083KB) -- -- Restricted access -- Contact to obtain the full text

Recommended Citation:
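李效东 (Li Xiaodong). 自治异构数据源的集成查询处理 (Integrated Query Processing over Autonomous Heterogeneous Data Sources) [D]. Institute of Software, Chinese Academy of Sciences. 2002-01-01.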