中国科学院软件研究所机构知识库
Advanced  
ISCAS OpenIR  > 中科院软件所  > 中科院软件所
题名:
WebDMS:基于Web的文档管理系统
作者: 温福才
答辩日期: 2000
专业: 计算机软件和理论
授予单位: 中国科学院软件研究所
授予地点: 中国科学院软件研究所
学位: 博士
关键词: Web的文档管理系统 ; 文本-图象映射 ; 信息抽取 ; 自适应内容发送 ; Web技术
摘要: 随着Web技术的日益成熟而越来越受到人们的欢迎,传统的文档管理系统(DMS)适应这一趋势而需要体系结构上的改变,这样就产生了基于Web的文档管理系统的概念。此外,传统的DMS所管理的文档大量是纸文档的扫描图象;在基于Web的DMS系统环境里,当图象文档显示在客户端的时候,为了方便用户,必须像HTML页面那样,允许用户进行“导航”。同时,由于HTML文档的大量产生,入库的HTML文档也越来越多。为了对这些HTML文档上的不规则动态信息按照数据库的方式集成和查询,必须抽取页面上的信息,生成类似于XML的结构,以便进行高效的检索。考虑到客户端设备的能力、网络带宽和用户偏好,在传输多媒体文档信息的时候,必须进行内容改编,以适应上述特定情况。本文在总结、分析传统的文档管理系统的基础上,指出了它们所存在的上述问题,并提出了我们的解决方案,这就是我们的原型系统WebDMS-基于Web的文档管理系统。关于图象“导航”,我们采用文本-图象映射的方法,解决了这个问题。关于HTML文档结构信息抽取的问题,我们采用了一种启发式规则和数据抽取格式相结合的抽取算法进行了解决。关于内容改编发送的问题,我们采用了自适应内容发送,其框架包括:内容改编算法、客户端能力和网络带宽发现方法及决定引擎。三个子系统基本上是独立进行的,系统可移植性、可扩充性良好。
英文摘要: With the maturity and popularity of Web technology, it is required for the traditional Document Management System (DMS) to change in architecture to adapt to the trend, resulting in the concept of Web-based Document Management System. In addition, many of what the traditional DMS manages are images that are scanned from paper media; in Web-based DMS system environment, image document displayed on client device should, as HTML page, allow users to navigate for the convenience of users. With the proliferation of document in HTML format at the same time, many HTML documents would be archived accordingly. In order to integrate and query irregular and dynamic information on these pages in a database-like fashion, the structure information need to be extracting to improve the query performance, generating XML-like structure. Taking into account the capability of client device, network bandwidth and user preference, it's required to adapt the content of the multimedia document when delivering. Based upon the summarization and analysis of the traditional DMS', this paper points out the above-mentioned problems and provides our solution to these problems, which is the prototype system WebDMS. We resolve the problem of image "navigation" by adopting the method of text-image mapping. As to the problem of structure information extraction of HTML document, we resolve it by utilizing an extraction algorithm that combines heuristics rules with the format description of data extraction. The problem of document content adaptation and delivery is resolved by adopting the adaptive content delivery, for which the framework includes content adaptation algorithm, client capability and network bandwidth discovery methods, and a Decision Engine for determining when and how to adapt content. These subsystems are independent so that the system is of high portability and expandability.
语种: 中文
内容类型: 学位论文
URI标识: http://ir.iscas.ac.cn/handle/311060/6696
Appears in Collections:中科院软件所

Files in This Item:
File Name/ File Size Content Type Version Access License
LW002156.pdf(2345KB)----限制开放-- 联系获取全文

Recommended Citation:
温福才. WebDMS:基于Web的文档管理系统[D]. 中国科学院软件研究所. 中国科学院软件研究所. 2000-01-01.
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[温福才]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[温福才]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2017  中国科学院软件研究所 - Feedback
Powered by CSpace