中国科学院软件研究所机构知识库
Advanced  
ISCAS OpenIR  > 中科院软件所  > 中科院软件所
题名:
基于句类的指代解析及其在语音识别中的应用
作者: 孙伟峰
答辩日期: 2000
专业: 计算机软件与理论
授予单位: 中国科学院软件研究所
授予地点: 中国科学院软件研究所
学位: 博士
关键词: 自然语言处理 ; 指代解析 ; HNC理论 ; 语音识别
摘要: 指代解析(Anaphora resolution)属于自然语言处理的范畴,已经有很多学者从人工智能、古典语言学、认知心理学的不同角度对其进行了深入的研究。我们是从人工智能/计算语言学途径研究指代解析。HNC 理论(概念层次网络理论)是面向整个自然语言理解的强大而完备的语义描述体系。本文的工作是基于HNC理论的句类知识,对自然语言理解处理的五重模糊中的第五重模糊之指代模糊进行初步探讨。我们选取的应用是提高语音识别的准确率。我们的策略不同于传统的基于语法、焦点等知识的策略,也不同于基于统计、模式匹配的策略。我们考虑从语义分析入手,来获得自然语言深层结构的理解。这就是利用HNC理论的句类知识,从分析句子内的语义角色入手,根据人称代词所在语义块中的语义角色和人称代词对应的先行词可能的语义角色,给出人称代词指代解析的基本的约束规则和优选规则。并结合语音识别,本文提出了语音识别高层处理中人称代词的选取策略。我们以IBM ViaVoice语音识别软件为语音平台。
英文摘要: An algorithm is described in this paper for the automatic resolution of intra-sentential pronominal anaphoric references in Chinese sentences. Anaphora is cohesion (presupposition) which points back to some previous item. Anaphora resolution is a complicated problem in Natural Language Processing and has attracted the attention of many researchers. The method is based on sentences category (SC) and makes use of its representation formula. SC is the main concept of HNC theory -- Hierarchical Network of Concepts theory, which is a novel theory of natural language processing. In our approach, some personal noun phrases (NPs) preceding an anaphor are initially regarded as potential candidates for antecedents. Then, their semantic role are compared with that of personal pronoun, relying on a set of anaphora resolution factors. These factors can be "eliminating", i.e. discounting certain noun phrases from the set of possible candidates (such as gender and number constraints) or "preferential", giving more preference to certain candidates and less to others. All these factors are presented in this paper. The ability to perform anaphora resolution is important in speech recognition. Our algorithm is used to enhance the recognition ability of IBM Via Voice.
语种: 中文
内容类型: 学位论文
URI标识: http://ir.iscas.ac.cn/handle/311060/6656
Appears in Collections:中科院软件所

Files in This Item:
File Name/ File Size Content Type Version Access License
LW002144.pdf(2045KB)----限制开放-- 联系获取全文

Recommended Citation:
孙伟峰. 基于句类的指代解析及其在语音识别中的应用[D]. 中国科学院软件研究所. 中国科学院软件研究所. 2000-01-01.
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[孙伟峰]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[孙伟峰]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2017  中国科学院软件研究所 - Feedback
Powered by CSpace