Institutional Repository
| Title | 基于中心语块扩展的汉藏基本名词短语对的识别 |
| Alternative Title | Chinese-Tibetan Base Noun Phrase Alignment Based on Head-Phrase Extension |
| Author | 诺明花; 刘汇丹; 马龙龙; 吴健; 丁治明 |
| Year | 2013 |
| Source | 中文信息学报 (Journal of Chinese Information Processing) |
| ISSN | 1003-0077 |
| Volume | 27, Issue: 4, Pages: 63-69 |
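| Chinese Abstract (translated) | This paper proposes a framework for aligning Chinese-Tibetan base noun phrases. Starting from Chinese base noun phrases and searching for their correct Tibetan translations, it draws on English-Chinese phrase alignment methods and, in view of the particular characteristics of Tibetan, proposes a Tibetan base noun phrase identification method based on head-phrase extension. Two methods are proposed for extracting the Tibetan head-phrase: one that combines a dictionary with automatic word alignment results, and one based on sequence intersection. The head-phrase is then extended according to an extension confidence. Experimental results show that the Chinese-Tibetan base noun phrase pairs extracted by the sequence intersection method reduce the workload of manual correction and effectively support the construction of a Chinese-Tibetan base noun phrase bank. |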
| Indexed Type | CSCD |
| Abstract | This paper presents a Chinese-Tibetan base noun phrase alignment method. It is a two-phase procedure: identifying Chinese base noun phrases and then finding their Tibetan correspondences. We propose a Tibetan base noun phrase identification method based on head-phrase extension, in accordance with the morphological characteristics of Tibetan. In the first phase, we use a sequence intersection operation to obtain the Tibetan head-phrase. In the second phase, a head-phrase extension confidence is defined and applied to determine the boundary of the correspondence. Experimental results indicate that sequence intersection outperforms the other methods in head-phrase extension. The Chinese-Tibetan base noun phrase pairs produced by our method are effective in reducing subsequent manual checking, facilitating the construction of a phrase-level translation lexicon. |
| Keyword | Tibetan Information Processing; Base Noun Phrase (BaseNP); Head-phrase Extension |
| Department | Institute of Software, Chinese Academy of Sciences, Beijing 100190 |
| Language | Chinese |
| CSCD ID | CSCD:4907555 |
| Content Type | Journal article |
| URI | http://ir.iscas.ac.cn/handle/311060/16847 |
| Collection | Institute of Software, Chinese Academy of Sciences |
| Recommended Citation GB/T 7714 | 诺明花,刘汇丹,马龙龙,等. 基于中心语块扩展的汉藏基本名词短语对的识别[J]. 中文信息学报,2013,27(4):63-69. |
| APA | 诺明花,刘汇丹,马龙龙,吴健,&丁治明.(2013).基于中心语块扩展的汉藏基本名词短语对的识别.中文信息学报,27(4),63-69. |
| MLA | 诺明花,et al."基于中心语块扩展的汉藏基本名词短语对的识别".中文信息学报 27.4(2013):63-69. |
| Files in This Item | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.