中国科学院软件研究所机构知识库
Advanced  
ISCAS OpenIR  > 软件所图书馆  > 2009年期刊/会议论文
题名:
research of chinese text classification methods based on semantic vector and semantic similarity
作者: Song Xin ; Huang Jia ; Zhou Jing-Min ; Chen Xi
会议文集: IFCSTA 2009 Proceedings - 2009 International Forum on Computer Science-Technology and Applications
会议名称: 2009 International Forum on Computer Science-Technology and Applications, IFCSTA 2009
会议日期: 40879
出版日期: 2009
会议地点: Chongqing, China
关键词: Computer science ; Information retrieval systems ; Knowledge representation ; Semantics ; Vector spaces ; Vectors
出版地: United States
收录类别: ei
ISBN: 9780769539300
部门归属: (1) State Key Laboratory of Software Development Environment, Beihang University, 100191, Beijing, China; (2) Institute of Software Chinese Academy of Sciences, 100190, Beijing, China
主办者: IITAA - International Information Technology; and Applications Association
英文摘要: To overcome the limitations of traditional text classification approaches based on bag-of-words representation and to effectively incorporate linguistic knowledge and conceptual index into text vector space model, based on two thesaurus HowNet and Tongyici Cilin(hereinafter referred to Cilin), we use semantic vector to describe a document instead of traditional keywords vector, which is based on merging words with high similarity and using a concept to describe the semantic feature rather than a series of words. It not only reduces feature dimension but also adds semantic information to the vector. We also use sentence (document) similarity based on simple vector distance to classify the text and three groups of experiments are made respectively. The results show that the accuracy rates are generally improved along with semantic treatment, which indicates that semantic mining is very important and necessary to text classification. © 2009 IEEE.
语种: 英语
内容类型: 会议论文
URI标识: http://ir.iscas.ac.cn/handle/311060/8434
Appears in Collections:中科院软件所图书馆_2009年期刊/会议论文

Files in This Item:

There are no files associated with this item.


Recommended Citation:
Song Xin,Huang Jia,Zhou Jing-Min,et al. research of chinese text classification methods based on semantic vector and semantic similarity[C]. 见:2009 International Forum on Computer Science-Technology and Applications, IFCSTA 2009. Chongqing, China. 40879.
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[Song Xin]'s Articles
[Huang Jia]'s Articles
[Zhou Jing-Min]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[Song Xin]‘s Articles
[Huang Jia]‘s Articles
[Zhou Jing-Min]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2017  中国科学院软件研究所 - Feedback
Powered by CSpace