ISCAS OpenIR
design of text categorization system based on svm
Liu Zhenyan; Wang Weiping; Wang Yong
2012
Conference Name2012 2nd International Conference on Materials Science and Information Technology, MSIT 2012
SourceAdvanced Materials Research
Pages1191-1195
Conference DateAugust 24, 2012 - August 26, 2012
Conference PlaceXi'an, Shaan, China
Indexed TypeEI
ISSN1022-6680
ISBN9783037854389
Department(1) Institute of Computing Technology Chinese Academy of Sciences China; (2) Graduate University Chinese Academy of Sciences China; (3) School of Software Beijing Institute of Technology China
English AbstractThis paper introduces the design of a text categorization system based on Support Vector Machine (SVM). It analyzes the high dimensional characteristic of text data, the reason why SVM is suitable for text categorization. According to system data flow this system is constructed. This system consists of three subsystems which are text representation, classifier training and text classification. The core of this system is the classifier training, but text representation directly influences the currency of classifier and the performance of the system. Text feature vector space can be built by different kinds of feature selection and feature extraction methods. No research can indicate which one is the best method, so many feature selection and feature extraction methods are all developed in this system. For a specific classification task every feature selection method and every feature extraction method will be tested, and then a set of the best methods will be adopted. © (2012) Trans Tech Publications, Switzerland.; This paper introduces the design of a text categorization system based on Support Vector Machine (SVM). It analyzes the high dimensional characteristic of text data, the reason why SVM is suitable for text categorization. According to system data flow this system is constructed. This system consists of three subsystems which are text representation, classifier training and text classification. The core of this system is the classifier training, but text representation directly influences the currency of classifier and the performance of the system. Text feature vector space can be built by different kinds of feature selection and feature extraction methods. No research can indicate which one is the best method, so many feature selection and feature extraction methods are all developed in this system. For a specific classification task every feature selection method and every feature extraction method will be tested, and then a set of the best methods will be adopted. © (2012) Trans Tech Publications, Switzerland.
KeywordClassification (Of Information) Feature Extraction Image Retrieval Information Technology Materials Science Text Processing
Language英语
Content Type会议论文
URIhttp://ir.iscas.ac.cn/handle/311060/15957
Collection中国科学院软件研究所
Recommended Citation
GB/T 7714
Liu Zhenyan,Wang Weiping,Wang Yong. design of text categorization system based on svm[C],2012:1191-1195.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Liu Zhenyan]'s Articles
[Wang Weiping]'s Articles
[Wang Yong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Liu Zhenyan]'s Articles
[Wang Weiping]'s Articles
[Wang Yong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Liu Zhenyan]'s Articles
[Wang Weiping]'s Articles
[Wang Yong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.