Title: | text classification using semi-supervised clustering |
Author: | Zhang Wen
; Yoshida Taketoshi
; Tang Xijin
|
Source: | 2009 International Conference on Business Intelligence and Financial Engineering, BIFE 2009
|
Conference Name: | 2009 International Conference on Business Intelligence and Financial Engineering, BIFE 2009
|
Conference Date: | 37461
|
Issued Date: | 2009
|
Conference Place: | Beijing, China
|
Keyword: | Classification (of information)
; Maximum principle
; Optimization
; Support vector machines
|
Publish Place: | United States
|
Indexed Type: | EI
|
ISBN: | 9780769537054
|
Department: | (1) School of Knowledge Science, Japan Advanced Institute of Science and Technology, 1-1, Ashahidai, Tatsunokuchi, Ishikawa 923-1292, Japan; (2) Lab. for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; (3) Institute of Systems Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100080, China
|
English Abstract: | In this paper, mixture models are used to classify documents. The basic assumption for the documents in a collection is that each class is composed of a number of mixture components. By indentifying the components in the document collection, the classes of documents can thereby be identified from each other. A semi-supervised clustering method is proposed to identify the components (clusters), and further, unlabeled data is used to produce more accurate clusters in document collection to correspond the components of document classes. Experimental results show that the proposed method produces better performances than support vector machine (SVM) with linear kernel, and produces comparable performance with Bayesian classifier with Expectation Maximization (EM) in text classification. © 2009 IEEE. |
Content Type: | 会议论文
|
URI: | http://ir.iscas.ac.cn/handle/311060/8520
|
Appears in Collections: | 互联网软件技术实验室 _会议论文
|
File Name/ File Size |
Content Type |
Version |
Access |
License |
|
2009-张文-BIFE-Text Classification Using Semi-Supervised Clustering.pdf(268KB) | -- | -- | 限制开放 | -- | 联系获取全文 |
|
Recommended Citation: |
Zhang Wen,Yoshida Taketoshi,Tang Xijin. text classification using semi-supervised clustering[C]. 见:2009 International Conference on Business Intelligence and Financial Engineering, BIFE 2009. Beijing, China. 37461.
|
|
|