Institutional Repository
| tc-dca: a system for text classification based on document's content allocation | |
| Li Wenbo; Sun Le; Zhang Zhenzhong; Jiang Xue; Zhang Weiru | |
| 2010 | |
| 会议名称 | 19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10 |
| 会议录名称 | International Conference on Information and Knowledge Management, Proceedings |
| 页码 | 1937-1938 |
| 会议日期 | 40842 |
| 会议地点 | Toronto, ON, Canada |
| 收录类别 | EI |
| 出版地 | United States |
| ISBN | 9781450000000 |
| 部门归属 | (1) Institute of Software, Chinese Academy of Sciences, 4# South Fourth Street, Zhong Guan Cun, Beijing, China |
| 摘要 | The text classification methods heavily depend on machine learning algorithms with abstract mathematic metrics, which obstruct the direct observation and intuitive understanding of the text-specific classification. In this paper, we model a document as a Document-Classes-Topics top-down hierarchical structure. Furthermore, by running the document generation procedure, we can obtain each class's content share, which not only can be used to make the classification decision but also can provide a natural visualization approach for text classification. We implement this idea by a new tool named TC-DCA, which provides the visualization of text classification result, where the target document is expressed graphically as its content's allocation on every class. TC-DCA can also perform the drilling down operation to reveal the classification effect of each word of the document. |
| 关键词 | Knowledge Management Learning Algorithms Text Processing Visualization |
| 主办者 | ACM SIGIR; ACM SIGWEB; ACM SIGKDD |
| 内容类型 | 会议论文 |
| URI标识 | http://ir.iscas.ac.cn/handle/311060/8928 |
| 专题 | 基础软件国家工程研究中心 |
| 推荐引用方式 GB/T 7714 | Li Wenbo,Sun Le,Zhang Zhenzhong,et al. tc-dca: a system for text classification based on document's content allocation[C]. United States,2010:1937-1938. |
| 条目包含的文件 | ||||||
| 文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
| p1937-li.pdf(636KB) | 开放获取 | -- | 请求全文 | |||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论