Juicer: Scalable Extraction for Thread Meta-information of Web Forum
Guo Yan; Wang Yu; Ding Guodong; Cao Donglin; Zhang Gang; Lv Yi
2009
Conference Name: Pacific Asia Workshop on Intelligence and Security Informatics, PAISI 2009
Source: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages: 143-148
Conference Date: April 27,
Conference Place: Bangkok, Thailand
Indexed Type: EI, ACM
Publish Place: Germany
ISSN: 0302-9743
ISBN: 9783642013928
Department: (1) Institute of Computing Technology, Chinese Academy of Sciences, China; (2) State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China
English Abstract: In Web forums, the thread meta-information contained in the list-of-thread of a board page provides fundamental data for further forum mining. This paper describes a complete system named Juicer, which was developed as a subsystem of an industrial application that involves forum mining. The task of Juicer is to extract thread meta-information from the board pages of a great many large-scale online Web forums, which implies that scalable extraction is required, with high accuracy and speed and minimal user effort for maintenance. Among the many existing approaches to information extraction, we could not find any that fully satisfies these requirements, so we present the simple scalable extraction approach behind Juicer to achieve this goal. Juicer consists of four modules: template generation, specifying the labeling setting, automatic extraction, and label assignment. Both experiments and practice show that Juicer successfully satisfies the requirements.
Keyword: Mining
Language: English
Content Type: Conference paper
URI: http://ir.iscas.ac.cn/handle/311060/8478
Collection: 2009 Journal/Conference Papers
Recommended Citation
GB/T 7714
Guo Yan, Wang Yu, Ding Guodong, et al. Juicer: scalable extraction for thread meta-information of web forum[C]. Germany, 2009: 143-148.
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.