中国科学院软件研究所机构知识库
Advanced  
ISCAS OpenIR  > 软件所图书馆  > 2009年期刊/会议论文
题名:
juicer: scalable extraction for thread meta-information of web forum
作者: Guo Yan ; Wang Yu ; Ding Guodong ; Cao Donglin ; Zhang Gang ; Lv Yi
会议文集: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
会议名称: Pacific Asia Workshop on Intelligence and Security Informatics, PAISI 2009
会议日期: April 27,
出版日期: 2009
会议地点: Bangkok, Thailand
关键词: Mining
出版地: Germany
收录类别: ei,acm
ISSN: 3029743
ISBN: 9783642013928
部门归属: (1) Institute of Computing Technology, Chinese Academy of Sciences, China; (2) State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China
英文摘要: In Web forum, thread meta-information contained in list-ofthread of board page provide fundamental data for the further forum mining. This paper describes a complete system named Juicer which was developed as a subsystem for an industrial application that involves forum mining. The task of Juicer is to extract thread meta-information from board pages of a great many of large scale online Web forums, which implies that scalable extraction is required with high accuracy and speed, and minimal user effort for maintenance. Among so many existed approaches about information extraction, we can not find any approach to fully satisfy the requirements, so we present simple scalable extraction approach behind Juicer to achieve the goal. Juicer is constituted by four modules: Template generation, Specifying labeling setting, Automatic extraction, Label assignment. Both experiments and practice show that Juicer successfully satisfied the requirements.
语种: 英语
内容类型: 会议论文
URI标识: http://ir.iscas.ac.cn/handle/311060/8478
Appears in Collections:中科院软件所图书馆_2009年期刊/会议论文

Files in This Item:

There are no files associated with this item.


Recommended Citation:
Guo Yan,Wang Yu,Ding Guodong,et al. juicer: scalable extraction for thread meta-information of web forum[C]. 见:Pacific Asia Workshop on Intelligence and Security Informatics, PAISI 2009. Bangkok, Thailand. April 27,.
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[Guo Yan]'s Articles
[Wang Yu]'s Articles
[Ding Guodong]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[Guo Yan]‘s Articles
[Wang Yu]‘s Articles
[Ding Guodong]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2017  中国科学院软件研究所 - Feedback
Powered by CSpace