ISCAS OpenIR
a general framework to encode heterogeneous information sources for contextual pattern mining
Dong Weishan; Fan Wei; Shi Leib; Zhou Changjin; Yan Xifeng
2012
会议名称21st ACM International Conference on Information and Knowledge Management, CIKM 2012
会议录名称ACM International Conference Proceeding Series
页码65-74
会议日期October 29, 2012 - November 2, 2012
会议地点Maui, HI, United states
收录类别EI
ISBN9781450311564
部门归属(1) IBM Research China; (2) Huawei Noah's Ark Lab. China; (3) Institute of Software Chinese Academy of Sciences China; (4) University of California Santa Barbara CA United States
摘要Traditional pattern mining methods usually work on single data sources. However, in practice, there are often multiple and heterogeneous information sources. They collectively provide contextual information not available in any single source alone describing the same set of objects, and are useful for discovering hidden contextual patterns. One important challenge is to provide a general methodology to mine contextual patterns easily and efficiently. In this paper, we propose a general framework to encode contextual information from multiple sources into a coherent representation - -Contextual Information Graph (CIG). The complexity of the encoding scheme is linear in both time and space. More importantly, CIG can be handled by any single-source pattern mining algorithms that accept taxonomies without any modification. We demonstrate by three applications of the contextual association rule, sequence and graph mining, that contextual patterns providing rich and insightful knowledge can be easily discovered by the proposed framework. It enables Contextual Pattern Mining (CPM) by reusing single-source methods, and is easy to deploy and use in real-world systems. © 2012 ACM.; Traditional pattern mining methods usually work on single data sources. However, in practice, there are often multiple and heterogeneous information sources. They collectively provide contextual information not available in any single source alone describing the same set of objects, and are useful for discovering hidden contextual patterns. One important challenge is to provide a general methodology to mine contextual patterns easily and efficiently. In this paper, we propose a general framework to encode contextual information from multiple sources into a coherent representation - -Contextual Information Graph (CIG). The complexity of the encoding scheme is linear in both time and space. More importantly, CIG can be handled by any single-source pattern mining algorithms that accept taxonomies without any modification. We demonstrate by three applications of the contextual association rule, sequence and graph mining, that contextual patterns providing rich and insightful knowledge can be easily discovered by the proposed framework. It enables Contextual Pattern Mining (CPM) by reusing single-source methods, and is easy to deploy and use in real-world systems. © 2012 ACM.
关键词Algorithms Data Mining Knowledge Management
主办者Special Interest Group on Information Retrieval (ACM SIGIR); ACM SIGWEB
语种英语
内容类型会议论文
URI标识http://ir.iscas.ac.cn/handle/311060/15824
专题中国科学院软件研究所
推荐引用方式
GB/T 7714
Dong Weishan,Fan Wei,Shi Leib,et al. a general framework to encode heterogeneous information sources for contextual pattern mining[C],2012:65-74.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Dong Weishan]的文章
[Fan Wei]的文章
[Shi Leib]的文章
百度学术
百度学术中相似的文章
[Dong Weishan]的文章
[Fan Wei]的文章
[Shi Leib]的文章
必应学术
必应学术中相似的文章
[Dong Weishan]的文章
[Fan Wei]的文章
[Shi Leib]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。