ISCAS OpenIR
a general framework to encode heterogeneous information sources for contextual pattern mining
Dong Weishan; Fan Wei; Shi Leib; Zhou Changjin; Yan Xifeng
2012
Conference Name21st ACM International Conference on Information and Knowledge Management, CIKM 2012
SourceACM International Conference Proceeding Series
Pages65-74
Conference DateOctober 29, 2012 - November 2, 2012
Conference PlaceMaui, HI, United states
Indexed TypeEI
ISBN9781450311564
Department(1) IBM Research China; (2) Huawei Noah's Ark Lab. China; (3) Institute of Software Chinese Academy of Sciences China; (4) University of California Santa Barbara CA United States
English AbstractTraditional pattern mining methods usually work on single data sources. However, in practice, there are often multiple and heterogeneous information sources. They collectively provide contextual information not available in any single source alone describing the same set of objects, and are useful for discovering hidden contextual patterns. One important challenge is to provide a general methodology to mine contextual patterns easily and efficiently. In this paper, we propose a general framework to encode contextual information from multiple sources into a coherent representation - -Contextual Information Graph (CIG). The complexity of the encoding scheme is linear in both time and space. More importantly, CIG can be handled by any single-source pattern mining algorithms that accept taxonomies without any modification. We demonstrate by three applications of the contextual association rule, sequence and graph mining, that contextual patterns providing rich and insightful knowledge can be easily discovered by the proposed framework. It enables Contextual Pattern Mining (CPM) by reusing single-source methods, and is easy to deploy and use in real-world systems. © 2012 ACM.; Traditional pattern mining methods usually work on single data sources. However, in practice, there are often multiple and heterogeneous information sources. They collectively provide contextual information not available in any single source alone describing the same set of objects, and are useful for discovering hidden contextual patterns. One important challenge is to provide a general methodology to mine contextual patterns easily and efficiently. In this paper, we propose a general framework to encode contextual information from multiple sources into a coherent representation - -Contextual Information Graph (CIG). The complexity of the encoding scheme is linear in both time and space. More importantly, CIG can be handled by any single-source pattern mining algorithms that accept taxonomies without any modification. We demonstrate by three applications of the contextual association rule, sequence and graph mining, that contextual patterns providing rich and insightful knowledge can be easily discovered by the proposed framework. It enables Contextual Pattern Mining (CPM) by reusing single-source methods, and is easy to deploy and use in real-world systems. © 2012 ACM.
KeywordAlgorithms Data Mining Knowledge Management
SponsorshipSpecial Interest Group on Information Retrieval (ACM SIGIR); ACM SIGWEB
Language英语
Content Type会议论文
URIhttp://ir.iscas.ac.cn/handle/311060/15824
Collection中国科学院软件研究所
Recommended Citation
GB/T 7714
Dong Weishan,Fan Wei,Shi Leib,et al. a general framework to encode heterogeneous information sources for contextual pattern mining[C],2012:65-74.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Dong Weishan]'s Articles
[Fan Wei]'s Articles
[Shi Leib]'s Articles
Baidu academic
Similar articles in Baidu academic
[Dong Weishan]'s Articles
[Fan Wei]'s Articles
[Shi Leib]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Dong Weishan]'s Articles
[Fan Wei]'s Articles
[Shi Leib]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.