ISCAS OpenIR  > 人机交互技术与智能信息处理实验室
maintaining only frequent itemsets to mine approximate frequent itemsets over online data streams
Wang Yongyan; Li Kun; Wang Hongan
2009
Conference NameIEEE Symposium on Computational Intelligence and Data Mining
Source2009 IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2009 - Proceedings
Conference DateMAR 30-APR
Conference PlaceNashville, TN
Publish Place345 E 47TH ST, NEW YORK, NY 10017 USA
Publisher2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING
ISBN978-1-4244-2765-9
DepartmentWang, Yongyan; Li, Kun; Wang, Hongan Chinese Acad Sci, Inst Software, Intelligence Engn Lab, Beijing, Peoples R China.
English AbstractMining frequent itemsets over online data streams, where the new data arrive and the old data will be removed with high speed, is a challenge for the computational complexity. Existing approximate mining algorithms suffer from explosive computational complexity when decreasing the error parameter, c, which is used to control the mining accuracy. We propose a new approximate mining algorithm using an approximate frequent itemset tree (abbreviated as AFI-tree), called AFI algorithm, to mine approximate frequent itemsets over online data streams. The AFI-tree based on prefix tree maintains only frequent itemsets, so the number of nodes in the tree is very small. All the infrequent child nodes of any frequent node are pruned and the maximal support of the pruned nodes is estimated to detect new frequent itemsets. In order to guarantee the mining accuracy, when the estimated maximal support of the pruned nodes is a bit more than the minimum support, their supports will be re-computed and the frequent nodes among them will be inserted into the AFI-tree. Experimental results show that the AFI algorithm consumes much less memory space than existing algorithms, and runs much faster than existing algorithms in most occasions.
KeywordAlgorithms Artificial Intelligence Computational Complexity Data Communication Systems Data Mining
SponsorshipIEEE
Content Type会议论文
URIhttp://ir.iscas.ac.cn/handle/311060/8318
Collection人机交互技术与智能信息处理实验室
Recommended Citation
GB/T 7714
Wang Yongyan,Li Kun,Wang Hongan. maintaining only frequent itemsets to mine approximate frequent itemsets over online data streams[C]. 345 E 47TH ST, NEW YORK, NY 10017 USA:2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING,2009.
Files in This Item:
File Name/Size DocType Version Access License
maintaining only fre(457KB) 开放获取--Application Full Text
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wang Yongyan]'s Articles
[Li Kun]'s Articles
[Wang Hongan]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wang Yongyan]'s Articles
[Li Kun]'s Articles
[Wang Hongan]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wang Yongyan]'s Articles
[Li Kun]'s Articles
[Wang Hongan]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.