maintaining only frequent itemsets to mine approximate frequent itemsets over online data streams
Wang Yongyan; Li Kun; Wang Hongan
2009
会议名称IEEE Symposium on Computational Intelligence and Data Mining
会议录名称2009 IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2009 - Proceedings
会议日期MAR 30-APR
会议地点Nashville, TN
出版地345 E 47TH ST, NEW YORK, NY 10017 USA
出版者2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING
ISBN978-1-4244-2765-9
部门归属Wang, Yongyan; Li, Kun; Wang, Hongan Chinese Acad Sci, Inst Software, Intelligence Engn Lab, Beijing, Peoples R China.
摘要Mining frequent itemsets over online data streams, where the new data arrive and the old data will be removed with high speed, is a challenge for the computational complexity. Existing approximate mining algorithms suffer from explosive computational complexity when decreasing the error parameter, c, which is used to control the mining accuracy. We propose a new approximate mining algorithm using an approximate frequent itemset tree (abbreviated as AFI-tree), called AFI algorithm, to mine approximate frequent itemsets over online data streams. The AFI-tree based on prefix tree maintains only frequent itemsets, so the number of nodes in the tree is very small. All the infrequent child nodes of any frequent node are pruned and the maximal support of the pruned nodes is estimated to detect new frequent itemsets. In order to guarantee the mining accuracy, when the estimated maximal support of the pruned nodes is a bit more than the minimum support, their supports will be re-computed and the frequent nodes among them will be inserted into the AFI-tree. Experimental results show that the AFI algorithm consumes much less memory space than existing algorithms, and runs much faster than existing algorithms in most occasions.
关键词Algorithms Artificial Intelligence Computational Complexity Data Communication Systems Data Mining
主办者IEEE
内容类型会议论文
URI标识http://ir.iscas.ac.cn/handle/311060/8318
专题人机交互技术与智能信息处理实验室
推荐引用方式
GB/T 7714
Wang Yongyan,Li Kun,Wang Hongan. maintaining only frequent itemsets to mine approximate frequent itemsets over online data streams[C]. 345 E 47TH ST, NEW YORK, NY 10017 USA:2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING,2009.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
maintaining only fre(457KB) 开放获取--请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Wang Yongyan]的文章
[Li Kun]的文章
[Wang Hongan]的文章
百度学术
百度学术中相似的文章
[Wang Yongyan]的文章
[Li Kun]的文章
[Wang Hongan]的文章
必应学术
必应学术中相似的文章
[Wang Yongyan]的文章
[Li Kun]的文章
[Wang Hongan]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。