An Algorithm for Mining High Utility Itemsets with Vertical Structures
Received: 2016-11-18  Revised: 2017-01-10
Keywords: data mining  association analysis (association rules)  frequent itemsets  high utility itemsets
Huang Kun (黄坤)*  China Ship Development and Design Center, 430064
Wu Yujia (吴玉佳)  School of Computer Science, Wuhan University
      Mining high utility itemsets (HUIs) is a popular task in association analysis. Most HUI mining algorithms generate a large number of candidate itemsets (CIs), which degrades their performance. HUI-Miner can mine all HUIs from a transaction database without generating CIs. However, it generates a large number of utility lists (ULs), which consume considerable storage space and slow down mining. To solve this problem, this paper proposes itemset lists (ILs), a new data structure that maintains transaction and item utility information. Three strategies are proposed to reduce the number of ILs, and the ILs can be built with a single scan of the transaction database. A new algorithm, MHUI, is proposed that mines all HUIs directly from the ILs without generating any CIs. Experimental results show that the proposed method outperforms state-of-the-art algorithms in runtime and memory consumption on three sparse datasets.
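
For readers new to the task, the following minimal Python sketch illustrates what a high utility itemset is. It is not the MHUI algorithm and does not build ILs or ULs, whose details the abstract does not give. It uses the standard definition (the utility of an itemset is the sum of quantity times unit profit over the transactions that contain every item in the set) and enumerates all itemsets by brute force. The toy database, the profit table, the threshold, and the function names are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"a": 1, "b": 1, "c": 3},
]
# Hypothetical unit profit (external utility) of each item.
unit_profit = {"a": 5, "b": 2, "c": 1}


def itemset_utility(itemset, transactions, unit_profit):
    """Sum the utility of `itemset` over transactions containing all its items."""
    total = 0
    for tx in transactions:
        if all(item in tx for item in itemset):
            total += sum(tx[item] * unit_profit[item] for item in itemset)
    return total


def brute_force_huis(transactions, unit_profit, min_util):
    """Enumerate every itemset and keep those with utility >= min_util.

    Exponential in the number of items; for illustration only.
    """
    items = sorted({item for tx in transactions for item in tx})
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = itemset_utility(itemset, transactions, unit_profit)
            if utility >= min_util:
                result[itemset] = utility
    return result


huis = brute_force_huis(transactions, unit_profit, min_util=15)
print(huis)  # {('a',): 20, ('a', 'b'): 16, ('a', 'c'): 19}
```

In this toy data, {b} has utility 6 and falls below the threshold, yet its superset {a, b} has utility 16 and qualifies. Utility is therefore not anti-monotone, which is why HUI mining cannot simply reuse the support-based pruning of frequent itemset mining.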