朱志国,邓贵仕.挖掘频繁波动的Web访问模式算法研究[J].,2009,(2):282-287 |
挖掘频繁波动的Web访问模式算法研究 |
Algorithm research on mining frequent fluctuating Web access pattern |
|
DOI:10.7511/dllgxb200902022 |
中文关键词: 数据挖掘 Web使用挖掘 Web访问模式 动态数据挖掘 |
英文关键词: data mining Web usage mining Web access pattern dynamic data mining |
基金项目:国家自然科学基金资助项目(70671016). |
|
摘要点击次数: 1103 |
全文下载次数: 753 |
中文摘要: |
考虑到Web访问数据的动态特性,给出了一个从Web访问日志历史演变中挖掘频繁波动的Web访问模式的方法.首先采用无序树结构表示用户历史访问页面序列集合,然后给出了频繁波动Web访问模式的详细定义以及挖掘算法描述,最后,根据数据集中访问序列的大小和数量变化对于算法扩展性和性能的影响进行了实验.结果表明,该算法具备良好扩展性的同时,能够比较高效地提取出频繁波动的Web访问模式. |
英文摘要: |
Considering the dynamic feature of Web access data, a method for mining frequent fluctuating Web access pattern (FF-WAP) from historical change of Web users′ access sequence (WAS) data is presented. Firstly, unordered tree structure is adopted to represent historical WAS sets and then the detailed definition and mining algorithm description of FF-WAP are presented. Finally, according to the number of WAS and the average size of each WAS in data sets, experiments to analyze scalability and performance of the algorithm are conducted. The results show that the algorithm has better scalability and can extract frequent fluctuating Web access patterns efficiently. |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |