DIGILIB FISIPOL UGM - YOGYAKARTA INDONESIA :: Learning, Passion, Knowledge, Empathy, Social Value and Digital Access::Parallel Mining Algorithm of Frequent Itemset Based on N-list and DiffNodeset Structure::-Technology Center for Human Social Computing-


dc.contributor.author	ZHANG Yang, WANG Rui, WU Guanfeng, LIU Hongyi
dc.contributor.other	1 School of Mathematics, Southwest Jiaotong University, Chengdu 611756, China; 2 National-Local Joint Engineering Laboratory of System Credibility Automatic Verification, Southwest Jiaotong University, Chengdu 611756, China; 3 Aerospace Internet of Things Technology Co., Ltd, Beijing 100094, China
dc.date.accessioned	2025-08-27T02:35:33Z
dc.date.accessioned	2025-10-08T08:22:44Z
dc.date.available	2025-10-08T08:22:44Z
dc.date.issued	01-11-2023
dc.identifier.uri	http://digilib.fisipol.ugm.ac.id/repo/handle/15717717/35661
dc.description.abstract	Frequent itemset mining is a basic problem in data mining and plays an important role in many data mining applications. To address the problems of the parallel frequent itemset mining algorithm (MrPrePost) in big data environments, such as degraded efficiency, unbalanced load across computing nodes, and redundant search, this paper proposes a parallel frequent itemset mining algorithm (PFIMND) based on N-list and DiffNodeset. First, drawing on the respective advantages of the N-list and DiffNodeset data structures, a data set sparsity estimation function (SE) is designed, and one of the two structures is selected to store the data according to the sparsity of the data set. Second, a computational estimation function (CE) is proposed to estimate the load of each item in the frequent 1-itemset list (F-list), and the items are grouped so that the load is evenly distributed according to computational cost. Finally, a set-enumeration tree is used as the search space. To avoid combinatorial explosion and redundant search, a superset pruning strategy and a pruning strategy based on breadth-first search are designed to generate the final mining results. Experimental results show that, compared with the similar algorithm HP-FIMND, the frequent itemset mining performance of PFIMND on the Susy dataset improves by 12.3%.
dc.language.iso	ZH
dc.publisher	Editorial office of Computer Science
dc.subject.lcc	Computer software
dc.title	Parallel Mining Algorithm of Frequent Itemset Based on N-list and DiffNodeset Structure
dc.type	Article
dc.description.keywords	frequent itemset | load estimation | MapReduce | sparse estimation | set-enumeration tree
dc.description.pages	55-61
dc.description.doi	10.11896/jsjkx.221000011
dc.title.journal	Jisuanji kexue
dc.identifier.oai	5faf6eccf1524d4183eb25c761be37b1
dc.journal.info	Volume 50, Issue 11
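
The load-grouping step mentioned in the abstract can be illustrated with a minimal sketch. This record does not give the paper's CE function or its grouping procedure, so the sketch assumes that per-item cost estimates are already available and swaps in a generic greedy heuristic (heaviest item first, placed into the currently lightest group). The function name `balance_groups` and the example costs are hypothetical and are not taken from the paper.

```python
import heapq


def balance_groups(costs, n_groups):
    """Greedy grouping: take items from heaviest to lightest and put each
    one into the group whose accumulated load is currently smallest.

    costs    -- dict mapping item -> estimated cost (assumed given; this
                stands in for the output of an estimator such as CE)
    n_groups -- number of groups (e.g. one per computing node)
    """
    # Min-heap of (current load, group index) so the lightest group pops first.
    heap = [(0, g) for g in range(n_groups)]
    heapq.heapify(heap)
    groups = [[] for _ in range(n_groups)]

    # Sort by descending cost; ties are broken by item name for determinism.
    for item, cost in sorted(costs.items(), key=lambda kv: (-kv[1], kv[0])):
        load, g = heapq.heappop(heap)
        groups[g].append(item)
        heapq.heappush(heap, (load + cost, g))

    loads = [sum(costs[i] for i in grp) for grp in groups]
    return groups, loads


# Hypothetical cost estimates for five frequent items.
example_costs = {"a": 9, "b": 7, "c": 6, "d": 5, "e": 3}
groups, loads = balance_groups(example_costs, 2)
print(groups, loads)
```

A greedy heuristic like this is simple and usually keeps group loads close to each other, but it is only one possible way to balance work; the published algorithm may group items differently.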

This item appears in the following Collection(s)

doaj
