Professional Documents
Culture Documents
A New DataStructure For Finding Maximum
A New DataStructure For Finding Maximum
{
Do
Figure 1 Ordered Tree. {
Figure 1 shows the Ordered Tree for locating maximum if (single path is obtained)
frequent Itemset.it consist of 26 path for every alphabetical {
letter, thus it also known as multipath tree. Structure of a if (No element is found)
Ordered Tree such that label of root node is null and the child {Stop algorithm;
node start sequentially and end with letter Z.Node of every Return (empty frequent set)
path decreases one by one with the increasing path of a }
Ordered Tree. else
{
B. Proposed Algorithm Traverse all elements from Lattice l;
Using continues association rule;
It is online algorithm, thus it required one pass to find frequent Find frequent item set
sets. }
Algorithm for finding maximum frequent item set as well as }
user specified frequent item set shown in Figure 2. else
{
Maximum Frequent Item Set Generator Traverse all multipath one by one;
Input: Data Stream. Find all frequent item set using continues association rules. }
Output: Frequent Item Set, Maximum Frequent Item Set, User }}
Specified Frequent Set. Finding Maximum Frequent Item Set
Insertion Procedure Ordered Tree(T , Node ,E) {
{ From Lattice L[e];
Root Node== Null; After finding frequent item set;
Input Transaction T from database. Search each element from node.
Lattice L[]; i=1;
While(Transaction T!=empty ) While(Max.Frequent Item Set !=found)
do {
{ Max Chain=Linear search L[i];
For each transaction i++;
Sort Elements e using insertion sort }
Sort e; Return(maximum frequent item set);
Insert L[e]; }
If(child node==Null ) User Specified Frequent Item Set
{ {
} else From Lattice L[e];
if(child node!=null) User Input is applied;
{ While(Lattice !=empty)
If(Child node follow same path) {
{ Using linear Search in frequent itemset;
Insert L[e]; If (element= =L[e])
Support ++; }
}else Return (user specified maximum frequent set)
Created new node; }
Insert L[e];
} Figure 2 Algorithm for Maximum Frequent Item Set In Online Data Mining.
}
Pruning Procedure Ordered Tree (E, Support) Finding maximum frequent item set in online data mining the
{ New Data Structure works as follows.
Using previous process find unique elements and their Apply the sorting in each transaction. After that, sort element
support values in Lattice L. insert into Ordered Tree according to their alphabetical order
while(given support>Element support) if encoded in alphabets.
do In a Ordered Tree same element of a different path connected
{ to each other, so finding frequent itemsets will be an easier
e1=l[e]; process.
Delete Element’s from Lattice L [ ] whose support value is
less than given support value; TABLE 1 VARIOUS TRANSACTION
l[e--];
}
}
Frequent Set Mining (Node, L[e])
{
While (All Path of Lattice is not traverse)
IEEE International Conference on Computer, Communication and Control (IC4-2015).
Applying sorting to every transaction, which shows in table 2. S.N. Dataset Name Number of Size In
Transaction Kb
TABLE 2 SORT TRANSACTION 1 Accident 1101 27
Tid Transactions 2 Kosarak 2774 82
3 RetailSet 505 12
1 ABCDE
2 BCDE B. RESULTS
3 CDE This section present and analyze the results on the Accident,
4 AE Kosarak and RetailSet dataset for mining time and memory
usage for finding maximum frequent item set in online data
Insert transaction, one by one into Ordered Tree depend upon mining.
its prefix structure.
When same value will be repeated, increment the value by one 1) Mining Time
in Ordered Tree. How much time an algorithm take to complete whole process
to find frequent itemset from a data stream known as mining
time. Various dataset used to compare mining time of estDec
and proposed algorithm.
TABLE 4 MINING TIME BASED ON ACCIDENT D ATASET
S.N. Support EstDec Proposed
1 0.2 355 185
2 0.3 116 86
3 0.4 67 32
4 0.5 60 7
5 0.6 50 4
6 0.7 41 4
7 0.8 38 4
8 0.9 36 4
2) Memory Consumption
1 0.2 13 12
2 0.3 16 15
3 0.4 18 12
4 0.5 14 14
5 0.6 17 10
6 0.7 13 13
7 0.8 15 15
8 0.9 17 12
REFERENCES
[1] J. Chang and W. Lee. Finding Recent Frequent Item sets Adaptively over
Online Data Streams. In Proc. of the 9 th ACM SIGKDD International
Conference & Data Mining (KDD-2003), 2003.
[2] R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules.
In Conf. of the 20thVLDB conference, pages 487-499, 1994.
[3] Han.J, Pei.J, and Yin. Y. “Mining frequent patterns without candidate
generation”. In Proc. ACM-SIGMOD Int’l Conf. Management of Data
(SIGMOD), 2000.
[4] C. Borgelt. “An Implementation of the FP- growth Algorithm”. Proc.
Workshop Open Software for Data Mining, 1–5.ACMPress, New Yo rk, NY,
USA 2005.
[5] C. Hidber. Online association rule mining. In Proc. of the ACM SIGMOD
Int'l Conference on Management of Data, pages 145-156, Philadelphia, PA,
May 1999.
[6]Asuncion, A & Newman, and D.J. “UCI Learning
Repository”,http://www.ics.uci.edu/mlearn/MLRepository.html, CA:
University of California, Department of Information and Computer Science.
2007.