A PROJECT REPORT
Submitted by
ANTONY JAYASEELAN.G
MUTHU KUMARAN.D
PRAVVEEN.G
RAKESH.R
of
BACHELOR OF ENGINEERING
IN
BONAFIDE CERTIFICATE
SIGNATURE
Dr.P.Palaniswamy, M.Tech(IIT-M), Ph.D(IISc)
HEAD OF THE DEPARTMENT
Computer Science & Engineering
Saveetha Engineering College,
Saveetha Nagar,
Thandalam,
Chennai – 602 105

SIGNATURE
Mr.Mohana Prakash.T.A, B.E
SUPERVISOR
LECTURER
Computer Science & Engineering
Saveetha Engineering College,
Saveetha Nagar,
Thandalam,
Chennai – 602 105
INTERNAL EXAMINER EXTERNAL EXAMINER
ACKNOWLEDGMENT
We would also like to thank our Project Coordinator Mr.Sridharan.K for his
support during the entire course of this project work.
We also thank all the staff members and technicians of our college for their
help in making this project a success.
ABSTRACT
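Efficient algorithms for mining frequent itemsets are crucial for mining
association rules as well as for many other data mining tasks. Methods for mining
frequent itemsets have been implemented using a prefix-tree structure, known as an
FP-tree, for storing compressed information about frequent itemsets. Numerous
experimental results have demonstrated that these algorithms perform extremely
well.
In this paper, we present a novel FP-array technique that greatly reduces the
need to traverse FP-trees, thus obtaining significantly improved performance for
FP-tree-based algorithms. Our technique works especially well for sparse data sets.
Furthermore, we present new algorithms for mining all, maximal, and closed
frequent itemsets. Our algorithms use the FP-tree data structure in combination with
the FP-array technique efficiently and incorporate various optimization techniques.
Even though the algorithms consume much memory when the data sets are sparse,
they are still the fastest ones when the minimum support is low. Moreover, they are
always among the fastest algorithms and consume less memory than other methods
when the data sets are dense.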
TABLE OF CONTENTS
CHAPTER.NO TITLE
ABSTRACT
LIST OF FIGURES
LIST OF ABBREVIATIONS
1. INTRODUCTION
2. LITERATURE REVIEW
2.1 EXISTING SYSTEM
2.2 PROPOSED SYSTEM
2.3 PROBLEM FORMULATION
3. SYSTEM REQUIREMENTS
3.2 PLATFORM
3.2.1 Software Requirements
3.2.2 Hardware Requirements
4. SYSTEM DESIGN
3.3 PROJECT DESCRIPTION
3.4 ALGORITHM
3.4.1 FP-growth
3.4.2 FP-max
3.4.3 CFI-tree & FP-close
5. IMPLEMENTATION
5.1 CODING
5.2 TESTING
APPENDICES
REFERENCES
LIST OF FIGURES
FIGURE NO. TITLE
2.3.d ER DIAGRAM
2.3.d FP GROWTH
LIST OF ABBREVIATIONS
FI Frequent Itemset
MFI Maximal Frequent Itemset
CFI Closed Frequent Itemset
FP Frequent Pattern
FP-MAX Frequent Pattern Maximum
FP-CLOSE Frequent Pattern Closed
J2EE Java 2 Platform, Enterprise Edition
AWT Abstract Window Toolkit
API Application Programming Interface
JDBC Java Database Connectivity
DSN Data Source Name