This document outlines 13 tasks related to data mining concepts:
1) Explaining types of attributes and calculating similarity measures for binary vectors
2) Defining similarity, dissimilarity, and data mining tasks
3) Explaining data pre-processing, association analysis, support and confidence
4) Writing the Apriori algorithm for frequent itemsets and calculating similarity measures
5) Explaining candidate generation procedures and using Apriori to find frequent itemsets
6) Generating association rules from sample transaction data with minimum support and confidence
7) Stating the Apriori principle with an example and defining/computing a frequent pattern tree
This document outlines 13 tasks related to data mining concepts:
1) Explaining types of attributes and calculating similarity measures for binary vectors
2) Defining similarity, dissimilarity, and data mining tasks
3) Explaining data pre-processing, association analysis, support and confidence
4) Writing the Apriori algorithm for frequent itemsets and calculating similarity measures
5) Explaining candidate generation procedures and using Apriori to find frequent itemsets
6) Generating association rules from sample transaction data with minimum support and confidence
7) Stating the Apriori principle with an example and defining/computing a frequent pattern tree
This document outlines 13 tasks related to data mining concepts:
1) Explaining types of attributes and calculating similarity measures for binary vectors
2) Defining similarity, dissimilarity, and data mining tasks
3) Explaining data pre-processing, association analysis, support and confidence
4) Writing the Apriori algorithm for frequent itemsets and calculating similarity measures
5) Explaining candidate generation procedures and using Apriori to find frequent itemsets
6) Generating association rules from sample transaction data with minimum support and confidence
7) Stating the Apriori principle with an example and defining/computing a frequent pattern tree
2) Find SMC and Jaccard Co-Efficient of two binary vectors X=(1,0,0,0,0,0,0,0,0,0) Y=(0,0,0,0,0,0,1,0,0,1) 3) Define similarity and dissimilarity 4) State and explain various data mining tasks. 5) Explain data pre-processing 6) What is Association Analysis? Explain association rule, Support and Confidence. 7) Write Apriori Algorithm for frequent itemsets. 8) For the given factors x and y calculate cosine, correlation and jaccard. X=(1,1,0,1,0,1) Y=(1,1,1,0,0,1) 9) Explain several candidate generation procedures. 10) A database has five transactions. Let min-sup = 60% and min-conf = 80%
Find all frequent item sets using appriori
11) Generate association rules for the given data with support 50% and confidence 70%.
TID 100 200 300 400
Items {1,3,4} {2,3,5} {1,2,3,5} {2,5}
12) State and explain Apriori principle with an example.
13) Define a frequent pattern tree. Discuss the method of computing a FP-Tree, with an algorithm.