Test Mining

This document contains 5 questions related to data mining algorithms and techniques: 1) The Levenshtein distance algorithm has been used for string matching and data cleaning. 2) The Apriori algorithm can be used to find frequent itemsets in a transactional dataset and association rule mining. 3) The k-means clustering algorithm can partition a set of data points into k number of clusters based on minimizing distances between points and cluster centers. 4) A decision tree classification algorithm like CART can identify the most important attribute to use as the root node of the decision tree for a given dataset. 5) Vector normalization and calculating the dot product distance between two normalized vectors can be used for text mining tasks like document

Uploaded by waleed


Q1: The Levenshtein distance algorithm has been used in:

(1 mark)
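As background for Q1, the Levenshtein distance counts the minimum number of single-character insertions, deletions, and substitutions needed to turn one string into another. The following is a minimal sketch of the standard dynamic-programming formulation; the function name `levenshtein` and the sample strings are illustrative choices, not part of the exam.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the edit distance between a and b (insert, delete, substitute)."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


print(levenshtein("kitten", "sitting"))  # 3
```

A small distance between two strings suggests they may be variants of the same value, which is why the measure shows up in approximate string matching and data cleaning.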

Q2: Association: Using the Apriori algorithm, show how frequent itemsets can be found in
the following data set, where the minimum support count is 2.

TID   Items bought
T1    computer, mouse, camera
T2    mouse, printer
T3    mouse, keyboard
T4    computer, mouse, printer
T5    computer, keyboard
T6    mouse, keyboard
T7    computer, keyboard
T8    computer, mouse, keyboard, camera
T9    computer, mouse, keyboard

You should explain what happens at each step along with the data produced.

(4 marks)
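The level-wise search asked for in Q2 can be checked with a short script. This is a sketch under stated assumptions rather than a model answer: the helper names `support_count` and `apriori` are hypothetical, and the join step simply unions pairs of frequent (k-1)-itemsets before pruning, which is simpler than the prefix-based join described in textbooks but yields the same candidates after pruning.

```python
from itertools import combinations

# Transactions T1..T9 from the question, each as a set of items.
transactions = [
    {"computer", "mouse", "camera"},
    {"mouse", "printer"},
    {"mouse", "keyboard"},
    {"computer", "mouse", "printer"},
    {"computer", "keyboard"},
    {"mouse", "keyboard"},
    {"computer", "keyboard"},
    {"computer", "mouse", "keyboard", "camera"},
    {"computer", "mouse", "keyboard"},
]
MIN_SUPPORT = 2  # minimum support count given in the question


def support_count(itemset, transactions):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)


def apriori(transactions, min_support):
    """Return a list of dicts; entry k-1 maps each frequent k-itemset to its count."""
    items = {item for t in transactions for item in t}
    candidates = {frozenset([item]) for item in items}
    levels = []
    k = 1
    while candidates:
        # Scan step: count each candidate and keep those meeting min_support.
        counts = {c: support_count(c, transactions) for c in candidates}
        frequent = {c: n for c, n in counts.items() if n >= min_support}
        if not frequent:
            break
        levels.append(frequent)
        k += 1
        prev = set(frequent)
        # Join step: union pairs of frequent (k-1)-itemsets that give a k-itemset.
        joined = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
    return levels


levels = apriori(transactions, MIN_SUPPORT)
for k, frequent in enumerate(levels, start=1):
    print(f"L{k}:")
    for itemset, count in sorted(frequent.items(), key=lambda kv: sorted(kv[0])):
        print("  ", sorted(itemset), count)
```

On this data set the search stops after three levels: five frequent 1-itemsets, six frequent 2-itemsets, and two frequent 3-itemsets, {computer, mouse, keyboard} and {computer, mouse, camera}, each with support count 2. The only 4-itemset that can be joined is pruned because {computer, keyboard, camera} is not frequent.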

(Option) Q3: (Clustering) Suppose that the data mining task is to cluster the following
eight points (with (x, y) representing location) into three clusters:
A1(2, 10), A2(2, 5), A3(8, 4), B1(5, 8), B2(7, 5), B3(6, 4), C1(1, 2), C2(4, 9).
The distance function is Euclidean distance. Suppose initially we assign A1, B1, and C1 as
the center of each cluster, respectively. Use the k-means algorithm to show only
(a) the three cluster centers after the first round of execution, and
(b) the final three clusters.
Q3: (Clustering) What would be the distance matrix
after each of the first three mergers if complete-link
clustering is used?

(4 marks)
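For the k-means variant of Q3, the two requested results can be verified with a plain-Python sketch. The function names `assign`, `update`, and `kmeans` are hypothetical, the loop stops when the assignment no longer changes, and the sketch assumes no cluster ever becomes empty, which holds for this data.

```python
import math

# The eight points from the question.
points = {
    "A1": (2, 10), "A2": (2, 5), "A3": (8, 4),
    "B1": (5, 8), "B2": (7, 5), "B3": (6, 4),
    "C1": (1, 2), "C2": (4, 9),
}
# Initial centers as stated in the question: A1, B1, C1.
initial_centers = [points["A1"], points["B1"], points["C1"]]


def assign(points, centers):
    """Put each point in the cluster of its nearest center (Euclidean distance)."""
    clusters = [[] for _ in centers]
    for name, p in points.items():
        nearest = min(range(len(centers)), key=lambda i: math.dist(p, centers[i]))
        clusters[nearest].append(name)
    return clusters


def update(points, clusters):
    """Recompute each center as the mean of the points in its cluster."""
    return [
        tuple(sum(points[n][d] for n in cluster) / len(cluster) for d in range(2))
        for cluster in clusters
    ]


def kmeans(points, centers):
    """Run rounds until the assignment is stable; return (clusters, centers) per round."""
    history = []
    previous = None
    while True:
        clusters = assign(points, centers)
        centers = update(points, clusters)
        history.append((clusters, centers))
        if clusters == previous:
            return history
        previous = clusters


history = kmeans(points, initial_centers)
print("Centers after round 1:", history[0][1])
print("Final clusters:", history[-1][0])
```

Under these assumptions the centers after the first round are (2, 10), (6, 6), and (1.5, 3.5), and the final clusters are {A1, B1, C2}, {A3, B2, B3}, and {A2, C1}.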
Q4: Classification: Given the following data set:-

[Data set table: rows 1 to 7; attribute values missing from the source]
Using the CART algorithm, which one of the above three attributes will be the root of the
decision tree?
(4 marks)

Q5: Text Mining: Normalize the vectors (20, 10, 8, 12, 56) and (0, 15, 12, 8, 0). Calculate
the distance between the two normalized vectors using the dot product formula.

(4 marks)
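The arithmetic in Q5 can be checked with a few lines of Python. This sketch assumes "normalize" means dividing each vector by its Euclidean length, so the dot product of the normalized vectors is their cosine similarity. Reporting 1 minus that value as a distance is one common convention and is an assumption here, since the question does not say which form it expects. The helper names `normalize` and `dot` are illustrative.

```python
import math


def normalize(v):
    """Scale v to unit Euclidean length."""
    length = math.sqrt(sum(x * x for x in v))
    return [x / length for x in v]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


u = [20, 10, 8, 12, 56]
v = [0, 15, 12, 8, 0]

norm_u = math.sqrt(dot(u, u))  # sqrt(3844) = 62
norm_v = math.sqrt(dot(v, v))  # sqrt(433), about 20.81

u_hat = normalize(u)
v_hat = normalize(v)

similarity = dot(u_hat, v_hat)    # 342 / (62 * sqrt(433)), roughly 0.265
cosine_distance = 1 - similarity  # assumed distance convention

print(round(similarity, 4), round(cosine_distance, 4))
```

The raw dot product is 20·0 + 10·15 + 8·12 + 12·8 + 56·0 = 342, so the similarity of the normalized vectors works out to about 0.265.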
