Exam2018 2019

UNIVERSITY OF MOHAMED BOUDIAF – M’SILA

FACULTY OF MATHEMATICS AND INFORMATICS

DEPARTMENT OF COMPUTER SCIENCE
2nd year Master (IDO)
_________

Time duration: 1h:30m - Biannual Exam of Information Retrieval & Data Mining - University year: 2018/2019
By Dr. B. LOUNNAS

Exercise 01: Course question (06pt)

1. Does the indexing of data play any role in the information retrieval process, and why? (1pt)
2. In which cases do we use multilevel indices? (1pt)
3. What is the difference between information retrieval and data mining? (0.5pt)
4. One of the differences between exact and best matching is: (0.5pt)
a. Exact matching: the query specifies precise retrieval criteria.
b. Best matching: the query describes retrieval criteria for the desired documents.
What does that mean?
5. Why did we move from the term-document incidence matrix to the inverted index? (1pt)
6. What is the difference between the second and third phases of CRISP-DM (Data
understanding and Data preparation)? (1pt)
7. We have three attributes: Age, Salary, and Position. After calculating the information gain, we found
that the Salary attribute is the best choice for the root.
The question is: if you did not choose Salary as the root and used Age instead, would your
decision tree give false results or not? (1pt)

Exercise 02: Information Retrieval Models (08pt)

1. We have the following term-document incidence matrix: (3pt)

- What is the result of the following query: (Brutus OR Caesar) AND NOT (Antony OR Cleopatra)?
- Complete the values of Calpurnia based on the following:
  o The document Julius Caesar mentions the word Calpurnia 156 times.
  o The documents The Tempest, Antony and Cleopatra, Hamlet, Othello, and Macbeth never
    mention the word Calpurnia.
- After completing the values of Calpurnia, and assuming that those terms are the only ones, which
  document is irrelevant for this retrieval system?
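
The incidence table for this question is not reproduced in this copy. As a minimal sketch of how such a Boolean query can be evaluated, the code below assumes the well-known Shakespeare incidence values from the textbook "Introduction to Information Retrieval" (Manning, Raghavan, Schütze), restricted to the five terms named in the question, with the Calpurnia row filled in as described above (present only in Julius Caesar). The matrix values and all function names are assumptions for illustration, not the exam's own data.

```python
# Documents, in the column order of the assumed incidence matrix.
DOCS = ["Antony and Cleopatra", "Julius Caesar", "The Tempest",
        "Hamlet", "Othello", "Macbeth"]

# ASSUMPTION: 0/1 incidence values from the standard textbook example,
# not from the exam sheet (its table is missing in this copy).
INCIDENCE = {
    "Antony":    [1, 1, 0, 0, 0, 1],
    "Brutus":    [1, 1, 0, 1, 0, 0],
    "Caesar":    [1, 1, 0, 1, 1, 1],
    "Calpurnia": [0, 1, 0, 0, 0, 0],  # only Julius Caesar mentions Calpurnia
    "Cleopatra": [1, 0, 0, 0, 0, 0],
}


def v_or(a, b):
    """Element-wise OR of two 0/1 vectors."""
    return [x | y for x, y in zip(a, b)]


def v_and(a, b):
    """Element-wise AND of two 0/1 vectors."""
    return [x & y for x, y in zip(a, b)]


def v_not(a):
    """Element-wise complement of a 0/1 vector."""
    return [1 - x for x in a]


def evaluate_query(matrix):
    """Evaluate (Brutus OR Caesar) AND NOT (Antony OR Cleopatra)."""
    wanted = v_or(matrix["Brutus"], matrix["Caesar"])
    excluded = v_or(matrix["Antony"], matrix["Cleopatra"])
    hits = v_and(wanted, v_not(excluded))
    return [doc for doc, bit in zip(DOCS, hits) if bit]


if __name__ == "__main__":
    print(evaluate_query(INCIDENCE))
```

With the exam's own table the selected documents may differ; only the bitwise evaluation scheme carries over.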

2. Describe graphically how the merge algorithm works on the following inverted indices: (2pt)

(Note: on the example above, the merge algorithm returns 2 and 31 in linear time, O(n).)
- Write the algorithm.
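
The postings lists for this question are not reproduced in this copy. One way to sketch the linear-time merge (intersection) of two sorted postings lists is shown below. The two example lists are an assumption, borrowed from the standard textbook example because their intersection matches the stated result (2 and 31); the function name `intersect` is likewise hypothetical.

```python
def intersect(p1, p2):
    """Intersect two postings lists sorted by increasing document ID.

    Each step advances at least one pointer, so the running time is
    O(len(p1) + len(p2)).
    """
    answer = []
    i, j = 0, 0
    while i < len(p1) and j < len(p2):
        if p1[i] == p2[j]:
            # The document appears in both lists: keep it, advance both.
            answer.append(p1[i])
            i += 1
            j += 1
        elif p1[i] < p2[j]:
            # The smaller ID cannot occur later in the other list.
            i += 1
        else:
            j += 1
    return answer


if __name__ == "__main__":
    # ASSUMPTION: example postings lists, not taken from the exam sheet.
    brutus = [1, 2, 4, 11, 31, 45, 173, 174]
    calpurnia = [2, 31, 54, 101]
    print(intersect(brutus, calpurnia))
```

The sketch relies on both lists being sorted; on unsorted input the pointer-advancing step would skip matches.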

3. Consider the following table of raw term-frequency count vectors (tf_raw) for three documents and a query: (3pt)

Term   Doc 1   Doc 2   Doc 3   Query

Two    2       0       0       0
Tea    2       2       0       1
Me     0       1       2       1
You    0       1       2       0

- Calculate the idf (inverse document frequency) of each word appearing in the three
  documents.
- What is the result of the query (Tea me) using the ntc.nnn SMART notation?
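
A minimal sketch of how the two sub-questions above can be checked numerically. It assumes the usual reading of the SMART notation: documents use ntc (natural tf, idf weighting, cosine normalization) and the query uses nnn (natural tf, no idf, no normalization). It also assumes idf = log10(N / df); the logarithm base is an assumption, although the final scores do not depend on it because cosine normalization cancels the constant factor. All names are hypothetical.

```python
import math

TERMS = ["Two", "Tea", "Me", "You"]

# Raw term frequencies from the table, one vector per document (order = TERMS).
DOC_TF = {
    "Doc 1": [2, 2, 0, 0],
    "Doc 2": [0, 2, 1, 1],
    "Doc 3": [0, 0, 2, 2],
}
QUERY_TF = [0, 1, 1, 0]  # query "Tea me", nnn: raw counts used as-is


def idf_weights(doc_tf):
    """idf_t = log10(N / df_t), where df_t counts documents containing t."""
    n_docs = len(doc_tf)
    weights = []
    for i in range(len(TERMS)):
        df = sum(1 for tf in doc_tf.values() if tf[i] > 0)
        weights.append(math.log10(n_docs / df))
    return weights


def ntc_vector(tf, idf):
    """Natural tf times idf, then cosine (unit-length) normalization."""
    weighted = [t * w for t, w in zip(tf, idf)]
    norm = math.sqrt(sum(x * x for x in weighted))
    return [x / norm for x in weighted]


def score_documents(doc_tf, query_tf):
    """Dot product of each ntc document vector with the nnn query vector."""
    idf = idf_weights(doc_tf)
    return {
        name: sum(d * q for d, q in zip(ntc_vector(tf, idf), query_tf))
        for name, tf in doc_tf.items()
    }


if __name__ == "__main__":
    for term, weight in zip(TERMS, idf_weights(DOC_TF)):
        print(f"idf({term}) = {weight:.4f}")
    scores = score_documents(DOC_TF, QUERY_TF)
    for name in sorted(scores, key=scores.get, reverse=True):
        print(f"{name}: {scores[name]:.4f}")
```

Under these assumptions the documents rank Doc 2, then Doc 3, then Doc 1.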

Exercise 03: Decision tree (06pt)

Imagine you only ever do four things at the weekend: go shopping, watch a movie, play tennis, or just stay
in. What you do depends on three things: the weather (windy, rainy, or sunny), how much money you
have (rich or poor), and whether your parents are visiting.

Weekend   Weather   Parents   Money   Decision

W1        Sunny     Yes       Rich    Cinema
W2        Sunny     No        Rich    Tennis
W3        Windy     Yes       Rich    Cinema
W4        Rainy     Yes       Poor    Cinema
W5        Rainy     No        Rich    Stay in
W6        Rainy     Yes       Poor    Cinema
W7        Windy     No        Poor    Cinema
W8        Windy     No        Rich    Shopping
W9        Windy     Yes       Rich    Cinema

- Calculate the entropy of this collection of training examples. (2.5pt)

- Calculate the information gains of Weather, Parents, and Money relative to these training
  examples. (2.5pt)
- What is the best split (among Weather, Parents, and Money) according to the information gain?
  Explain. (1pt)
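
The entropy and information-gain calculations asked for above can be sketched as follows. This uses the standard definitions (Shannon entropy with base-2 logarithms, and gain as the entropy of the whole set minus the weighted entropy of the subsets after the split), which the exam does not spell out and are therefore an assumption. The rows are copied from the table; the function and variable names are hypothetical.

```python
import math
from collections import Counter

# (Weather, Parents, Money, Decision) for weekends W1..W9, as in the table.
ROWS = [
    ("Sunny", "Yes", "Rich", "Cinema"),
    ("Sunny", "No",  "Rich", "Tennis"),
    ("Windy", "Yes", "Rich", "Cinema"),
    ("Rainy", "Yes", "Poor", "Cinema"),
    ("Rainy", "No",  "Rich", "Stay in"),
    ("Rainy", "Yes", "Poor", "Cinema"),
    ("Windy", "No",  "Poor", "Cinema"),
    ("Windy", "No",  "Rich", "Shopping"),
    ("Windy", "Yes", "Rich", "Cinema"),
]
ATTRIBUTES = {"Weather": 0, "Parents": 1, "Money": 2}


def entropy(labels):
    """Shannon entropy in bits of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def information_gain(rows, column):
    """Entropy of all decisions minus the weighted entropy after splitting."""
    decisions = [row[-1] for row in rows]
    remainder = 0.0
    for value in {row[column] for row in rows}:
        subset = [row[-1] for row in rows if row[column] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(decisions) - remainder


if __name__ == "__main__":
    print(f"Entropy: {entropy([row[-1] for row in ROWS]):.4f}")
    for name, column in ATTRIBUTES.items():
        print(f"Gain({name}): {information_gain(ROWS, column):.4f}")
```

On the nine rows as printed, Weather and Parents come out with the same gain up to rounding, so choosing between them requires a tie-breaking rule in addition to the information gain itself.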
