Test-2 Solution

Uploaded by

MADHUR SARAF JAIN

0% found this document useful (0 votes)

5 views3 pages

Original Title

Test-2 solution

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

5 views3 pages

Test-2 Solution

Uploaded by

MADHUR SARAF JAIN

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

HYDERABAD CAMPUS
FIRST SEMESTER 2020 – 2021
INFORMATION RETREIVAL (CS F469) – TEST-2 SOLUTION

Date: 20.11.2020 Weightage: 12% [24 Marks] Duration: 30mins. Type: Open Book

Q1. Given the following SVD for an input matrix of size mXn (where m represents document and n
represent the terms). Answer questions A to C. [3 M]

A. How many singular values need to be retained if 85% of variance has to be preserved in the data?
17.92+15.17 / 17.92+15.17+3.56 = 90% Hence it is enough to retain the first two singular values to
preserve 85% of the variance.
B. What will the size of the matrices U,Σ and VT if 85% of variance is preserved?
After retaining the first two singular values the sizes are as follows
U will be of size 5 X 2
Σ will be of size 2 X 2
VT will be of size 2 X 5
C. Is it possible to apply SVD on a matrix of any size?
Yes SVD can be applied to matrix of any size.

Q2. Given the following Shingles and the document matrix. Answer questions A to C.
doc_1 doc_2 doc_3
S1 1 1 1
S2 1 0 1
S3 1 0 0
S4 1 0 0
S5 1 0 1
S6 1 0 0
S7 1 0 1
S8 0 1 0
S9 0 1 0
S10 0 1 0
S11 0 1 0
S12 0 0 1
S13 0 0 1
S14 0 0 1
S15 0 0 1
S16 0 0 1

A. Compute the Jaccard similarity between doc_1 and doc_3 assuming that the shingles are asymmetric
binary random variables. [1+3+4 =8M]
3/12 = 0.25

B. What will be signature generated if the following permutation is taken

s16,s15,s14,s13,s12,s11,s10,s9,s8,s7,s6,s5,4,s3,s2,s1?
7 11 16 or S7 S11 S16

C. If the following 4 hash functions what are the entries of the first row in the signature matrix?
(12x+8)mod17
(14x+3)mod17
(11x+5)mod17
(16x+7)mod17
Any of the following would be considered.
If the indexing is from 16 to 1 the following signature would be 0 2 0
If the indexing is from 0 to 15 the following signature would be 0 2 1
If the indexing is from 1 to 16 the following signature would be 0 2 0

Q3. A search Engine-A has returned 10 documents for a user query. If the total number of documents
relevant to this query are 5 and the following is the list of Relevant(R) and Non-Relevant(NR) documents
in a ranked order. [1+2=3M]

R NR NR R R NR NR R NR R

A. What will the precision and recall at rank 4?

Precision = 2/4 = 0.5

Recall = 2/5 = 0.4

B. If the same query is run on another search Engine-B and it returns the following results R NR R NR R R
R NR NR NR would you prefer to query on search engine-A or search Engine-B? Justify your answer in
one line.
Search Engine-B is preferable over A since as a user I am able to see all relevant documents at rank
5.

C. You are hired as expert to develop a recommender system for an ELearning platform. It has a very small
user base as it has recently been launched. What type of recommender systems is suitable? Justify you
answer in one line. [2 M]

In this case a content based recommender system would be suitable since the ELearning platform is
recently launched and do not have enough user data to learn.

D. What is the advantage of combining the collaborative filtering with the baseline approach? Justify you
answer in one line. [2 M]
It helps us combine Global and local effects of user preferences.

E. Once the utility matrix A is decomposed into U, Σ, VT using SVD under what conditions is the
matrix A invertible (i.e we can get back original A using U, Σ, VT ) and how? [2 M]

A is invertible if we don’t reduce the dimensions of Σ (or concepts with less variance)

F. A Latent Factor Model is learnt using 5 User and Item attributes. If 10,000 ratings are used as training
examples from the original matrix, how many parameters have to be learnt by Stochastic Gradient Descent
model? [3 M]
Since each user rantings are factorized into 5 user and 5 Item attributes/factors if 10,000 ratings are
to be used as training data the SGD model will have to learn 10X10,000 = 1,00,000 parameters.

AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
A Complete Guide to M.C.Q (Class-10, Mathematics): CBSE MCQ Series, #1
From Everand
A Complete Guide to M.C.Q (Class-10, Mathematics): CBSE MCQ Series, #1
Er. Sajal Kumar Ghosh
No ratings yet
MCA 3rd Sem Assignment 2016-17 PDF
Document13 pages
MCA 3rd Sem Assignment 2016-17 PDF
manju
No ratings yet
EEE350 Control Systems: Assignment 2
Document15 pages
EEE350 Control Systems: Assignment 2
Nur Afiqah
No ratings yet
CSI 4107 - Winter 2016 - Midterm
Document10 pages
CSI 4107 - Winter 2016 - Midterm
Amin Dhouib
0% (1)
Test-1 - Solution
Document3 pages
Test-1 - Solution
MADHUR SARAF JAIN
No ratings yet
HW 1
Document5 pages
HW 1
calvinlam12100
No ratings yet
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
Document12 pages
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
IAEME Publication
No ratings yet
PMSCS in CSE JU Questions & Slove
Document80 pages
PMSCS in CSE JU Questions & Slove
Aladin sabari
No ratings yet
Examqns2017 18
Document24 pages
Examqns2017 18
yes no
No ratings yet
Linear Algebra Course Project
Document7 pages
Linear Algebra Course Project
shiza asghar
No ratings yet
The University of Nottingham
Document4 pages
The University of Nottingham
P6E7P7
No ratings yet
Introduction To The K-Means Clustering Algorithm Based On The Elbow
Document4 pages
Introduction To The K-Means Clustering Algorithm Based On The Elbow
Asyraf Adnil
No ratings yet
Day04 Business Moments
Document10 pages
Day04 Business Moments
Divya
No ratings yet
Answers For End-Sem Exam Part - 2 (Deep Learning)
Document20 pages
Answers For End-Sem Exam Part - 2 (Deep Learning)
Ankur Borkar
No ratings yet
Tutor Marked Assignment #1: The Open University of Sri Lanka
Document8 pages
Tutor Marked Assignment #1: The Open University of Sri Lanka
Uditha Muthumala
No ratings yet
Ecf630-Final Examination - May 2021
Document12 pages
Ecf630-Final Examination - May 2021
Kalimanshi Nsakaza
No ratings yet
Solution To DMOP Make Up Exam 2016
Document5 pages
Solution To DMOP Make Up Exam 2016
Saurabh Kumar Gautam
No ratings yet
Practice Final sp22
Document10 pages
Practice Final sp22
Ajue Ramli
No ratings yet
Mock Final Examination Model Answer: Faculty of Computer Studies TM351 Data Management and Analysis
Document9 pages
Mock Final Examination Model Answer: Faculty of Computer Studies TM351 Data Management and Analysis
Christina Fington
No ratings yet
2011 Aricent Placement Paper:-1. Void Main
Document6 pages
2011 Aricent Placement Paper:-1. Void Main
Akhilesh Suman
No ratings yet
In Gate 2016 Paper
Document26 pages
In Gate 2016 Paper
Megha Singh
No ratings yet
Dsa2023 HW3 0518
Document17 pages
Dsa2023 HW3 0518
fromtaoyuanhsinchy you
No ratings yet
Design and Analysis of Algorithms - May-2013 PDF
Document4 pages
Design and Analysis of Algorithms - May-2013 PDF
Nagababu Pachhala
No ratings yet
Question Paper3
Document12 pages
Question Paper3
Divya Kakumanu
No ratings yet
Presentation Advanced Structural Dynamics Fin
Document57 pages
Presentation Advanced Structural Dynamics Fin
andyronaldo
No ratings yet
T2
Document2 pages
T2
Kriti Goyal
No ratings yet
Assignment 3 NPTEL DBMS January 2024
Document10 pages
Assignment 3 NPTEL DBMS January 2024
no.reply15203
No ratings yet
MCA Assignment 2013 14 - III Sem
Document14 pages
MCA Assignment 2013 14 - III Sem
Shagun Roy
No ratings yet
STA301 - Final Term Solved Subjective With Reference by Moaaz
Document28 pages
STA301 - Final Term Solved Subjective With Reference by Moaaz
Adnan Khawaja
61% (18)
6 Software Engineering
Document34 pages
6 Software Engineering
jp tech
No ratings yet
SEM 4 - 10 - BA-BSc - HONS - ECONOMICS - CC-10 - INTRODUCTORYECONOMETRI C - 10957
Document3 pages
SEM 4 - 10 - BA-BSc - HONS - ECONOMICS - CC-10 - INTRODUCTORYECONOMETRI C - 10957
Pranjal
No ratings yet
Faculty of Science and Technology OPENBOOK EXAM: COM 123 Numerical Analysis and Computation
Document5 pages
Faculty of Science and Technology OPENBOOK EXAM: COM 123 Numerical Analysis and Computation
JoshBarack Tshinemu
No ratings yet
Midterm2006 Sol Csi4107
Document9 pages
Midterm2006 Sol Csi4107
martin
100% (2)
BAUDM Assignment Predicting Boston Housing Prices
Document6 pages
BAUDM Assignment Predicting Boston Housing Prices
Suraj
No ratings yet
3 - Design and Analysis of Algorithms
Document188 pages
3 - Design and Analysis of Algorithms
Pravee
67% (3)
Machine Learning: Assignment-1
Document9 pages
Machine Learning: Assignment-1
Isha Aggarwal
No ratings yet
QB FDS
Document5 pages
QB FDS
thilakavathishanmugam
No ratings yet
TCS Technical Interview Questions and Answers 2011
Document16 pages
TCS Technical Interview Questions and Answers 2011
Rajesh Sinha
No ratings yet
Exercise Sheet 1 The Multiple Regression Model
Document5 pages
Exercise Sheet 1 The Multiple Regression Model
Ignacio Díez Lacunza
No ratings yet
Dynamic Programming
Document12 pages
Dynamic Programming
M.A raja
No ratings yet
Response Surface Approximation Using Sparse
Document20 pages
Response Surface Approximation Using Sparse
balajigandhirajan
No ratings yet
Exam Advanced Data Mining Date: 5-11-2009 Time: 14.00-17.00: General Remarks
Document5 pages
Exam Advanced Data Mining Date: 5-11-2009 Time: 14.00-17.00: General Remarks
kishh28
No ratings yet
Untitled Document
Document6 pages
Untitled Document
Aparna Singh
No ratings yet
Machine Learning Multiple Choice Questions
Document20 pages
Machine Learning Multiple Choice Questions
Satyanarayan Gupta
100% (1)
Review Questions DS
Document14 pages
Review Questions DS
Saleh Alizade
No ratings yet
Data Structures CS201: Instructor: Atif Khattak
Document33 pages
Data Structures CS201: Instructor: Atif Khattak
Muhammad Umer Arshid
No ratings yet
Sheet 02
Document4 pages
Sheet 02
Timo
No ratings yet
12s MidI - SampleExam Print1
Document8 pages
12s MidI - SampleExam Print1
Divya Gn
No ratings yet
Cloud Computing End Term Special QP
Document3 pages
Cloud Computing End Term Special QP
ರಾಘವೇಂದ್ರ ಟಿ ಎಸ್
No ratings yet
Physics Practical Guide New
Document21 pages
Physics Practical Guide New
ZiAd AhMed
100% (2)
WIPRO Preparation
Document40 pages
WIPRO Preparation
girisha6666
No ratings yet
Design and Analysis of Algorithms
Document5 pages
Design and Analysis of Algorithms
Veena K
No ratings yet
Data Structure Questions Bank
Document30 pages
Data Structure Questions Bank
Rj Sahoo
No ratings yet
MMZ XRF O0 Ra Pre 0 ZB XGXW W1 Er 02 OAYQum QDD78 HQP
Document4 pages
MMZ XRF O0 Ra Pre 0 ZB XGXW W1 Er 02 OAYQum QDD78 HQP
Grace Angelia
No ratings yet
FACE - TCS NQT 24th Oct 8 Am To 11 Am Slot Analysis PDF
Document35 pages
FACE - TCS NQT 24th Oct 8 Am To 11 Am Slot Analysis PDF
Naman Bairagi
No ratings yet
Suggession of Machine Learning
Document6 pages
Suggession of Machine Learning
Parthasarathi Hazra
No ratings yet
Problem 5 - Assignment 1
Document2 pages
Problem 5 - Assignment 1
Anand Bharadwaj
No ratings yet
Caringal Activity 9 Application of System of Linear Equation
Document13 pages
Caringal Activity 9 Application of System of Linear Equation
Dummy Acc
No ratings yet
Data Interpretation Guide For All Competitive and Admission Exams
From Everand
Data Interpretation Guide For All Competitive and Admission Exams
Mohmmad Khaja Shareef
Rating: 2.5 out of 5 stars
2.5/5 (6)
Er9000en 21204 1.00
Document106 pages
Er9000en 21204 1.00
Alexandru Anghel
No ratings yet
Inv - 2469306
Document2 pages
Inv - 2469306
rajesh
No ratings yet
Bootstrap 3 All Classes List Cheat Sheet Reference PDF (2020) PDF
Document21 pages
Bootstrap 3 All Classes List Cheat Sheet Reference PDF (2020) PDF
Honey Shine
No ratings yet
Synopsis Format-Practice School
Document4 pages
Synopsis Format-Practice School
Arjun Goyal
No ratings yet
Introduction To PFA
Document7 pages
Introduction To PFA
CHRISTINE KYLE CIPRIANO
No ratings yet
2D1N Night Nueva Vizcaya
Document3 pages
2D1N Night Nueva Vizcaya
Kaye Roldan
No ratings yet
Heat Transfer Equipment
Document28 pages
Heat Transfer Equipment
deepak.dce.me
No ratings yet
Chap1 Organizational Behavior 2020
Document52 pages
Chap1 Organizational Behavior 2020
Darshan
No ratings yet
Assignment 1: Instructions
Document6 pages
Assignment 1: Instructions
Asim Mughal
No ratings yet
Urban Bias in Community Development: Student: Tiongson Yvonne P. Instructor: Ar. Irene G. Florendo
Document9 pages
Urban Bias in Community Development: Student: Tiongson Yvonne P. Instructor: Ar. Irene G. Florendo
Yvonne Tiongson
No ratings yet
List of ROs Under VO
Document74 pages
List of ROs Under VO
vivek mishra
No ratings yet
Add Math Project 2012 Sabah
Document32 pages
Add Math Project 2012 Sabah
Irsyad
No ratings yet
Vilta-S: Stabilizer For Smartphone
Document28 pages
Vilta-S: Stabilizer For Smartphone
Nivin Kumar
No ratings yet
Egs630-6 Komatsu Genset
Document2 pages
Egs630-6 Komatsu Genset
imamfadili
No ratings yet
Thesis Correction Ranjit
Document3 pages
Thesis Correction Ranjit
ranjit makaju
No ratings yet
PokeDex Checklist
Document7 pages
PokeDex Checklist
Josh Strıke
No ratings yet
M Pump - Plunger 300
Document30 pages
M Pump - Plunger 300
hebert perez
No ratings yet
Basics On Piping Layout
Document11 pages
Basics On Piping Layout
puru55980
No ratings yet
Centroid + MOI (Students)
Document39 pages
Centroid + MOI (Students)
Usman Hafeez
No ratings yet
SENIOR HIGH SCHOOL-English For Academic and Professional Purposes
Document7 pages
SENIOR HIGH SCHOOL-English For Academic and Professional Purposes
joshua herrera
No ratings yet
Session 7 - Beyond Tests - Alternatives in Assessment
Document53 pages
Session 7 - Beyond Tests - Alternatives in Assessment
trandinhgiabao
No ratings yet
IHS Markit - The Global Ultrasound Market
Document2 pages
IHS Markit - The Global Ultrasound Market
wwtqfgtp
No ratings yet
Sample Waste Management Tracking Form
Document3 pages
Sample Waste Management Tracking Form
Sreekumar
No ratings yet
WESCAM MX-15-0503AA-Spec
Document2 pages
WESCAM MX-15-0503AA-Spec
AIT FARID
No ratings yet
STS Module 9
Document14 pages
STS Module 9
Claire Jacynth Floro
No ratings yet
The Singapore Success Story
Document14 pages
The Singapore Success Story
Maria Schipor
No ratings yet
Program - 1:: Lab - Data Structure Using C
Document50 pages
Program - 1:: Lab - Data Structure Using C
eshmnash9298
No ratings yet
Catalogue - FM-200 PFS - Masteco PDF
Document8 pages
Catalogue - FM-200 PFS - Masteco PDF
Nguyễn Minh Thiệu
No ratings yet
5 6316334533637570613
Document5 pages
5 6316334533637570613
Nishant Pathak
No ratings yet
Impact On Organizations
Document14 pages
Impact On Organizations
ogakhan
No ratings yet