COLLEGE OF ENGINEERING
PROFESSIONAL ELECTIVE - V
CS8080 – INFORMATION RETRIEVAL TECHNIQUES
3) What are the three classic models in information retrieval system? [BTL 1] [CO1]
1. Boolean model
2. Vector Space model
3. Probabilistic model
wij ∈ {0,1}
Query q = ka ∧ (kb ∨ ¬kc)
sim(q,dj) = 1, i.e. the document is relevant
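The Boolean matching rule above can be illustrated with a minimal sketch. It assumes a query of the form ka AND (kb OR NOT kc); the four documents and their term sets are hypothetical and chosen only to show how binary weights lead to a similarity of 0 or 1.

```python
# Binary index-term weights w_ij in {0, 1}: a document is modelled as the
# set of terms it contains. Documents and term names are hypothetical.
docs = {
    "d1": {"ka", "kb"},
    "d2": {"ka", "kc"},
    "d3": {"ka"},
    "d4": {"kb"},
}


def matches(terms):
    """Evaluate the assumed query q = ka AND (kb OR NOT kc) on one document."""
    return "ka" in terms and ("kb" in terms or "kc" not in terms)


def sim(terms):
    """Boolean similarity: 1 if the document satisfies the query, else 0."""
    return 1 if matches(terms) else 0


# The model only separates relevant from non-relevant; there is no ranking.
relevant = sorted(name for name, terms in docs.items() if sim(terms) == 1)
print(relevant)
```

In this sketch d1 and d3 satisfy the query, while d2 (contains kc but not kb) and d4 (lacks ka) do not. Because sim is either 0 or 1, the retrieved documents cannot be ordered by degree of match.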
13) What are the assumptions of vector space model? [BTL 2] [CO1]
Assumption of vector space model:
The degree of matching can be used to rank-order documents;
This rank-ordering corresponds to how well a document satisfies a user's information need.
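One common way to quantify the degree of matching is the cosine of the angle between the document vector and the query vector. The formula below is a sketch under that assumption: the text above does not name a specific measure, and the symbols (t index terms, w_{i,j} for the weight of term i in document d_j, w_{i,q} for its weight in the query) are notation introduced here.

```latex
\mathrm{sim}(d_j, q)
  = \frac{\vec{d_j} \cdot \vec{q}}{\lVert \vec{d_j} \rVert \, \lVert \vec{q} \rVert}
  = \frac{\sum_{i=1}^{t} w_{i,j} \, w_{i,q}}
         {\sqrt{\sum_{i=1}^{t} w_{i,j}^{2}} \; \sqrt{\sum_{i=1}^{t} w_{i,q}^{2}}}
```

With non-negative weights the value lies between 0 and 1, so documents can be ordered by it from best match to worst.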
ii) Write the advantages and disadvantages of the classic models used in IR. (7) [BTL 5] [CO1]
2) Sort and rank the documents in descending order according to the similarity values:
Suppose we query an IR system for the query "gold silver truck"
The database collection consists of three documents (D = 3) with the following content,
D1: "Shipment of gold damaged in a fire"
D2: "Delivery of silver arrived in a silver truck"
D3: "Shipment of gold arrived in a truck" (14) [BTL 6] [CO1]
Answer:
Finally, we sort and rank the documents in descending order according to the similarity values:
Rank 1: Doc 2 = 0.8246
Rank 2: Doc 3 = 0.3271
Rank 3: Doc 1 = 0.0801
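The ranking can be checked with a short Python sketch. The answer does not state the weighting scheme, so the code assumes raw term frequency multiplied by log10(N/df) and cosine similarity, which is one scheme consistent with the scores given. The function names (tokenize, weights, cosine) are illustrative. Computed without intermediate rounding, the scores should agree with the values above to roughly three decimal places; the last digit can differ from a hand calculation.

```python
import math
from collections import Counter

docs = {
    "D1": "Shipment of gold damaged in a fire",
    "D2": "Delivery of silver arrived in a silver truck",
    "D3": "Shipment of gold arrived in a truck",
}
query = "gold silver truck"


def tokenize(text):
    """Lowercase and split on whitespace (no stemming or stop-word list)."""
    return text.lower().split()


N = len(docs)  # number of documents in the collection
tf = {name: Counter(tokenize(text)) for name, text in docs.items()}
# Document frequency: iterating a Counter yields each distinct term once.
df = Counter(term for counts in tf.values() for term in counts)
# Assumed weighting: idf = log10(N / df). Terms present in every document
# ("a", "in", "of") get idf = 0 and so contribute nothing to the score.
idf = {term: math.log10(N / n) for term, n in df.items()}


def weights(counts):
    """tf * idf weight for each term that occurs in the collection."""
    return {t: c * idf[t] for t, c in counts.items() if t in idf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


q_vec = weights(Counter(tokenize(query)))
scores = {name: cosine(q_vec, weights(counts)) for name, counts in tf.items()}
ranking = sorted(scores, key=scores.get, reverse=True)

for rank, name in enumerate(ranking, start=1):
    print(f"Rank {rank}: {name} = {scores[name]:.4f}")
```

D2 ranks first because "silver" occurs in only one document (high idf) and appears twice in it. D1 ranks last because it shares only "gold" with the query.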
3) Estimate sparse vectors and its efficiency with diagram. (7) [BTL 02] [CO1]