
BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani

Pilani Campus
Instruction Division

SECOND SEMESTER 2016-2017

Course Handout Part II


Date: 4-01-2017

In addition to Part I (General Handout for all courses, appended to the time table), this portion gives
further specific details regarding the course.

Course No. : CS F469


Course Title : Information Retrieval
Instructor-in-Charge : Dr. Lavika Goel (lavika.goel@pilani.bits-pilani.ac.in)

Scope and Objective of the Course:

Textbooks:
T1 C. D. Manning, P. Raghavan and H. Schütze. Introduction to Information Retrieval, Cambridge
University Press, 2008.

Reference books:

R1 Modern Information Retrieval, Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Addison-Wesley, 2000.
http://people.ischool.berkeley.edu/~hearst/irbook/
R2 Search Engines: Information Retrieval in Practice by Bruce Croft, Donald Metzler, and Trevor Strohman,
Addison-Wesley, 2009.
R3 Cross-Language Information Retrieval by Jian-Yun Nie, Morgan & Claypool Publisher series, 2010.

R4 Multimedia Information Retrieval by Stefan M. Rüger, Morgan & Claypool Publisher series, 2010.

R5 Information Retrieval: Implementing and Evaluating Search Engines by S. Buttcher, C. Clarke and G.
Cormack, MIT Press, 2010.
R6 Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data by B. Liu, Springer, Second Edition,
2011.
R7 Ricci, F.; Rokach, L.; Shapira, B.; Kantor, P.B. (Eds.), Recommender Systems Handbook. 1st Edition.,
2011, 845 p. 20 illus., Hardcover, ISBN: 978-0-387-85819-7
R8 Koehn P., “Statistical Machine Translation”, Cambridge University Press, 2010.

Please Do Not Print Unless Necessary



Course Plan:

Lecture no. | Learning Objectives | Topic title | Reference
1 | To understand the reason to study this course | Introduction and Motivation | R1 Ch1, Ch2
2-3 | Introduction to ad-hoc search | The term vocabulary, postings lists and Boolean retrieval | T1 Ch. 1 & 2, R1 Ch2 section 5
4-5 | Understand the importance of Dictionaries and tolerant retrieval | Wildcard queries, Spelling correction, Edit distances and Phonetic correction | T1 Ch. 3
6-8 | To be able to identify and implement suitable Index and Compression techniques | Blocked sort-based indexing, Single-pass in-memory indexing, Distributed indexing, Dynamic indexing, Parametric and zone indexes, Weighted zone scoring | T1 Ch. 4
9-11 | Understand the importance of Scoring, term weighting | Parametric and zone indexes, Weighted zone scoring, Learning weights, Term frequency and weighting, Tf-idf weighting | T1 Ch. 6
12-13 | Understand the importance of Scoring, term weighting | Dot products, Queries as vectors, Variant tf-idf functions, Document and query weighting schemes | T1 Ch. 6
14-17 | To understand the concepts of text mining | Text Mining: Classification, Naïve Bayes, Vector space classification, Evaluating Classification, Clustering, Flat clustering, Hierarchical clustering | T1 Ch13, 14, 16, 17

Open Challenges in text retrieval

18 | To be able to identify the issues while working with languages other than English | European Languages, East Asian Languages, Other Languages | R3 Ch. 1
19-23 | Ability to implement a CLIR using the IBM model; to gain insight into the research issues in CLIR | Translation Approaches for CLIR, Handling many Languages, Using manually constructed Translation systems and resources for CLIR | R3 Ch. 2, R8 Ch. 4, 5, 6
24-30 | Differentiate between different recommender systems and suggest a suitable system based on the problem and data available | Introduction to recommendation system, Collaborative and Content based recommendation, Hybrid recommendation systems | R7 Ch. 1, 2, 3, 4, 5
31-37 | To be able to identify the issues while working with multimedia like Image, Audio and video; to learn about the research issues in MIR | Basic Multimedia search technologies, Content based retrieval, Image and Audio data challenges, Multimedia IR Research | R4 Ch. 1, 2, 3
38-40 | Web Searching | Web Search Basics, Web Crawlers and Indexes, Link Analysis: The web as a graph, Google’s PageRank | T1 Ch. 19, 20, 21


Evaluation Scheme:

Component | Nature of component | Duration | Date & Time | Weightage (%)
Assignments | Open Book | - | To be announced | 30%
Midsem exam | Closed Book | 90 min | 11/3, 9:00 - 10:30 AM | 30%
Comprehensive | Partially Open Book / Closed Book | 3 hours | 15/5 FN | 40%

Chamber Consultation Hour: Monday 4-5 PM.

Notices: All notices related to the course will be displayed on the CSIS Notice Board and/or the course
website on Nalanda.

Make-up Policy:

Make-ups for tests shall be granted by the I/C only with prior permission, only in genuine cases, and with the
permission of the warden concerned.

Make-up for comprehensive examination will be decided and scheduled by the Instruction Division.

INSTRUCTOR-IN-CHARGE

