NAME: IDNO:
BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI
I SEMESTER 2005-2006
SS G515 DATA WAREHOUSING
Comprehensive Examination
Date: 08 December 2005
Time: 3 Hours (2.00 – 5.00 pm)
Weightage: 35% [Part A (closed book) – 16 & Part B (open book) – 19]
Part A – Closed Book
Points to note:
Answer multiple choice questions in the Question paper itself
Some questions may have more than one correct option. You will get credit only if you
mark all the correct options
There is NO NEGATIVE MARKING
ENCIRCLE the correct option(s)
Short answer questions are to be answered in the supplementary answer sheet provided
Multiple-Choice Questions (20*0.5=10)
1. The characteristic that indicates that a data warehouse is organized around key
high-level entities of the enterprise is:
(a) Time-variant
(b) Non-volatile
(c) Subject-oriented
(d) Integrated
2. An ODS contains data that is:
(a) Detailed
(b) Current-valued
(c) Integrated
(d) Subject-oriented
3. Class IV ODS is different from class I, II, III ODS because:
(a) It is supported by the data warehouse
(b) Its granularity is different
(c) It contains enriched profile data
(d) Its refresh cycle is ad hoc
4. The level of data transformations is highest in:
(a) Class I ODS
(b) Class II ODS
(c) Class III ODS
(d) Class IV ODS
5. The dimension that is not available in the operational systems:
(a) Product
(b) Store
(c) Customer
(d) Time
6. Which of the following operations differentiates the HOLAP architecture from the
ROLAP & MOLAP architectures:
(a) Drill-across
(b) Drill-through
(c) Drill-down
(d) Roll-up
Comprehensive Examination SS G515 – Data Warehousing
17. Finding out about products that were on promotion but did not sell requires:
(a) Roll-up
(b) Slicing & dicing
(c) Drill-through
(d) Drill-across
18. Dimensional modeling is more restrictive than ER modeling because:
(a) Data is always classified as fact or dimension
(b) Dimension tables must have single field primary keys
(c) Dimension tables cannot be normalized
(d) Two dimension tables cannot be linked through foreign keys
19. In a data warehouse, bitmap indexes are created on:
(a) Fact tables
(b) Dimension tables
(c) Helper tables
(d) Minidimension tables
20. Snowflaking:
(a) Removes low cardinality columns
(b) Prohibits use of bitmap indexes
(c) Makes browsing difficult
(d) Saves space
Problem 2
ONLINE PLACEMENT COMPANY DATA WAREHOUSE
Itplacement.com is an online placement company. The portal allows companies
looking for IT professionals to publish/post their requirements on the portal. The
portal also allows the applicants (job seekers) to post their resumes for possible
placements.
Design a data warehouse for the placement company. The data warehouse
should help the job seekers in getting better placements and also the companies
that are hiring. The DW should also help Itplacement.com in doing more
business.
Identify the requirements and design star schema(s). Give details of the
dimensions created by you.
Identify the advanced dimensional modeling features used by you in the design.
For each of the three categories of users, write a typical analytical query that
they would want to ask. Also show how your data warehouse would be able to answer
these queries efficiently.
[4+2]
Problem 3
SPARSITY FAILURE
Consider a factless fact table containing the finest granularity attendance data of
students at BITS. The table contains data from academic year 2000-2001
onwards. Queries requiring aggregated attendance data are quite common.
Suggest a suitable aggregation strategy. Will the phenomenon of sparsity failure
occur when you pre-compute the aggregates? Justify your answer.
[3]
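The row-count reasoning behind Problem 3 can be explored with a small experiment. The following is a minimal sketch in Python, not a model answer. It builds a synthetic factless fact table of attendance keys and rolls the day key up to a coarser period. The numbers of students, courses, and days, the attendance probability, and the function names are all illustrative assumptions and do not come from BITS data.

```python
import random
from collections import Counter

def build_attendance(n_students=50, n_courses=5, n_days=60, p_attend=0.8, seed=0):
    """Synthetic factless fact rows: one (day, student, course) key per class attended.

    All parameters are hypothetical values chosen only for illustration.
    """
    rng = random.Random(seed)
    rows = []
    for student in range(n_students):
        # Assumption: each student is registered for 2 of the courses.
        for course in rng.sample(range(n_courses), 2):
            for day in range(n_days):
                if rng.random() < p_attend:
                    rows.append((day, student, course))
    return rows

def aggregate(rows, days_per_period=30):
    """Roll the day key up to a period key and count attendance events per cell."""
    counts = Counter()
    for day, student, course in rows:
        counts[(day // days_per_period, student, course)] += 1
    return counts

def compression_ratio(rows, agg):
    """Base-table rows per aggregate row; a value near 1 means little is saved."""
    return len(rows) / len(agg)

rows = build_attendance()
agg = aggregate(rows)
ratio = compression_ratio(rows, agg)
print(len(rows), len(agg), round(ratio, 1))
```

Comparing the aggregate row count with the base row count for each candidate aggregate is one way to support the justification asked for. If the ratio stays close to 1, the aggregate is nearly as large as the base table. Changing `p_attend` or `days_per_period` shows how the density of the base data affects that ratio.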