
Topic | Questions | Option (A) | Option (B) | Option (C) | Option (D) | Answers

DM | A data warehouse is which of the following? | Can be updated by end users | Contains numerous naming conventions and formats. | Organized around important subject areas. | Contains only current data. | C

DM | A star schema has what type of relationship between a dimension and fact table? | Many-to-many | One-to-one | One-to-many | All of the above | C
DM | Fact tables are which of the following? | Completely denormalized | Partially denormalized | Completely normalized | Partially normalized | C
DM | A snowflake schema is which of the following types of tables? | Fact | Dimension | Helper | All of the above | D
DM | A goal of data mining includes which of the following? | To explain some observed event or condition | To confirm that data exists | To analyze data for expected relationships | To create a new data warehouse | A
DM | OLAP databases are called decision support systems. | True | False | | | A
DM | In a star schema, dimension tables are | Short and fat | Long and thin | Long and fat | Short and thin | A

DM | The data in a data warehouse is generally | Clean data | Dirty data | Clean and dirty data | None of above | A
DM | Choose two | Ralph Kimball believes that portions of data can be combined based on relevance of data and can be used for reporting | Inmon believes that portions of data can be combined based on relevance of data and can be used for reporting | Inmon believes that DW is built and should be used for reporting. | Ralph Kimball believes that DW is built and should be used for reporting. | B and D
DM | In which type of SCD (slowly changing dimension) do we preserve history of data: | Type One | Type Two | Type Three | None of above | B

ETL | During ETL load we generally have | Unsorted data for Aggregator | Sorted data for Aggregator | Does not matter if we use sorted or unsorted data for aggregation | None of the above | B
DM | Sequence of jobs to load data into the warehouse | First load data into fact tables, then dimension tables, then aggregates if any | First load data into dimension tables, then fact tables, then aggregates if any | First aggregates, then load data into dimension tables, then fact tables | Does not matter if we load either of fact, dimensions, or aggregates | B
DM | Snowflaking means | Normalizing the data | Denormalizing the data | None of above | | A
OLAP | Drill across generally uses the following join to generate a report | Self join | Inner join | Outer join | None of the above | C
DM | In general, data in data warehousing is | Normalized | Denormalized | None of above | | C
DM | Consolidated data mart is | First level data mart | Second level data mart | All of these | None of above | B
DM | In the 4 step dimensional process, declaring the grain of the business process is | First step | Second step | Third step | Fourth step | B

DM | Dimensions are conformed when | They are different | They are either same or one is subset of another | When they can be compared mathematically | None of these | B

DM | You need to create an index on the SALES table, which is 10 GB in size. You want your index to be spread across many tablespaces, decreasing contention for index lookup, and increasing scalability and manageability. Which type of index would be best for this table? | bitmap | unique | partitioned | reverse key | C

DM | Which of the following statements is true? | A data warehouse is useful to all organisations that currently use OLTPs | A data warehouse is valuable only if the organisation has an interest in analysing historical data. | A data warehouse is valuable to those organisations that need to keep an audit trail of their activities | A data warehouse is necessary to all those organisations that are using relational OLTPs | B
DM | Analytical processing is | the act of using software to analyse highly consolidated data, often to view the changes over time. | the act of exporting data into a spreadsheet for analysis | the act of using a relational database to produce reports giving data summaries on a regular basis (e.g. monthly) | the act of summarising data on a regular basis (e.g. month end summaries) | A

DM | Which of the following statements is true? | The fact table of a data warehouse is the main store of descriptions of the transactions stored in a DWH | The fact table of a data warehouse is the main store of all of the recorded transactions over time. | A fact table describes the granularity of data held in a DWH | A fact table describes the transactions stored in a DWH | B
DM | Which of the following is associated with a data warehouse | A relation | A flat file | A hierarchical and/or network structure | A star schema | D

DM | Which of the following statements is true? | The more data a data warehouse has, the better it is | A data warehouse automatically makes a copy of every transaction recorded in an OLTP system | A data warehouse is a relatively straightforward thing to set up. | Adding data for the sake of it may well degrade the effectiveness of data warehousing analysis | D
DM | A data warehouse | must import data from transactional systems whenever significant changes occur in the transactional data. | takes regular copies of transaction data | takes regular copies of transaction data and stores it in a way that is optimised for query and reporting | has to work on live transactional data to provide up to date and valid results | C

DM | Granularity refers to | The number of fact tables in a data warehouse | The level of detail of the data descriptions held in a data warehouse | The level of detail of the data stored in a data warehouse. | The number of dimensions in a data warehouse | C
DM | Dimensionality refers to | The level of detail of data that is held in the fact table | The data that describes the transactions in the fact table. | The level of detail that is held in the Data Warehouse | The number of dimension tables that exist in a star schema | B

DM | The main organisational justification for implementing a data warehouse is to provide | large scale transaction processing | Cheaper ways of handling transactions | Storing large volumes of data | Decision support | D
OLAP | OLTP stands for | On Line Transaction Processing | On Line Terminal Protocol | On Line Terminal Processing | On Line Transaction Protocol | A
DM | Data in a data warehouse | in a flat file format | must be in normalised form to at least 3NF | can be normalised but often isn't | must be in normalised form to at least 2NF | C
DM | A data warehouse needs to be | time variant | Subject orientated | Capable of integrating data from a wide variety of sources | non-volatile | A,B,C,D

DM | Transaction processing is | the act of processing individual transactions | the act of analysing each transaction to verify that it is valid | the act of analysing transactions on a regular basis (e.g. monthly) | the act of processing, recording and storing individual transactions in a database | D
OLAP | OLAP stands for | On Line Analytical Protocol | On Line Abstraction Processing | On Line Abstraction Protocol | On Line Analytical Processing | D

DM | What is a formal way to express data relationships to a database management system? | Attributes | Entity identifier | Data model | Entity-relationship diagram | C
DM | What is a technique for documenting the relationships between entities in a database environment? | Attributes | Entity identifier | Data model | Entity-relationship diagram | D
DM | What indicates having the potential to contain more than one value for an attribute at any given time? | Constraint | Single-valued | All of the above | None of the above | D
DM | Which relationship is between two entities in which an instance of entity A can be related to zero, one, or more instances of entity B and entity B can be related to zero, one, or more instances of entity A? | One-to-many relationship | One-to-one relationship | Many-to-many relationship | Many-to-one relationship | C
DM | Which of the following uses a series of logically related two-dimensional tables or files to store information in the form of a database? | Database | Database management system | Data warehouse | None of the above | D

OLAP | All of the following terms describe OLAP, except | The gathering of input information | Processing input information | Updating existing information to reflect the gathered and processed information | None of the above | D
DM | Which tool is used to help an organization build and use business intelligence? | Data warehouse | Data mining tools | Database management systems | All of the above | D
DM | What does the data dictionary identify? | Field names | Field types | Field formats | All of the above | D
DM | Which of the following is a data manipulation tool? | File generators | Query by example tool | Structure question language | All of the above | B
DM | When gathering business information requirements, you should focus only on the requirements provided by the business groups. | True | False | | | B
OLAP | One difference between the design of online transaction processing (OLTP) and online analytical processing (OLAP) systems is that the OLTP system design is optimized for getting data into the database. | True | False | | | A
DM | Designing a data warehouse in first normal form (1NF) is not recommended. | True | False | | | B
DM | Cardinality is defined as the number of relationships existing between entities. | True | False | | | A
DM | It is not important to include metadata when designing a data warehouse. | True | False | | | B
DM | There is no need to include a time dimension in the data warehouse. | True | False | | | B
DM | The level of granularity you choose for the time dimension has no significant impact on the size of your database. | True | False | | | B

DM | Surrogate keys are generated on tables in the data warehouse after the table is populated. | True | False | | | B
DM | To improve performance, all tables in the data warehouse should be indexed. | True | False | | | B
DM | Fact tables are often referred to as the measures of business performance. | True | False | | | A
DM | Dimension tables are used to provide descriptions of the business subjects and descriptive information about each row in the fact table. | True | False | | | A
DM | A high level of granularity means more detail; a low level of granularity means less detail. | True | False | | | A
DM | One method of managing the history in dimension tables is to drop the dimension and rebuild the table from scratch. | True | False | | | B
DM | You do not need to be concerned with maintaining the history of changing data in the dimension tables. | True | False | | | B
DM | Effective use of summaries is the best technique for improving performance in data warehouses. | True | False | | | A
DM | Summary data cannot be combined with detailed fact data. | True | False | | | B
DM | When choosing a level of summarization, there are two approaches: summarizing the entire dimension, or summarizing part of the dimension and partially improving performance | True | False | | | A
DM | Table partitioning splits the storage of a table into smaller individual units. | True | False | | | A
DM | Denormalization is the factor that increases the sparseness in a database. | True | False | | | A

OLAP | What are the actual data values that occupy the cells as defined by the dimensions selected? | Nesting | Aggregation | Dimensions | Measures | D
OLAP | The term that defines filtering data in an OLAP cube is ___________ . | dicing | slicing | rotating | nesting | B
OLAP | What is an item that matches a specific description or classification? | Category | Measure | Dimension | Nest | A
OLAP | The cube structure in OLAP achieves the __________ functionality. | shared | information | collection | multidimensional | D
OLAP | Aggregation provides OLAP with __________ | multidimensional data | pre-calculated data | nested data | slow data retrieval | B
OLAP | What is the acronym that defines OLAP? | FHTMI | FASMI | ASFMI | MASHF | B
OLAP | __________ in OLAP allows you to define a subcube of the original space. | Dicing | Slicing | Rotating | Nesting | A
OLAP | What term in OLAP defines changing the dimensional orientation of the report from the cube data? | Dicing | Slicing | Rotating | Nesting | C
OLAP | The _________ in OLAP enable you to drill-up or drill-down to view different levels of detail about your data. | dimensions | measures | nesting | aggregation | A
OLAP | When you nest in OLAP, you _________ . | select multiple cube aggregations | select multiple cube measures | select multiple cube dimensions | select multiple cube slices | C

ETL | Which of the following describes ETL? | A process that transforms information using a common set of enterprise definitions | A process that loads information into a data warehouse | A process that extracts information from internal and external databases | All of the above | D
DM | What is data mining | A particular attribute of information | The common term for the representation of multidimensional information | The process of analyzing data to extract information not offered by the raw data alone | Uses a variety of techniques to find patterns and relationships in large volumes of information and infer rules from them that predict future behavior and guide decision making | C
DM | A collection of related data fields is called a ____. | byte | record | character | bit | B
DM | A DBMS is a(n) ____. | interface between the database and application programs | data repository | knowledge base | unique group of records | A
DM | A(n) ____ is a generalized class of people, places, or things for which data is collected, stored, and maintained | record | entity | attribute | file | B
DM | Which attribute would make the best primary key? | Social security number | Last name | First name | Age | A
DM | The ____ data model follows a treelike structure. | distributed | hierarchical | network | relational | B
DM | The most popular database model currently in use is the ____. | relational model | hierarchical model | network model | object model | A
DM | A primary key is a field or set of fields that uniquely identifies a record. | True | False | | | A
DM | One of the goals of a DBMS is to increase data redundancy thereby making it less vulnerable to hackers. | True | False | | | B
DM | A Data Warehouse would most likely be part of a(n) | ERP System | Small MIS System | DBMS | Expert system | A
DM | Data Mining would most likely be used | to streamline a Transaction Processing System | to model data in a DBMS | to help transform data into useful information that can be used by a DSS | to help transform data from different sources so that they can be stored in a single Data Warehouse. | C
DM | Which of the following is a valid key field | A Book Title | House number + Street Name | Car Registration number | Initials + Family Name + Date of Birth | C
DM | A Table | Can only store data of one type | Consists of Alphanumeric data | Consists of Rows and Columns | Cannot be empty | C
DM | A RDBMS cannot store data without knowing the data type. Which of the following statements are true? | A Logical data type can store three values, TRUE, UNKNOWN and FALSE | Numerical data can be stored in different formats | Some DBMSs can use DATE data types | A Character (Text) data type can contain 0,1,2,3,4,5,6,7,8 and 9 | B,C,D
DM | A FLAT FILE database management system is | A database design that only has one table in it | A DBMS that can only have simple data tables in it | A DBMS that can only have one table in it | A DBMS that contains records that have a large number of fields in them | C

DM | Assume you are extending the design of The College Student Records System to include details on each classroom. The college is never likely to have more than ten classrooms and definitely not ever going to have more than 25 classrooms. What data type would you select | Numeric - Byte | Numeric - Single | Numeric - Integer | Numeric - Long integer | A

OLAP | A report must | be exported to a word processor for printing | be based on an underlying data source (a table or a query) | be redefined each time it is used | be password protected | B
OLAP | The layout of a report is independent of the number of records held in a table or query | True | False | | | A
OLAP | A report is used to | produce output that is ready for e-mailing | produce output that is ready for publication on the Web (HTML) | produce output that is formatted for display on a computer screen | produce output formatted for print | D
DM | The rule that prohibits transitive dependencies is | third normal form | first normal form | second normal form | None of the above | A
DM | The rule that requires that each non-key field (attribute) should be fully functionally dependent on the primary key is | Third Normal Form | First Normal Form | Second Normal Form | None of the above | C
DM | The rule that specifies that there should be no repeating fields and that fields should be atomic is | third normal form | second normal form | first normal form | None of the above | C
DM | The process of combining two tables in a relational database is known as | a Join | a Combine | a Relate | a Construct | A

DM | The ER model is meant to | enable low level descriptions of data | replace relational design | be close to a user's perception of the data | enable detailed descriptions of data query processing | C
DM | The Entity Relation Model models | Entities | Relationships | Entities, Relationships and Processes | Entities and Relationships | D
DM | Which of the following statements best describes the function of an entity relation model? | An ER model provides a view of the logic of the data and not the physical implementation. | An ER model is concerned primarily with a physical implementation of the data and secondly with the logical view | An ER model is entirely concerned with modelling the physical implementation | An ER model is concerned primarily with a logical view of the data and secondly with the physical implementation | A

DM | SQL stands for | Sequential Question Language | Structured Query Language | Structured Question Language | Sequential Query Language | B
DM | Which of the following are elements of SQL? | Data Query Language | Data Definition Language | Data Modification Language | Data Manipulation Language | A,B,D

DM | Consider the table (STUDREC). Which of the following statements will list columns INIT, SNAME, GENDER and KIDS (in that order) for all students who have more than 1 child. | SELECT init, sname, gender, kids FROM studrec WHERE kids <1; | SELECT init, sname, gender, kids FROM studrec WHERE kids >1; | SELECT init, sname, gender, kids FROM studrec WHERE kids >'1'; | SELECT init sname, gender, kids FROM studrec WHERE kids >1; | D
OLAP | A typical data warehouse consists of … | Staging area | Data Marts | Analytical environment | All of the above | A
OLAP | What are the three layers of Data warehouse architecture? | Data staging layer, Data Extract layer, Data transactional layer | Data Modelling layer, Data Access layer, Data Storage layer | Data Extraction layer, Data Access layer, Data Storage layer | None | C
OLAP | Staging Area comes under which layer? | Data Storage layer | Data Access layer | Data Extract layer | None | D

OLAP | What are limitations of traditional techniques? | Extensive programming | Redundant reporting | SQL does not have a natural way of providing flexible view reorganizations that will transpose the data | All of the above | D
OLAP | Different categories of Data Access are? | Web Access | Data Mining | Both A and B | None | C
OLAP | OLAP stands for | Online Access Processing | Online Analytic Processing | Both A and B | None | B

OLAP | A process that uses a variety of statistical and artificial intelligence frameworks to discover patterns and relationships in data | Data Access Process | Data Mining Process | Web Access Process | None | B
OLAP | A category of data access solutions in which information is viewed through a web browser | Data Access Process | Data Mining Process | Web Access Process | Reporting | C
OLAP | What is importance of Data Access? | Businesses today face challenges like | Data Access is the ‘last mile’ that enables decision makers to | Prompt, reliable data access | All of the above | D
OLAP | What are different types of reporting? | Transaction Systems Reporting | Enterprise Data Warehouse Reporting | Both A and B | None | C
OLAP | In Transaction Systems Reporting, Reporting Tool has a native connectivity to? | Views | Tables | OLAP | OLTP | D

OLAP | An enterprise data warehouse (EDW) is designed to | To combine data from multiple OLTP systems | To provide consolidated and cleansed data to an array of data marts | Both A and B | None | C
OLAP | Examples of Managed Query Tool | Business Objects | MS Query | Microsoft Access | All of the above | A
OLAP | Which are the OLAP features? | Multidimensional viewing Capabilities | Time Intelligence - Time Series analysis | Only A | Both A and B | D
OLAP | OLAP system is | Decision support | Relatively standardized and simple queries returning relatively few records | Both A and B | None | A
OLAP | What is measure? | Is not a number | represents factual data | description of subject | Both B and C | B
OLAP | What is ROLAP? | Data is stored in multidimensional cubes | Support for large databases with good performance | Good to access pre-aggregated data | Compilation intensive architecture | B
SQL | Which one is a DDL command? | Insert | Update | Drop | Select | C
SQL | How many types of Normalization rules are there? | 4 | 5 | 6 | 7 | B
SQL | Which are pseudocolumns | CURRVAL | NEXTVAL | ROWID | All of the above | D
SQL | Can you use SELECT in the FROM clause of a SQL SELECT? | YES | NO | | | A
SQL | Describe the use of %ROWTYPE in PL/SQL? | It allows you to associate a variable with a single column type | It allows you to associate a variable with an entire table row | It allows you to associate a variable with an entire table column | Both A and C | B
SQL | How many types of triggers are there? | 9 | 10 | 11 | 12 | D
SQL | What is the default ordering of an ORDER BY clause in a SELECT statement? | Descending | Ascending | | | B
SQL | Union All returns | All rows selected by either query | All rows selected by either query and including duplicates | All distinct rows selected by both queries | All rows selected by the first query but not the second | B
ETL | What is ETL process? | ETL is the set of processes by which data is extracted from various sources, transformed and loaded into target systems | ETL is the set of processes by which data is extracted from various sources and loaded into target systems | Both A and B | None | A
ETL | What is Importance of ETL? | Closely integrated with RDBMS’s | High speed loading of target data warehouses | Both A and B | Only A | C
ETL | Which are ETL Activities? | Data Extraction, Data transformation, Data loading | Data Extraction, Data Extraction – Cleanup, Data loading | Data Extraction, Data loading | Data Extraction, Data transformation, Data Extraction – Cleanup, Data loading | D
ETL | Data Extraction Methods are | Incremental Extraction | Real Time Extraction | Full Extraction | All of the above | D
ETL | Which are the examples of ETL tools? | Informatica PowerCenter | Ab Initio | Business Objects | Both A and B | D
ETL | What is Bulk Load? | Format of Archived data different from operational data | It limits your ability to recover because no database logging occurs | Lengthy and Complex process | All of the above | B
ETL | Which one is not GUI based Scheduler? | Tool Specific | Autosys | CRON jobs | All of the above | C
ETL | What do you mean by Source alteration stage in ETL? | perform a variety of transformations unique to the source, depending on business requirements | performs the access and extraction of data from the source system and builds a temporal view of the data at the time of extraction | performs final formatting of data to produce load-ready files for the target table; identifies and segregates rows to be inserted vs. updated (if applicable); applies remaining technical meta data tagging; and processes data into the RDBMS | final stage, uses the load-ready files from Stage 4 to build aggregation tables needed to improve query performance against the warehouse | A
ETL | What are the different types of Commit intervals? | Target-based commit | Source-based commit | Only A | Both A and B | D
ETL | Which is the first step of the ETL process? | Data Extraction – Cleanup | Data Extraction | Data transformation | Data loading | B

ETL | Which is not pros of Batch Extraction? | Quick and relatively easy to write scripts for doing exports and imports | Does not usually require additional hardware | Not event driven; does not facilitate notification or change in another application at the time of a change in first application | Almost all applications provide utilities for exporting and importing | C
ETL | Which tool does not support Change-Data-Capture Feature? | Ascential Data Stage XE | Informatica PowerCenter | Ab Initio | All of the above | B

DW | What is a Data Warehouse? | Data Warehouse is integrated data in support of management's decisions | A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data in support of management's decisions | Both A and B | None | B
DW | What is the Need of Data Warehousing? | To store Operational Data | Better business intelligence for end-users | Used by Operational users | Both B and C | B
DW | Which one is not a Characteristic of a Data Mart? | Restrictive, non extensible | Short life/tactical | Project Orientation | Flexible, extensible | D
DW | Which is the information need for recent data? | ODS | OLTP | OLAP | All of the above | A
DW | What type of Data Structure Characteristic does Data Warehousing have? | Detailed | Summarized | Detailed and Summarized | Detailed and lightly summarized | C
DW | What are Components of a Data Warehouse Architecture? | Data Cleansing tool | ETL tool | Data Modelling tool | All of the above | D
DW | What is use of Data Cleaning Tools? | Clean up source data in-place on the host | Generate and maintain centralized metadata | Automatic generation of data extract programs | All of the above | A
DW | What is the use of Data Mining Tools? | Slice and Dice | What If analysis | Drill Down | Static Reports | B

DM | What is a Database? | A known fact that can be recorded and that has implicit meaning | The data is perceived by the user as tables | It is designed, built, and populated with data for a specific purpose | All of the above | C
DM | What is a Data Model? | A collection of concepts that can be used to describe the structure of a database | Representation of a set of business requirements in a standard structured framework understood by the users | Both A and B | None | A
DM | Which Data Modelling approach suits a corporate data warehouse? | Dimensional Approach | Entity Relational Approach | Both A and B | None | B
DM | What are the different types of relationship notations? | IEX | IDFIX | IE | Both B and C | C
DM | What is Physical Data Model? | Conceptual | Geared for performance and may consist of redundant data | Both A and B | None | B
DM | What are different types of Data Model? | Physical model, Logical model | Hybrid model | Conceptual model | Both A and C | D
DM | Can we have multiple foreign keys in a table? | True | False | | | A
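
Two of the SQL answers above, that UNION ALL keeps duplicates and that ORDER BY sorts ascending by default, can be checked with a small script. The following is a minimal sketch in Python using the standard-library sqlite3 module. The tables q1 and q2 and their values are made up for illustration and do not come from the question bank; other database engines may differ in details.

```python
import sqlite3

# In-memory SQLite database; q1 and q2 are hypothetical demo tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE q1 (val INTEGER);
    CREATE TABLE q2 (val INTEGER);
    INSERT INTO q1 VALUES (1), (2), (2);
    INSERT INTO q2 VALUES (2), (3);
""")

def column(sql):
    """Run a query and return its first column as a list."""
    return [row[0] for row in conn.execute(sql)]

# UNION ALL keeps every row from both queries, duplicates included.
union_all_rows = column(
    "SELECT val FROM q1 UNION ALL SELECT val FROM q2 ORDER BY val")

# UNION returns only the distinct rows.
union_rows = column(
    "SELECT val FROM q1 UNION SELECT val FROM q2 ORDER BY val")

# ORDER BY without a direction sorted ascending above; DESC reverses it.
desc_rows = column(
    "SELECT val FROM q1 UNION SELECT val FROM q2 ORDER BY val DESC")

print("UNION ALL:", union_all_rows)
print("UNION:    ", union_rows)
print("DESC:     ", desc_rows)
conn.close()
```

In this sketch the value 2 appears three times in the UNION ALL result (twice from q1, once from q2) and once in the UNION result.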
