You are on page 1of 3

1. Define data warehouse? Discuss design principles.

A Data Warehousing (DW) is process for collecting and managing data from varied sources to
provide meaningful business insights. A Data warehouse is typically used to connect and analyze
business data from heterogeneous sources. The data warehouse is the core of the BI system
which is built for data analysis and reporting. It is a blend of technologies and components
which aids the strategic use of data. It is electronic storage of a large amount of information by a
business which is designed for query and analysis instead of transaction processing. It is a
process of transforming data into information and making it available to users in a timely
manner to make a difference.

The 5 design principles are:-


i) Balance:- in this case relates to our physical balance, or what we
perceive to be balanced. Balance may be symmetrical or
asymmetrical. In both cases, it is meant to provide visual stability.
In a two-dimensional design, balance can be understood based
only on our perception

ii) Scale and Proportion


Scale refers to the size of an object in comparison to other objects
in a design. Designs have to be composed to scale, depending on
the size of the sheet it is being prepared on. Each object has to be
designed to scale, and all objects are to be designed to the same
scale
iii) Repetition and Rhythm:- Repetition is used to create patterns
within a design, where a particular element is repeatedly utilized.
This repetition may be clearly visible, or may be underlying, and
could be noticed after a clear understanding of the image.
Repetition is used to create rhythm. Rhythm is used to create
order in a design composition. 
iv) Emphasis:- emphasis is used to grab the attention of the viewer.
An element of design can be emphasized by the use of color,
gradation, contrast, texture, shape, or placement in comparison to
remainder of the design
v) Unity:- Any design should elicit harmony and should look unified.
The various principles when combined to create a design should
work in harmony with each other. A splatter of unrelated objects
or images cannot create unity. Instead, unity is achieved by the
use of all or any of the aforementioned principles of design

2) STAR SCHEMA:-

 A star schema is a multi-dimensional data model used to organize data in a


database so that it is easy to understand and analyze. Star schemas can be
applied to data warehouses, databases, data marts, and other tools. The star
schema design is optimized for querying large data sets.

Benefits of Star Schema:-

i) Great for simple queries because of their reduced dependency on


joins when accessing the data, as compared to normalized models like
snowflake schemas.
ii) Adapt well to fit OLAP models.
iii) Improved query performance as compared to normalized data,
because star schemas attempt to avoid computationally expensive
joins.

3) Snow Flake Schema:-

  The snowflake schema is a variant of the star schema. Here, the centralized
fact table is connected to multiple dimensions. In the snowflake schema,
dimensions are present in a normalized form in multiple related tables. The
snowflake structure materialized when the dimensions of a star schema are
detailed and highly structured, having several levels of relationship, and the
child tables have multiple parent tables.
Advantages: There are two main advantages of snowflake schema given
below: 
 It provides structured data which reduces the problem of data integrity.
 It uses small disk space because data are highly structured.
Disadvantages: 
 Snowflaking reduces space consumed by dimension tables but compared
with the entire data warehouse the saving is usually insignificant.
 Avoid snowflaking or normalization of a dimension table, unless required and
appropriate.

4) Fact constellation schema


You might also like