
Why is data integration required in a data warehouse, more so than in an
operational application? (5)

Data integration becomes increasingly important when the systems of two companies are merged, or when
applications within one company are consolidated, to provide a unified view of the company's data assets.
The benefit of a data warehouse is that it enables a business to perform analyses across all of the data
it holds.
Data from separate sources needs to be aligned, harmonized, and standardized, so the need for data
cleansing and data quality control is significant. An operational application, by contrast, typically
works only with the data of its own system.
This is why the scope of integration work is larger in data warehousing than in operational systems.
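
As an illustration of the alignment and standardization described above, here is a minimal Python sketch. The two source systems, their field names, and the sample rows are all hypothetical; the rule that the CRM record wins on conflict is also just an assumption made for the example.

```python
from datetime import datetime

# Hypothetical rows from two source systems describing the same customers
# with different field names, casing, and date formats.
crm_rows = [
    {"cust_id": "C001", "name": "  alice SMITH ", "joined": "2021-03-15"},
    {"cust_id": "C002", "name": "Bob Jones", "joined": "2020-11-02"},
]
billing_rows = [
    {"CustomerNo": "c001", "FullName": "Alice Smith", "SignupDate": "15/03/2021"},
    {"CustomerNo": "c003", "FullName": "carol white", "SignupDate": "07/01/2022"},
]


def standardize(customer_id, name, date_text, date_format):
    """Map one source row onto a common warehouse layout."""
    return {
        "customer_id": customer_id.strip().upper(),
        "name": " ".join(name.split()).title(),
        "joined": datetime.strptime(date_text, date_format).date().isoformat(),
    }


def integrate(crm, billing):
    """Merge both sources into one row per customer (assumed rule: CRM wins)."""
    merged = {}
    for row in billing:
        rec = standardize(row["CustomerNo"], row["FullName"], row["SignupDate"], "%d/%m/%Y")
        merged[rec["customer_id"]] = rec
    for row in crm:
        rec = standardize(row["cust_id"], row["name"], row["joined"], "%Y-%m-%d")
        merged[rec["customer_id"]] = rec
    return [merged[key] for key in sorted(merged)]


warehouse_customers = integrate(crm_rows, billing_rows)
for customer in warehouse_customers:
    print(customer)
```

Customer C001 appears in both sources but ends up as a single cleaned row, which is the kind of work a single operational system rarely has to do.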

A dimension table is wide, the fact table is deep. Explain

Fact tables are deep whereas dimension tables are wide: a fact table has a larger number of rows and
a smaller number of columns.
A dimension table holds all the detailed information about its subject. For example, a customer
dimension table contains all the related information about customers, whereas the fact table contains
the main data: the surrogate keys of every dimension, along with the measures.
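
The difference in shape can be sketched with Python's built-in sqlite3 module. The table names, columns, and generated rows below are hypothetical and only meant to show a wide dimension next to a deep fact table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Wide: many descriptive columns, one row per customer.
CREATE TABLE dim_customer (
    customer_key INTEGER PRIMARY KEY,
    customer_name TEXT, email TEXT, city TEXT,
    state TEXT, country TEXT, segment TEXT
);
-- Deep: few columns (surrogate keys plus measures), one row per sale.
CREATE TABLE fact_sales (
    customer_key INTEGER REFERENCES dim_customer(customer_key),
    date_key INTEGER,
    quantity INTEGER,
    amount REAL
);
""")

conn.executemany(
    "INSERT INTO dim_customer VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        (1, "Alice Smith", "alice@example.com", "Austin", "TX", "US", "Retail"),
        (2, "Bob Jones", "bob@example.com", "Leeds", "WY", "UK", "Wholesale"),
    ],
)
# Generate 1000 made-up sales rows spread over the two customers.
sales = [(1 + i % 2, 20240101 + i % 28, 1 + i % 3, 10.0 * (1 + i % 3)) for i in range(1000)]
conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?, ?)", sales)


def shape(table):
    """Return (row_count, column_count) for a table."""
    cols = len(conn.execute(f"PRAGMA table_info({table})").fetchall())
    rows = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    return rows, cols


dim_shape = shape("dim_customer")
fact_shape = shape("fact_sales")
print("dimension (rows, cols):", dim_shape)
print("fact (rows, cols):", fact_shape)
```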
What is Metadata? Why do we need metadata when search engines like
Google seem so effective?
Metadata :-
• Metadata is the information used to describe another piece of data. For example, metadata for a
book might include the author’s name, publishing date, publisher’s name, etc. Metadata for a music
file could include the singer’s name, the album’s name, and the release date.
• A common place where most people encounter metadata is a computer file. Its metadata includes the
size, the creation date, and the location. Some files may have their metadata stored in separate files.
• Metadata is behind-the-scenes data used in many places and by numerous industries. Be it online
retailing, social media, websites, or software, you are likely to find useful metadata describing files.
• Metadata can be generated automatically or typed in manually, depending on the purpose it serves.
Its key goal is to make the search for information easier.
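
The file metadata mentioned above can be read with Python's standard library. This is a small sketch; the file name `notes.txt` is made up, and the modification time is used because a true creation date is not available on every platform.

```python
import os
import tempfile
from datetime import datetime, timezone


def file_metadata(path):
    """Collect a few metadata fields that describe a file, not its content."""
    info = os.stat(path)
    return {
        "location": os.path.abspath(path),
        "size_bytes": info.st_size,
        "modified_utc": datetime.fromtimestamp(info.st_mtime, tz=timezone.utc).isoformat(),
    }


# Create a throwaway file so the example is self-contained.
with tempfile.TemporaryDirectory() as folder:
    sample_path = os.path.join(folder, "notes.txt")
    with open(sample_path, "w", encoding="utf-8") as handle:
        handle.write("hello warehouse")
    sample_meta = file_metadata(sample_path)

print(sample_meta)
```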
Need of Metadata :-
• On a website, metadata refers to unseen HTML elements that directly communicate and clarify website
information for search engines, playing a critical role in effective Search Engine Optimization for
retailers.
• This series of micro-communications includes page titles, description tags and other protocols, and
they may describe purposes, characteristics and general content.
• Metadata is a structured way to communicate information about a data set, which is used in a
variety of settings with special relevance for ecommerce businesses.
• When it comes to SEO, the importance of metadata is hard to overestimate. It’s critical to
demonstrating the relevance of a website to a search engine. As the search engine evaluates your
website according to its purpose, your metadata is the means of communication. The website sends
information to the search engine using the metadata.
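
To make the page title and description tags concrete, the sketch below pulls them out of a page using Python's standard `html.parser` module. The sample page and the `MetaCollector` class are hypothetical, and real search engines process pages in far more elaborate ways.

```python
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Collect the <title> text and named <meta> tags from an HTML page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and "name" in attrs:
            self.meta[attrs["name"].lower()] = attrs.get("content", "")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


# A made-up retail page used only for illustration.
page = """<html><head>
<title>Acme Store - Running Shoes</title>
<meta name="description" content="Lightweight running shoes with free shipping.">
<meta name="robots" content="index, follow">
</head><body><h1>Running Shoes</h1></body></html>"""

collector = MetaCollector()
collector.feed(page)
print(collector.title)
print(collector.meta)
```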
Write applications of OLAP

Business Reporting for sales: Business reporting gives an overview of the sales activities within
an organization. It shows the trends in sales over a certain time period.

Marketing: Industries like digital marketing, health care, eCommerce, and finance use OLAP in
their marketing. Example: Market basket analysis is a technique that carefully studies the
purchases made by a customer in a supermarket.

Management Reporting: It aims to inform managers across the organization about the data from
the various departments of the company, in order to help them make better decisions.

Business Process Management: Business process management refers to improving a business
process from end to end by analyzing it. It helps organizations to understand the steps required to
carry out a business task.

Financial Reporting: Financial reporting refers to the financial reports of an organization that are
released to stakeholders and the public. It includes financial statements such as the balance
sheet, income statement, and statement of cash flows. It shows the financial information that the
company chooses to disclose.
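
The market basket analysis mentioned under Marketing above can be sketched in a few lines of Python. The baskets are made-up sample data, and the sketch only computes pair support (the share of baskets containing both items), which is one simple ingredient of the technique rather than a full implementation.

```python
from collections import Counter
from itertools import combinations

# Hypothetical supermarket baskets, one set of items per customer visit.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"milk", "eggs"},
    {"bread", "butter"},
]


def pair_support(transactions):
    """Return the fraction of baskets that contain each pair of items."""
    counts = Counter()
    for basket in transactions:
        # Sorting gives each pair one canonical ordering, e.g. ("bread", "milk").
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    total = len(transactions)
    return {pair: count / total for pair, count in counts.items()}


support = pair_support(baskets)
for pair, value in sorted(support.items(), key=lambda item: (-item[1], item[0])):
    print(pair, value)
```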

A data warehouse can be modeled by either a star schema or a snowflake
schema. Describe the similarities and the differences of the two models.
• The star schema is the simplest type of data warehouse schema. It is known as a star
schema because its structure resembles a star.
• A snowflake schema is an extension of a star schema: it adds additional dimension tables
by normalizing the dimensions. It is called a snowflake schema because its diagram
resembles a snowflake.
• In a star schema, a single join defines the relationship between the fact table and any
dimension table.
• A star schema contains a fact table surrounded by dimension tables.
• In a snowflake schema, the fact table is surrounded by dimension tables, which are in turn
surrounded by further dimension tables.
• A snowflake schema requires many joins to fetch the data.
• A star schema has a simple DB design, while a snowflake schema has a more complex
DB design.
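
The difference in joins can be illustrated with Python's built-in sqlite3 module. All table and column names below are hypothetical; the same fact table is queried once through a star-style product dimension and once through a snowflake-style dimension whose category has been moved into its own table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Star: the category is stored directly on the product dimension.
CREATE TABLE star_dim_product (
    product_key INTEGER PRIMARY KEY, product_name TEXT, category_name TEXT
);
-- Snowflake: the category is normalized into its own dimension table.
CREATE TABLE snow_dim_category (
    category_key INTEGER PRIMARY KEY, category_name TEXT
);
CREATE TABLE snow_dim_product (
    product_key INTEGER PRIMARY KEY, product_name TEXT,
    category_key INTEGER REFERENCES snow_dim_category(category_key)
);
CREATE TABLE fact_sales (product_key INTEGER, amount REAL);

INSERT INTO star_dim_product VALUES (1, 'Laptop', 'Electronics'), (2, 'Desk', 'Furniture');
INSERT INTO snow_dim_category VALUES (10, 'Electronics'), (20, 'Furniture');
INSERT INTO snow_dim_product VALUES (1, 'Laptop', 10), (2, 'Desk', 20);
INSERT INTO fact_sales VALUES (1, 1200.0), (1, 800.0), (2, 300.0);
""")

# Star schema: one join from the fact table to the dimension.
star_query = """
SELECT p.category_name, SUM(f.amount)
FROM fact_sales f
JOIN star_dim_product p ON p.product_key = f.product_key
GROUP BY p.category_name ORDER BY p.category_name
"""

# Snowflake schema: an extra join to reach the normalized category table.
snowflake_query = """
SELECT c.category_name, SUM(f.amount)
FROM fact_sales f
JOIN snow_dim_product p ON p.product_key = f.product_key
JOIN snow_dim_category c ON c.category_key = p.category_key
GROUP BY c.category_name ORDER BY c.category_name
"""

star_result = conn.execute(star_query).fetchall()
snowflake_result = conn.execute(snowflake_query).fetchall()
print(star_result)
print(snowflake_result)
```

Both queries return the same totals per category; the snowflake version simply needs one more join to get there.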
Differentiate between OLTP and OLAP.
How is a data warehouse different from a database? How are they similar?

Similarities
1. Both the database and the data warehouse are used for storing data.
They are data storage systems.
2. Generally, the bottom tier of a data warehouse is a relational database
system. Databases are typically relational database systems as well.
Relational DB systems organize data in tables of rows and columns and
can hold a large amount of data.
3. Both the DW and databases support multi-user access. A single instance
of a database or data warehouse can be accessed by many users at a
time.
4. Both the DW and the database require queries for accessing the data. The
data warehouse is typically accessed using complex queries, while an OLTP
database is accessed using simpler queries.
5. Database and data warehouse servers can be located on the
company premises or in the cloud.
6. A data warehouse is also a database.
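
Point 4 above can be illustrated with a small sketch using Python's built-in sqlite3 module. The `orders` table and its rows are hypothetical, and a single table stands in for both kinds of system purely to contrast the two query styles.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (
    order_id INTEGER PRIMARY KEY, customer TEXT, region TEXT,
    order_month TEXT, amount REAL
);
INSERT INTO orders VALUES
    (1, 'Alice', 'North', '2024-01', 120.0),
    (2, 'Bob',   'South', '2024-01',  80.0),
    (3, 'Alice', 'North', '2024-02', 200.0),
    (4, 'Carol', 'South', '2024-02',  50.0);
""")

# Operational-style query: fetch one known row by its key.
lookup = conn.execute(
    "SELECT customer, amount FROM orders WHERE order_id = ?", (3,)
).fetchone()

# Analytical-style query: scan, group, and aggregate across many rows.
summary = conn.execute("""
SELECT region, order_month, SUM(amount) AS total
FROM orders
GROUP BY region, order_month
ORDER BY region, order_month
""").fetchall()

print(lookup)
print(summary)
```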
Compare star and snowflake schema dimension table
