You are on page 1of 5

PIBM

Data Visualization
1st Assignment

Notes –
1. All questions carry equal marks.
2. Short and to-the-point answers are expected rather than irrelevant long answers.

Q. 1. What is the difference between Database and Data Warehouse?

Q.2. Which characteristics (factors) do we need to consider so as to make Data Visualization effective?

Q.3. Where does data used in data visualization come from? Give 5 sources of data.

Q.4. What do you mean by ‘Pattern Recognition’?

Q.5. What do you mean by ‘Data Integrity’ in a database? Give 5 business examples that show clear loss of data
integrity.

Q.6. For each term in Column A match ONLY 1 appropriate term from column B. (Note: Column B has additional
(extra) content and not all of the content would match column A properly)

Column A Column B
Graph Type Property or Business Situation

1 Heat Map Values of multiple variables are


arranged in an oval shape.

2 Pictograph Use of colours of different


brightness or shades.

3 Sunburst Chart Makes use of quartile concept


from statistics.

4 Box and Whisker Chart Sales data hierarchically presented


for quarter, month and week.

5 Stacked Bar Chart Makes use of standard deviation


concept from statistics.

Use of videos.

Use of real life image.

Values of multiple variables are


arranged on the top of the other.

Answer 1
DATABASE DATA WAREHOUSE
1. An organised collection of related 1. A central location which stores
data which stores data in tabular format consolidated data from multiple
databases
2. Contains detailed data 2. Contains summarised data
3. Uses Online Transactional Processing 3. Uses Online Analytical Processing
(OLTP) (OLAP)
4. Helps to perform fundamental 4. Helps to analyse the business
operations of a business
5. Slow and less accurate 5. Faster and accurate
6. Application oriented 6. Subject oriented
7. Tables and joins are complex because 7. Tables and joins are simple because
they are normalised they are denormalised
8. Design is helped by entity relationship 8. Design is helped by data modelling
modelling technique

Answer 2

Below mentioned are the characteristics to make data visualisation effective :

1. Clear headings and keys :The more compelling the heading, the more attention
the reader will pay to the data in the chart or graph. To create the most
compelling heading possible, you’ve got to understand your audience. For
example, don’t use technical terms or jargon to communicate to the general
public.
2. Obvious trends : Charts or graphs can help you to spot the trends which going
through report is a tedious job.The reader can get specific information from the
bar graph while immediately picking up on the trend that the line depicts.
3. Simple analysis : Sometimes, a well organised table will do the trick which
graphs and charts can’t. Do the maths for your readers so to make the takeaways
as obvious and inarguable as possible. Keep the information simple and brief,
and use bold and/or coloured font for key points. You can even add a short
summary sentence or two to emphasise the overall message.
4. Relevant comparisons : Comparisons can be a very effective data visualisation
tool, whether you’re trying to drive home a point or encourage a particular
behaviour in readers. Comparisons help put your data into context. This works
best when you’ve got various sets of data that are directly comparable.
Displaying current data alongside averages and standards helps to make your
data actionable.
5. Summaries of key points : In certain situations, data visualisation in the
traditional sense (with colorful graphics) is not the most effective way to display
information. You need something shorter and clearer than detailed reports, you
just have to present the most important points as efficiently as possible. With
clear headings and minimal rows and columns, your reader has all the
information they need at a glance.
6. Add design elements : Adding design elements to data visualization can help
enhance your message in subtle but important ways. It can not only help catch
readers’ eyes and spark interest, but also add to the professional look and feel of
the presentation. Design can also bring visual cues that add relevance for the
reader.
7. Consolidated Information : Another great thing about data visualization is that it
can convey several ideas at once in an easily understandable way. Visually
representing individual data points helps convey detailed information and a
bigger picture simultaneously.

Answer 3

There are various sources of data for data visualisation which depends on which
domain we are trying to bring our insights. Some of the common domains of data
visualisation are healthcare, government, e-commerce, business directory, social
media, real estate, etc.

Five sources of data are-


1. Business directory – Linkedin, Zoominfo, Owler, etc.
2. Social media – Facebook, Twitter, Instagram, etc
3. Ecommerce – Amazon, Snapdeal, Flipkart, etc
4. Financial data – NSE, BSE, IMF Economic data, World Bank Data
5. Healthcare – WHO, UNICEF, Healthdata.gov, etc
Answer 4

Pattern recognition can be defined as the classification of data based on knowledge


already gained or on statistical information extracted from patterns and/or their
representation. Pattern recognition is the ability to detect arrangements of
characteristics or data that yield information about a given system or data set.

Answer 5
The term data integrity refers to the accuracy and consistency of data. When
creating databases, attention needs to be given to data integrity and how to
maintain it. A good database will enforce data integrity whenever possible.

Five business examples that show clear loss of data integrity –

1. Not Attributable – Data should clearly demonstrate who observed and


recorded it, when it was observed and recorded, and who it is about.
2. Not Legible – Data should be easy to understand, recorded permanently and
original entries should be preserved.
3. Not Contemporaneous – Data should be recorded as it was observed, and at
the time it was executed.
4. Not Original – Source data should be accessible and preserved in its original
form.
5. Not Accurate – Data should be free from errors, and conform with the
protocol
Answer 6

Column A Column B
Graph Type Property or Business Situation

1 Heat Map Use of colours of different


brightness or shades.

2 Pictograph Use of real life image.

3 Sunburst Chart Sales data hierarchically presented


for quarter, month and week.

4 Box and Whisker Chart Makes use of quartile concept


from statistics.

5 Stacked Bar Chart Makes use of standard deviation


concept from statistics.

You might also like