Professional Documents
Culture Documents
DV Unit 1 QB
DV Unit 1 QB
(Autonomous)
DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA SCIENCE
VI Semester
20AI602 – Data Visualization
Regulations 2020
Question Bank
UNIT – I ( Introduction )
PART- A
Q.No. Questions Marks CO BL
1 Define Data Visualization. 2 CO1 R
2 Why data visualization is considered as a powerful tool? 2 CO1 R
3 List out 5 key factors that influence data visualization. 2 CO1 R
4 State the purpose of visualization in business. 2 CO1 R
5 Identify the advantages of Data visualization. 2 CO1 U
6 Which visualization method is used for decision making? 2 CO1 R
7 State any 5 examples of data visualization. 2 CO1 R
8 List out the functions of Data visualization. 2 CO1 R
9 Identify the essential factors of Your data. 2 CO1 U
10 Recall the factors that influence data representation. 2 CO1 R
11 Identify the visualization method to represent two categories of data. 2 CO1 U
12 Infer Heat Map. 2 CO1 R
13 Define Area chart. 2 CO1 R
14 How Funnel charts display a pipeline, typically for sales figures? 2 CO1 R
15 List out data types used in parsing. 2 CO1 R
16 State the seven stages of data visualization. 2 CO1 R
17 State the seven stages of data visualization. 2 CO1 R
18 Name the widgets provided by MarkLogic Server. 2 CO1 R
19 Classify data visualization tools. 2 CO1 U
20 Define Whatagraph tool. 2 CO1 R
PART- B
Q.No. Questions Marks CO BL
1. Discuss in detail about Methodology of Data Visualization. 16 CO1 U
2. Describe visual functions and tones of data visualization in detail. 16 CO1 U
3. Summarize any five Data Visualization Representation with example. 16 CO1 U
4. Explain in detail about Visualization design options with example. 16 CO1 U
5. Illustrate the seven stages of Data visualization with neat diagram. 16 CO1 U
6. Describe Data visualization tools in detail with neat diagram. 16 CO1 U
Histogram
Unlike bar charts, histograms illustrate the distribution of data over a continuous
interval or defined period. These visualizations are helpful in identifying where
values are concentrated, as well as where there are gaps or unusual values.
Gantt Chart
Gantt charts are particularly common in project management, as they’re useful
in illustrating a project timeline or progression of tasks. In this type of chart,
tasks to be performed are listed on the vertical axis and time intervals on the
horizontal axis. Horizontal bars in the body of the chart represent the duration of
each activity.
Heat Map
A heat map is a type of visualization used to show differences in data through
variations in color. These charts use color to communicate values in a way that
makes it easy for the viewer to quickly identify trends. Having a clear legend is
necessary in order for a user to successfully read and interpret a heatmap.
Waterfall Chart
A waterfall chart is a visual representation that illustrates how a value changes
as it’s influenced by different factors, such as time. The main goal of this chart is
to show the viewer how a value has grown or declined over a defined period.
For example, waterfall charts are popular for showing spending or earnings over
time.
Area Chart
An area chart, or area graph, is a variation on a basic line graph in which the
area underneath the line is shaded to represent the total value of each data
point. When several data series must be compared on the same graph, stacked
area charts are used.
Scatter Plot
A scatter plot displays data for two variables as represented by points plotted
against the horizontal and vertical axis. This type of data visualization is useful
in illustrating the relationships that exist between variables and can be used to
identify trends or correlations in data.
Pictogram Chart
Pictogram charts, or pictograph charts, are particularly useful for presenting
simple data in a more visual and engaging way. These charts use icons to
visualize data, with each icon representing a different value or category. For
example, data about time might be represented by icons of clocks or watches.
Each icon can correspond to either a single unit or a set number of units (for
example, each icon represents 100 units).
Timeline
Timelines are the most effective way to visualize a sequence of events in
chronological order. They’re typically linear, with key events outlined along the
axis. Timelines are used to communicate time-related information and display
historical data.
Word Cloud
A word cloud, or tag cloud, is a visual representation of text data in which the
size of the word is proportional to its frequency. The more often a specific word
appears in a dataset, the larger it appears in the visualization. In addition to size,
words often appear bolder or follow a specific color scheme depending on their
frequency.Network Diagram
Correlation Matrices
A correlation matrix is a table that shows correlation coefficients between
variables. Each cell represents the relationship between two variables, and a
color scale is used to communicate whether the variables are correlated and to
what extent.
3. 16 CO1 U
pivot tables help us summarize key statistics in a single view. The type of
information collected in pivot tables might include sums, means or other numerical
summaries.
While pivot tables aren’t always the most visually inspiring form of data viz, they are
useful in the right context. For instance, highlight tables, as shown in the image,
use different shades or colors to easily flag the highest and lowest values in a
dataset.
2. Boxplots
Another useful type of descriptive visualization is the boxplot (also known as a box-
and-whisker plot). Like pivot tables, boxplots are useful for visualizing a dataset’s
key statistics. We can use them to represent minimum and maximum values, the
median value, and the lower and upper quartiles (i.e. the median of the lower and
upper halves of the data).
This makes them useful for exploratory and explanatory data analysis, i.e. getting to
understand a dataset’s key features before drawing any broad conclusions about it.
3. Scatterplots
Area charts, similar to line charts, are also used for tracking data over time.
However, in an area chart, the space between the plotted line and the x-axis is
shaded or colored for visibility. This is particularly useful for highlighting the
difference between multiple variables, or for measuring overall volumes (rather than
highlighting the difference between discrete data points).
4. Explain in detail about Visualization design options with example. 16 CO1 U
Visualization during design refers to the visual mental images used by the designer
during the design process. Visualization enables the generation, interpretation and
manipulation of information through spatial representation. In other words, it is the
mental pictures used by a designer when completing a design task.
1) Choose the chart that tells the story. There may be more than one way to
visualize the data accurately. In this case, consider what you’re trying to achieve,
the message you’re communicating, who you’re trying to reach, etc.
2) Remove anything that doesn’t support the story. The great thing about data
visualization is that design can help do the heavy lifting to enhance and
communicate the story. Let it do its job. (But don’t use 3D charts.
3) Design for comprehension. Once you have your visualization created, take a
step back and consider what simple elements might be added, tweaked, or
removed to make the data easier for the reader to understand. You might add a
trend line to a line chart, or you might realize you have too many slices in your pie
chart
4) Include a zero baseline if possible. Although a line chart does not have to start at
a zero baseline, it should be included if it gives more context for comparison. If
relatively small fluctuations in data are meaningful (e.g., in stock market data), you
may truncate the scale to showcase these variances.
5) Always choose the most efficient visualization. You want visual consistency so
that the reader can compare at a glance. This might mean you use stacked bar
charts, a grouped bar chart, or a line chart. Whatever you choose, don’t overwhelm
by making the reader work to compare too many things.
6) Watch your placement. You may have two nice stacked bar charts that are
meant to let your reader compare points, but if they’re placed too far apart to “get”
the comparison, you’ve already lost.
7) Tell the whole story. Maybe you had a 30% sales increase in Q4. Exciting! But
what’s more exciting? Showing that you’ve actually had a 100% sales increase
since Q1.
8) Don’t over explain. If the copy already mentions a fact, the subhead, callout, and
chart header don’t have to reiterate it.
9) Keep chart and graph headers simple and to the point. There’s no need to get
clever, verbose, or pun-tastic. Keep any descriptive text above the chart brief and
directly related to the chart underneath. Remember: Focus on the quickest path to
comprehension.
10) Use callouts wisely. Callouts are not there to fill space. They should be used
intentionally to highlight relevant information or provide additional context.
11) Don’t use distracting fonts or elements. Sometimes you do need to emphasize
a point. If so, only use bold or italic text to emphasize a point—and don’t use them
both at the same time.
12) Use a single color to represent the same type of data. If you are depicting sales
month by month on a bar chart, use a single color. But if you are comparing last
year’s sales to this year’s sales in a grouped chart, you should use a different color
for each year. You can also use an accent color to highlight a significant data point.
13) Watch out for positive and negative numbers. Don’t use red for positive
numbers or green for negative numbers. Those color associations are so strong it
will automatically flip the meaning in the viewer’s mind.
14) Make sure there is sufficient contrast between colors. If colors are too similar
(light gray vs. light, light gray), it can be hard to tell the difference. Conversely, don’t
use high-contrast color combinations such as red/green or blue/yellow.
15) Avoid patterns. Stripes and polka dots sound fun, but they can be incredibly
distracting. If you are trying to differentiate, say, on a map, use different saturations
of the same color. On that note, only use solid-colored lines (not dashes).
16) Select colors appropriately. Some colors stand out more than others, giving
unnecessary weight to that data. Instead, use a single color with varying shade or a
spectrum between two analogous colors to show intensity. Remember to intuitively
code color intensity according to values as well.
Illustrate the seven stages of Data visualization with neat diagram.