Professional Documents
Culture Documents
Programming
Syntax
matrix(data, nrow, ncol, byrow, dimnames)
Where:
• data is the input vector which becomes the data elements of the matrix.
• nrow is the number of rows to be created.
• ncol is the number of columns to be created.
• byrow is a logical clue. If TRUE, then the input vector elements are arranged by
row.
• dimname is the names assigned to the rows and columns.
Example
Mymatrix <- matrix(c(1:25), nrow = 5, ncol = 5, byrow = TRUE)
Arrays
Arrays in R are data objects which can be used to store data in more
than two dimensions. It takes vectors as input and uses the values
in the dim parameter to create an array.
The basic syntax for creating an array in R is −
array(data, dim, dimnames) Where:
• data is the input vector which becomes the data elements of the
array.
• dim is the dimension of the array, where you pass the number of
rows, column and the number of matrices to be created by
mentioned dimensions.
• dimname is the names assigned to the rows and columns.
Example - Myarray <- array( c(1:16), dim=(4,4,2))
Array
Example for Array
vector1 <- c(6,2,1)
vector2 <- c(10,16,22,43,15,25)
Result <-array(c(vector1,vector2),dim=c(3,3,2))
print (Result)
List
•What is a llist?
Lists are the R objects which contain elements of different
types like − numbers, strings, vectors and another list inside
it.
•Listcan containelementsofdifferentdata types
•Itcancontain numbers, vectorsorlistinsideitself
•Itiscreatedusinglist()function
Syntax
Listname <- list(values)
List
• Create a list
listdata <-list("Green","Red", c( 21, 32, 11),TRUE, 24.5, 11)
print (listdata)
print (listdata [1])
print (listdata [3])
• Assign names to the list
names(listdata) <- c("lst quarter","A_matrix","A Innerlist")
• Add element to some position in the list
listdata[4] <- "New element"
Cont…..
print (listdata[4])
• Remove the fourth element from list
listdata[4] <- NULL
• Print 4th Element
print(listdata[4])
• Update 3rd element
listdata[3] <- "Updated element"
print(listdata[3])
Difference
ggplot2 can serve as a replacement for the base graphics in R and contains
a number of defaults for web and print display of common scales.
Since 2005, ggplot2 has grown in use to become one of the most popular
R packages. It is licensed under GNU GPL v2.
•Basic Visualization
•Histogram
•Bar / Line Chart
•Box plot
• Scatter plot
•Advanced Visualization
•Heat Map
•Mosaic Map
•Map Visualization
• 3D Graphs Correlogram
Histogram
• Histogram is a visual tool for presenting variable
data. It
• organises data to describe the process performance.
• Additionally histogram shows the amount and
pattern of the
• variation from the process.
• Histogram offers a snapshot in time of the process
performance.
When Are Histograms Used?
• Summarize large data sets graphically.
• Compare measurements to specifications.
• Communicate information to the team.
• Assist in decision making.
What are the parts of a Histogram?
Histogram is made up of Four parts:
(1).TITLE: The title briefly describes the information that is
contained in the Histogram.
(2).HORIZONTAL or X AXIS: The horizontal or X-axis shows you
the scale of values into which the measurements fit. These
measurements are generally grouped into intervals to help you
summarize large data sets.
(3).BARS: The bars have two important characteristics—height
and width. The height represents the number of times the values
within an interval occurred. The width represents the length of
the interval covered by the bar. It is the same for all bars.
(4).VERTICAL or Y AXIS: The vertical or Y-axis is the scale that
shows you the number of times the values within an interval
occurred.
37
Constructing a Histogram
• Step 1 - Count number of data points
• Step 2 - Summarize on a tally sheet
• Step 3 - Compute the range
• Step 4 - Determine number of intervals
• Step 5 - Compute interval width
• Step 6 - Determine interval starting points
• Step 7 - Count number of points in each interval
• Step 8 - Plot the data
• Step 9 - Add title
38
Advantages of Histogram
piepercent<- round(100*x/sum(x), 1)
png(file = "city_percentage_legends.png")
png(file = "3d_pie_chart.png")
dev.off()
3D Pie Chart
Bar charts
• A bar chart represents data in rectangular bars
with length of the bar proportional to the
value of the variable.
• R uses the function barplot() to create
bar charts.
• R can draw both vertical and horizontal bars
in the bar chart.
• In bar chart each of the bars can be
given different colors.
Bar charts – Syntax
• The basic syntax to create a bar-chart in R is −
barplot(H, xlab, ylab, main, names.arg, col)
• Following is the description of the parameters used −
– H is a vector or matrix containing numeric values used
in bar chart.
– xlab is the label for x axis.
– ylab is the label for y axis.
– main is the title of the bar chart.
– names.arg is a vector of names appearing under
each bar.
– col is used to give colors to the bars in the graph.
Bar charts – Example
# Create the data for the chart. H <-
c(7,12,28,3,41)
dev.off()
Bar chart with attributes
Bar chart – Stacked
colors <- c("green","orange","brown")
months <- c("Mar","Apr","May","Jun","Jul")
regions <- c("East","West","North")
Values <-
matrix(c(2,9,3,11,9,4,8,7,3,12,5,2,8,10,11
),nrow =
3,ncol = 5,byrow = TRUE)
png(file = "barchart_stacked.png")
dev.off()
v <- c(2, 3, 8 , 7.5, 9,10,15)
barplot(v, col="red", horiz=TRUE)
group <- LETTERS[1:7]
barplot(v, col="red", names.arg = group)
Bar chart – Stacked
Boxplot
• Boxplots are a measure of how well distributed
is the data in a data set.
• It divides the data set into three quartiles. This
graph represents the minimum, maximum,
median, first quartile and third quartile in the
data set.
• It is also useful in comparing the distribution of
data across data sets by drawing boxplots for
each of them.
• Boxplots are created in R by using the
boxplot() function.
Boxplot – Syntax
• The basic syntax to create a boxplot in R is −
boxplot(x, data, notch, varwidth, names, main)
• Following is the description of the parameters used −
– x is a vector or a formula
– data is the data frame.
– notch is a logical value. Set as TRUE to draw a notch.
– varwidth is a logical value. Set as true to draw width of
the box proportionate to the sample size.
– names are the group labels which will be printed
under each boxplot.
– main is used to give a title to the graph.
Boxplot – Example
• We use the data set "mtcars" available in the
R environment to create a basic boxplot.
Let's look at the columns "mpg" and "cyl" in
mtcars.
dev.off()
Multiple lines in chart
ScatterPlot
• Scatterplots show many points plotted in
the Cartesian plane.
• Each point represents the values of
two variables.
• One variable is chosen in the horizontal axis
and another in the vertical axis.
• The simple scatterplot is created using
the plot() function.
ScatterPlot – Example
• We use the data set "mtcars" available in the
R environment to create a basic scatterplot.
• Let's use the columns "wt" and "mpg" in
mtcars.
png(file = "scatterplot.png")
# Plot the chart for cars with weight between 2.5 to 5 and
mileage between 15 and 30.
plot(x = input$wt,y = input$mpg,
xlab = "Weight",
ylab = "Milage",
xlim = c(2.5,5),
ylim = c(15,30),
main = "Weight vs Milage"
)
dev.off()
ScatterPlot – Example
Scatter Plot matrices
• When we have more than two variables and we want
to find the correlation between one variable versus
the remaining ones we use scatterplot matrix.
• We use pairs() function to create matrices of
scatterplots.
– Syntax:
pairs(formula, data)
– Following is the description of the parameters used
−
formula represents the series of variables used in
pairs.
data represents the data set from which the variables
will be taken.
Scatter Plot matrices – Example
# Give the chart file a name.
png(file = "scatterplot_matrices.png")
pairs(~wt+mpg+disp+cyl,data = mtcars,main =
"Scatterplot Matrix")
dev.off()
Scatter Plot matrices – Example
Useful resources
Thank you
This presentation is created using LibreOffice Impress 4.2.8.2, can be used freely as per GNU General Public License
tushar@tusharkute.com
Histogram
• A histogram represents the frequencies
of values of a variable bucketed into
ranges.
• Histogram is similar to bar chart but
the difference is it groups the values
into continuous ranges.
• Each bar in histogram represents the height
of the number of values present in that
range.
• R creates histogram using hist() function.
This function takes a vector as an input and
uses some more parameters to plot
Thank You
Any Queries