Computer Graphics

SYBSc (IT) - Sem IV Computer Oriented Statistical Techniques
PRACTICAL NO.1
AIM: Program to execute the basic commands, array, list and frames in R programming
language.
Q1. Take any two variables a and b. Write a R-code to perform arithmetic operations on them.
R-CODE:
> a = 10
>b=5
> a+b
15
> a-b
> a*b
50
> a/b
> a%%b (a mod b) // Gives

remainder of first value with
second
R.K.Talreja College > plot(y,xlab='Probability',

ylab='Quantiles', type='h',col='Purple')
Page No
> a^3
1000
> exp(a)
22026.47
> a = 25 ; sqrt(a)
> b = - 9 ; abs(b)
Q.2 Create two vectors p and q. Write a R code to perform Arithmetic

operations. p = c(1, 3, 4, 7, 5, 0, 11) and q = c(4, 11, 0, 8, 1, 2, 6)

Page No
Q.3 Create two vectors p and q. Write a R code to perform Relational

operations.
p = c(4, 2.2, 8, 6) and q = c(5, 10, 8, 6.6)
Q.4 Create two vectors p and q. Write a R code to perform Logical operations.
p = c(4, 2.2, 8, 6) and q = c(5, 10, 8, 6.6)
Q5. Write a R code to combine following data structures:
Name Age Vote
John 18 FALSE
Alex 19 TRUE
Jay 20 TRUE
Name Age Vote
Monisha 18 TRUE
Sam 21 TRUE
Rhea 19 TRUE
If we want to combine two data frames vertically, the column name of the two
data frames must be the same.
Result = rbind(df1, df2)
Q6. Write a R code to combine following data structures:

Page No
Name Age Vote
Monisha 18 TRUE
Sam 21 TRUE
Rhea 19 TRUE
City Country
Mumbai India
Bengluru India
Toronto Canada
The cbind() function combines two or more data frames horizontally.
Result = cbind(df1, df2)
PRACTICAL NO.2
AIM: Program to create a Matrix and to perform the matrix operations.
Q.1. Create two matrices A and B. Write a R-code to perform mathematical operations.

Page No
3 4 5
6 7 8
9 10 11
3 6 9
4 7 10
5 8 11
R-code:
> A = matrix ( c(3:11),nrow=3,byrow="true" )
>A
[,1] [,2] [,3]
[1,] 3 4 5
[2,] 6 7 8
[3,] 9 10 11
> B = matrix ( c(3:11),nrow=3,byrow="false" )

>B
[,1] [,2] [,3]
[1,] 3 6 9
[2,] 4 7 10
[3,] 5 8 11
> A+B
[,1] [,2] [,3]
[1,] 6 10 14
[2,] 10 14 18
[3,] 14 18 22
> A-B
[,1] [,2] [,3]
[1,] 0 -2 -4
[2,] 2 0 -2
[3,] 4 2 0
> A%*%B
[,1] [,2] [,3]

[1,] 50 86 122
[2,] 86 149 212
[3,] 122 212 302

Page No
> A/B
[,1] [,2] [,3]
[1,] 1.0 0.6666667 0.5555556
[2,] 1.5 1.0000000 0.8000000
[3,] 1.8 1.2500000 1.0000000
>t(A)
[,1] [,2] [,3]
[1,] 3 6 9
[2,] 4 7 10
[3,] 5 8 11
> t(B)
[,1] [,2] [,3]
[1,] 3 4 5
[2,] 6 7 8
[3,] 9 10 11
Q.2. Write a R-Code to verify the following expressions for given matrices:
i) 2 2
( A+ B) = A + AB+BA +B
2
ii) ( AB )' =B ' A'

iii) (A2 )’ = (A’)2
Q.3. Write a R-Code to verify whether a given matrix is orthogonal?

Page No
PRACTICAL NO. 3
AIM: Program to execute the Statistical functions on the ungrouped / Raw data.
A): Measures of Central Tendency in R.

Q.1 Write a R code to calculate measures of central tendency of given data:
20, 30, 10 ,10, 25, 12
R-CODE:
> x = c(20, 30, 10 ,10, 25, 12)
> result1 = mean(x)
> print(result1)
[1] 17.83333
> result2 = median(x)
> print(result2)
[1] 16
> getmode = function(v)
{m = unique(v)
m[which.max(tabulate(match(x, m)))]
> result = getmode(x)
> print(result)
[1] 10

Page No
Result: Mean = 17.83333, Median = 16, Mode = 10
Q.2 Write a R code to calculate measures of central tendency of given

marks obtained by 10 students: 97, 78, 57, 78, 97, 66, 87, 64, 87, 78
B) : Study of measures of Dispersion.
Q.1. write a R-Code to find the Range, IQR, Coefficient of range of Rainfall
measured during 9 days in millimeters: 29,19, 26,12, 24, 21, 36, 35, 33
R-Code:
> Rainfall = c(29,19,26,12,24,21,36,35,33)
> r = max(Rainfall) - min(Rainfall)
>r
[1] 24
> IQR(Rainfall)
[1] 12
> cr = ( max(Rainfall) - min(Rainfall) ) / ( max(Rainfall) + min(Rainfall) )
> cr
[1] 0.5
Result: Range=24, IQR=12, Coefficient of range=0.5
Q2. Write a R-Code to calculate Quartile deviation and Coefficient of QD for

given data: 12, 18, 35, 42, 50, 45, 20, 8
R-Code:
> x = c(12, 18, 35, 42, 50, 45, 20, 8)
> Q = quantile(x, probs = c(0.25,0.50,0.75) )
>Q

Page No
25% 50% 75%
16.50 27.50 42.75
> QD = ( Q[3] - Q[1]) / 2
> QD
75%
13.125
> CQ = ( Q[3] - Q[1] ) / ( Q[3] + Q[1] )
> CQ
75%
0.443038
Result: Quartile deviati on = 13.125, Coefficient of QD= 0.443038
Q3. Write a R-Code to calculate the Standard deviation, Variance of Rainfall

measured during 9 days: 29, 19, 26, 12, 24, 21, 36, 35, 33
R-Code:
> Rainfall = c (29,19,26,12,24,21,36,35,33)
> sd(Rainfall)
[1] 8.006941
> var(Rainfall)
[1] 64.11111
Result: Standard deviation = 8.006941, Variance of Rainfall = 64.11111

Page No
PRACTICAL NO. 4
AIM: To import the data from Excel / .CSV file and to perform the measures of central
tendency.
Q1. Import the given data from the .csv file. Write a R-code to calculate measures
of central tendency of marks for each subject.
Name Marks Marks (Maths)

(English)
Jay 30 40
Alex 25.5 50
Caroline 35 45.5
Esmita 45.5 25.5
Nick 45 33
Nimit 42.5 35
R-Code:
>dir()
> d = read.csv("Data.csv")
>d
Name Marks..English. Marks..Maths.
1 Jay 30.0 40.0
2 Alex 25.5 50.0
3 Caroline 35.0 45.5
4 Esmita 45.5 25.5
5 Nick 45.0 33.0
6 Nimit 42.5 35.0
> Mean = mean (d$Marks..English)

> Mean
37.25
Page No
> Median = median (d$Marks..English)

> Median
38.75
> Mode = mode(d$Marks..English)
> Mode
"numeric"
> getmode = function(v)
{m = unique(v)
m[which.max(tabulate(match(d$Marks..English, m)))]
> result = getmode(x)
> print(result)
> Mean = mean (d$Marks..Maths)

> Mean
38.16667
> Median = median (d$Marks..Maths)
> Median
37.5
> Mode = mode (d$Marks..Maths)
> Mode
"numeric"

Page No
Q2.
id name salary start_date dept
1 A 6000 2012-01-01 IT
2 B 5000 2013-09-23
Operations
3 C 6500 2014-11-15 IT
4 D 7300 2014-05-11 HR
5 E 8000 2015-03-27 Finance
6 F 5500 2013-05-21 IT
7 G 6500 2013-07-30 Operations
8 H 7900 2014-06-17 Finance
Read the given data and calculate:
Q1. Get the number of rows and columns.
Q2. Get the persons having maximum salary and minimum salary.
Q3. Get the people who joined on or after 2014.
Q4. Get the mean, median and mode of salary.
R-code:
>dir()
> data = read.csv("input.csv")
> print(data)

Page No
1 1 A 6000 2012-01-01 IT
2 2 B 5000 2013-09-23 Operations
3 3 C 6500 2014-11-15 IT
4 4 D 7300 2014-05-11 HR
5 5 E 8000 2015-03-27 Finance
6 6 F 5500 2013-05-21 IT
7 7 G 6500 2013-07-30 Operations
8 8 H 7900 2014-06-17 Finance
Q1.
print(ncol(data))
print(nrow(data))
[1] 5
[1] 8
Q2.
> Max = subset(data, salary == max(salary))
> print(Max)
5 5 E 8000 2015-03-27 Finance
> Min = subset(data, salary == min(salary))
> print(Min)
2 2 B 5000 2013-09-23 Operations

Page No
Q3.
> result = subset(data, as.Date(start_date) > as.Date("2014-01-01"))
> print(result)
3 3 C 6500 2014-11-15 IT
4 4 D 7300 2014-05-11 HR
5 5 E 8000 2015-03-27 Finance
8 8 H 7900 2014-06-17 Finance
Q4.
> Mean = mean(data$salary)
> Median = median(data$salary)
> Mode = mode(data$salary)
> print(Mean)
[1] 6587.5
> print(Median)
[1] 6500
> print(Mode)
[1] "numeric"

Page No
PRACTICAL NO. 5
AIM: To import the data from Excel / .CSV file and to perform the measures of
dispersion.
Q1. Import the given data from the .csv file. Write a R-code to calculate measures of
dispersion of marks for each subject.
Name Marks Marks (Maths)

(English)
Jay 30 40
Alex 25.5 50
Caroline 35 45.5
Esmita 45.5 25.5
Nick 45 33
Nimit 42.5 35
R-Code:
>dir()
> d = read.csv("Data.csv")
>d
Name Marks..English. Marks..Maths.
1 Jay 30.0 40.0
2 Alex 25.5 50.0
3 Caroline 35.0 45.5
4 Esmita 45.5 25.5
5 Nick 45.0 33.0
6 Nimit 42.5 35.0

Page No
> SD = sd(d$Marks..English)
> SD
8.383019
> Variance = var(d$Marks..English)

> Variance
70.275
> r = max(d$Marks..English) - min(d$Marks..English)

>r
20
> IQR(d$Marks..English)
13.125
> cr = ( max(d$Marks..English) - min(d$Marks..English) ) /

( max(d$Marks..English) + min(d$Marks..English) )
> cr
0.2816901
> SD = sd(d$Marks..Maths)
> SD
8.880691
> Variance = var(d$Marks..Maths)

> Variance
78.86667
> r = max(d$Marks..Maths) - min(d$Marks..Maths)

>r
24.5
> IQR(d$Marks..Maths)
10.625
> cr = ( max(d$Marks..Maths) - min(d$Marks..Maths) ) /

( max(d$Marks..Maths) + min(d$Marks..Maths) )
> cr
0.3245033

Page No
> cov(d$Marks..English, d$Marks..Maths) // indicates the relationship of

two variables
[1] -66
There exists a negative relationship between two subjects.
Q2. Import the given data from the .csv file.
i) Write a R-code to calculate measures of dispersion of salary of each employee.

ii) What is the relation between current salary and expected salary.
iii) Plot the relation between current salary and expected salary.
id name salary start_date dept exp_salary

1 A 6000 2012-01-01 IT 6000
2 B 5000 2013-09-23 Operations 8000
3 C 6500 2014-11-15 IT 7000

4 D 7300 2014-05-11 HR 10000
5 E 8000 2015-03-27 Finance 15000
6 F 5500 2013-05-21 IT 12000
7 G 6500 30-07-2013 Operations 9000

Page No
8 H 7900 2014-06-17 Finance 9500
i)
> SD = sd(d$salary)
> SD
1088.167
> Variance = var(d$salary)

> Variance
1184107
> r = max(d$salary) - min(d$salary)

>r
3000
> IQR(d$salary)
1575
> cr = ( max(d$salary) - min(d$salary) ) / ( max(d$salary) + min(d$salary) )

> cr
0.2307692
ii)
>cov(d$salary, d$exp_salary)
>cov
1443750
There exists a positive correlation
iii)
plot(col1 , col2)
There exists a positive correlation among current salary and
expected salary.
PRACTICAL NO. 6
AIM: To import the data from .CSV file and to calculate Skewness.

Page No
Q1. Import the given data from the .csv file. Write an R-code to
calculate the Skewness and Kurtosis of scores obtained by 9
candidates. Also plot the skewness.
Candidates Alex John Steve Nimita James Caroline Vansh David Sumati
Score 10 45.5 80 65 57.5 39.5 35 25 15
R-Code:
> dir()
"Pract 6 Data.csv"
> d = read.csv( "Pract 6 Data.csv")
>d
Candidates Score
1 Alex 10.0
2 John 45.5
3 Steve 80.0
4 Nimit 65.0
5 James 57.5
6 Caroline 39.5
7 Vansh 35.0
8 David 25.0
9 Sumati 15.0
> Mean = mean(d$Score)
> Median = median (d$Score)
> SD = sd(d$Score )
Page No
> Sk = ( 3 * ( Mean - Median ) ) / SD
> Sk
[1] 0.2438718
To Plot Skewness:
> x = c(10, 45.5, 80, 65, 57.5, 39.5, 35, 25, 15)
> plot(x, type="o", main = "Skewness")
> plot(x, ylab = "score", type="o", main = "Skewness")
> kurtosis(d$Score)
1.992505
Interpretation:

Page No
The value of Sk > 0, hence the score obtained by candidates is Positively

Skewed (Skewed to right side).
The coefficient of kurtosis < 3, hence distribution of score of candidates is
Platykurtic (Flat distribution).
Q2. Import the given data from the .csv file. Write an R-code to
calculate the Skewness and kurtosis of scores obtained by 7
candidates. Also plot the skewness.
Candidates Alex John Steve Nimita James Carolin Vansh

e
Score 10 20 30 50 40 30 20
> kurtosis(d$Score)
2.123961
Practical No- 07
AIM: Study of Hypothetical testing (Upper tailed & Lower tailed tests) in R.

Page No
qnorm: To get percentiles from a normal distribution with mean µ and standard
deviation σ. It is inverse of pnorm i.e. CDF (Cumulative Distribution Function).
(Note: No need to write theory in Journal, this is just for

understanding)

Page No
Q.1. Suppose the food label on a cookie bag states that there is at most 2 grams of
saturated fat in a single cookie. In a sample of 35 cookies, it is found that the mean
amount of saturated fat per cookie is 2.1 grams. Assume that the population standard
deviation is 0.25 grams. At a 0.05 significance level, can we reject the claim on the food
label?
Hypothesis:
The null hypothesis is H0 : μ 2
The Alternative hypothesis is H1 : μ = 2
R-Code:
xbar = 2.1 # sample mean
mu = 2 # hypothesized value
sigma = 0.25 # population standard deviation
n = 35 # sample size
z
[1] 2.3664
We then compute the critical value at 0.05 significance level.
# critical value (table value)

[1] 1.6449
z cal > z.alpha
Result:
The z-value 2.3664 is greater than the critical value of 1.6449
Hence at 0.05 significance level, we reject (null hypothesis) the claim that there is at
most 2 grams of saturated fat in a cookie.
So we accept H1

Page No
Q.2. Suppose the food label on a cookie bag states that there is at most 2 grams of
saturated fat in a single cookie. In a sample of 35 cookies, it is found that the mean
amount of saturated fat per cookie is 2.1 grams. Assume that the sample standard
deviation is 0.3 gram. At a 0.05 significance level, can we reject the claim on the food
label?
Hypothesis:
The null hypothesis is H0 : μ 2
The alternative hypothesis is H1 : μ = 2
R-Code:
xbar = 2.1 # sample mean

mu = 2 # hypothesized value
s = 0.3 # sample standard deviation
n = 35 # sample size
t=(xbarmu) / ( s/sqrt(n) )
t
[1] 1.9720
We then compute the critical value at 0.05 significance level.

alpha = 0.05
t.alpha=qnorm(1alpha)
t.alpha # critical value (table value)
[1] 1.6991
tcal > t.alpha
Result:
The t-value 1.9720 is greater than the critical value of 1.6991.
Hence at 0.05 significance level, we can reject (null hypothesis) the claim that there is
at most 2 grams of saturated fat in a cookie.
So we accept H1

Page No
Practical No- 8
AIM: A. Study of R - Binomial distributions.
Functions for Binomial Distribution
Functions Description
dbinom(x, n, p) This function is used to find probability at a particular

value & generates pdf i.e. it finds: P(X = x)
x is the value at which the probability has to be found

out, n is the total number of trials and p is the
probability of success.
The function pbinom() is used to find the

pbinom(x, n, p) cumulative probability (cdf) of a data i.e. P(X <=
x)
qbinom(k, n, p) This function is used to find the nth quantile, that is if

P(X <= x) = k is given, it finds x.
Where k is the probability, n is the total number of

trials and p is the probability of success.
This function generates N random samples of a

particular probability.
rbinom(N, n, p)
Where N is numbers of samples (sample size), n is the
total number of trials, p is the probability of success.

Page No
Q.1 A fair coin is tossed 10 times. What is the probability of getting heads exactly 8
times? Generate pdf function for Binomial distribution in R.
R-Code:
> x = 1 : 10
> n = 8; p = 0.5
> y = dbinom(x, n, p)
> data.frame(y)
1 0.03125000
2 0.10937500
3 0.21875000
4 0.27343750
5 0.21875000
6 0.10937500
7 0.03125000
8 0.00390625
9 0.00000000
10 0.00000000
> plot(x, y, xlab='Success', ylab='Probability', type='h', col='Purple')

Page No
Q.2 A fair coin is tossed 10 times. What is the probability of getting heads at most 8
times (8 or less than)? Generate cdf function for Binomial distribution in R.
R-code:
> x = 1 : 10
> n = 8; p = 0.5
> y = pbinom(x, n, p)
> data.frame(y)
1 0.03515625
Page No
2 0.14453125
3 0.36328125
4 0.63671875
5 0.85546875
6 0.96484375
7 0.99609375
8 1.00000000
9 1.00000000
10 1.00000000
> plot(x,y,xlab='Success', ylab='Probability', type='h', col='Green')

Page No
Q3. Write a R-code to calculate the 19th quantile of a Binomial distribution with 30
trials with the prob of success to 0.5.
R-code:
> k=19; n=30; p = 0.5
> y=qbinom(0.19, 30, 0.5)
>y
[1] 13

Page No
Q.4 Write a R-code to calculate the 6 random samples from 10 samples with
probability 0.5
R-code:
> N= 6; n=10; p=0.5
> x = rbinom(N, n, p)
>x
[1] 4 3 6 4 6 4
> plot(x,xlab='Samples',ylab='Probability',type='h',col='Purple')

Page No
AIM: B. Study of R - Normal distribution
Functions Description
dnorm (x, mean, sd) Normal density
(Probability Density Function)
pnorm (x, mean, sd) Normal distribution
(Cumulative Distribution Function)
qnorm (p1, mean, sd) Quantile function of the Normal distribution. Quantile
function gives value of x such that P(X <= x)=p1.
Hence, the qnorm function is the inverse of the pnorm
function.)
rnorm (N, mean, sd) Generates random sample of size N
· x is a vector of numbers.
· p is a vector of probabilities.
· N is the number of samples (sample size).
· Here, mean is the mean value of the sample data. Also, its default value is zero.
· sd is the standard deviation. Its default value is 1.
(Note: No need to write theory in Journal)

Page No
Q.1 Obtain the PDF for x, where x∈

(4,4), with mean 1 and standard deviation of 3.
R Code:
x = -4:4
mean = 1
sd = 3
y=dnorm(x, mean, sd)
plot(x, y)

Page No
Q.2 Obtain the CDF for x, where x∈

(4,4), with mean 1 and standard deviation of 1.
R-Code:
x = - 4:4
mean = 1
sd = 1
y=pnorm(x, mean, sd)
plot(x,y)

Page No
Q.3. Generate a sample of 8 observations of a standard Normal distribution with 0

mean and SD of 3.
R-Code:
N=6
mean = 40
sd = 2
Page No
y = rnorm(N, mean, sd)
Output:
40.85860 38.49336 42.20388 36.85891 37.73531 41.91403

Page No

Computer Graphics

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Computer Graphics

Uploaded by

Copyright:

Available Formats

SYBSc (IT) - Sem IV Computer Oriented Statistical Techniques

> a%%b (a mod b) // Gives

R.K.Talreja College > plot(y,xlab='Probability',

Q.2 Create two vectors p and q. Write a R code to perform Arithmetic

R.K.Talreja College > plot(y,xlab='Probability',

Q.3 Create two vectors p and q. Write a R code to perform Relational

Q5. Write a R code to combine following data structures:

Name Age Vote

Name Age Vote

Result = rbind(df1, df2)

Q6. Write a R code to combine following data structures:

R.K.Talreja College > plot(y,xlab='Probability',

Name Age Vote

The cbind() function combines two or more data frames horizontally.

Result = cbind(df1, df2)

AIM: Program to create a Matrix and to perform the matrix operations.

R.K.Talreja College > plot(y,xlab='Probability',

> B = matrix ( c(3:11),nrow=3,byrow="false" )

[,1] [,2] [,3]

R.K.Talreja College > plot(y,xlab='Probability',

ii) ( AB )' =B ' A'

Q.3. Write a R-Code to verify whether a given matrix is orthogonal?

R.K.Talreja College > plot(y,xlab='Probability',

A): Measures of Central Tendency in R.

> result1 = mean(x)

> result2 = median(x)

> getmode = function(v)

> result = getmode(x)

R.K.Talreja College > plot(y,xlab='Probability',

Result: Mean = 17.83333, Median = 16, Mode = 10

Q.2 Write a R code to calculate measures of central tendency of given

B) : Study of measures of Dispersion.

> Rainfall = c(29,19,26,12,24,21,36,35,33)

> r = max(Rainfall) - min(Rainfall)

> cr = ( max(Rainfall) - min(Rainfall) ) / ( max(Rainfall) + min(Rainfall) )

Result: Range=24, IQR=12, Coefficient of range=0.5

Q2. Write a R-Code to calculate Quartile deviation and Coefficient of QD for

> x = c(12, 18, 35, 42, 50, 45, 20, 8)

> Q = quantile(x, probs = c(0.25,0.50,0.75) )

R.K.Talreja College > plot(y,xlab='Probability',

25% 50% 75%

16.50 27.50 42.75

> QD = ( Q[3] - Q[1]) / 2

> CQ = ( Q[3] - Q[1] ) / ( Q[3] + Q[1] )

Result: Quartile deviati on = 13.125, Coefficient of QD= 0.443038

Q3. Write a R-Code to calculate the Standard deviation, Variance of Rainfall

> Rainfall = c (29,19,26,12,24,21,36,35,33)

Result: Standard deviation = 8.006941, Variance of Rainfall = 64.11111

R.K.Talreja College > plot(y,xlab='Probability',

Name Marks Marks (Maths)

> Mean = mean (d$Marks..English)

> Median = median (d$Marks..English)

> getmode = function(v)

> result = getmode(x)

> Mean = mean (d$Marks..Maths)

R.K.Talreja College > plot(y,xlab='Probability',

id name salary start_date dept

5 E 8000 2015-03-27 Finance

7 G 6500 2013-07-30 Operations

8 H 7900 2014-06-17 Finance

Read the given data and calculate:

Q1. Get the number of rows and columns.

Q3. Get the people who joined on or after 2014.