Welcome to Scribd!

Skip carousel

Tutorial 2

Uploaded by

James Dakota

0% found this document useful (0 votes)

3 views10 pages

math

Original Title

Tutorial2

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

math

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

3 views10 pages

Tutorial 2

Uploaded by

James Dakota

math

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 10

Search inside document

MATH 2411: Desciptive Statistics and Graphics

with R

CAI Mingxuan, ZHAO Jia

Department of Mathematics, HKUST

2020/2/22
Summary statistics for a single group

I simple summary statistics

set.seed(1234)
x <- rnorm(1000)
mean(x) #mean

## [1] -0.0265972

sd(x) #standard deviation

## [1] 0.9973377

var(x) #variance

## [1] 0.9946825

median(x) #median

## [1] -0.03979419
Summary statistics for a single group

I simple summary statistics

quantile(x) #quantile

By default you get the minimum, the maximum, and the three
quartiles — the 0.25, 0.50, and 0.75 quantiles.
pvec <- seq(0,1,0.1)
quantile(x, pvec)

It is also possible to obtain other quantiles; this is done by adding an

argument containing the desired percentage points.
Summarize an entire data frame
I Galton Height data
Galton <- read.table("/Users/jiazhao/Documents/HKUST/TA/20Spr
summary(Galton)
## family father mother midparentHei
## 185 : 15 Min. :62.0 Min. :58.00 Min. :64.4
## 066 : 11 1st Qu.:68.0 1st Qu.:63.00 1st Qu.:68.1
## 120 : 11 Median :69.0 Median :64.00 Median :69.2
## 130 : 11 Mean :69.2 Mean :64.09 Mean :69.2
## 166 : 11 3rd Qu.:71.0 3rd Qu.:65.88 3rd Qu.:70.1
## 097 : 10 Max. :78.5 Max. :70.50 Max. :75.4
## (Other):865
## children childNum gender childHeig
## Min. : 1.000 Min. : 1.000 female:453 Min. :56
## 1st Qu.: 4.000 1st Qu.: 2.000 male :481 1st Qu.:64
## Median : 6.000 Median : 3.000 Median :66
## Mean : 6.171 Mean : 3.586 Mean :66
## 3rd Qu.: 8.000 3rd Qu.: 5.000 3rd Qu.:69
## Max. :15.000 Max. :15.000 Max. :79
##
Graphical display of distributions
I Histograms
hist(x)
Histogram of x
200
150
Frequency

100
50
0

−3 −2 −1 0 1 2 3

x
Graphical display of distributions
I Empirical cumulative distribution
y <- rnorm(50)
n <- length(y)
plot(sort(y),(1:n)/n,type="s",ylim=c(0,1))
1.0
0.8
0.6
(1:n)/n

0.4
0.2
0.0

−2 −1 0 1 2

sort(y)
Graphical display of distributions

I Q-Q plots One purpose of calculating the empirical cumulative

distribution function (c.d.f.) is to see whether data can be assumed
normally distributed. For a better assessment, you might plot the kth
smallest observation against the expected value of the kth smallest
observation out of n in a standard normal distribution.
Graphical display of distributions
I The point is that in this way you would expect to obtain a straight
line if data come from a normal distribution with any mean and
standard deviation.
qqnorm(y)
Normal Q−Q Plot
2
1
Sample Quantiles

0
−1
−2

−2 −1 0 1 2
Graphics for grouped data

I In dealing with grouped data, it is important to be able not only to

create plots for each group but also to compare the plots between
groups.
I Histograms

h1 <- min(Galton$mother)
h2 <- max(Galton$father)
hist(Galton$father,breaks = 100,xlim = c(h1,h2),
ylim=c(0,150),col="white")
hist(Galton$mother,breaks = 100,xlim = c(h1,h2),
ylim=c(0,150),col="grey")
Graphics for grouped data
I Parallel boxplots

boxplot(Galton[c("father","mother")])
75
70
65
60

father mother

MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
A1
Document8 pages
A1
DASHPAGAL
No ratings yet
WINSEM2019-20 MAT2001 ELA VL2019205002696 Reference Material I 31-Jan-2020 7 R LAB - Normal Distribution
Document38 pages
WINSEM2019-20 MAT2001 ELA VL2019205002696 Reference Material I 31-Jan-2020 7 R LAB - Normal Distribution
jeevan sai
No ratings yet
Z-Score Problems With The Normal Model: Objective
Document24 pages
Z-Score Problems With The Normal Model: Objective
Cleverton da Veiga
No ratings yet
Workshop 5: PDF Sampling and Statistics: Preview: Generating Random Numbers
Document10 pages
Workshop 5: PDF Sampling and Statistics: Preview: Generating Random Numbers
Levi Grantz
No ratings yet
WINSEM2019 20 MAT2001 ELA VL2019205002696 Reference Material I 31 Jan 2020 7 R LAB Normal Distribution
Document39 pages
WINSEM2019 20 MAT2001 ELA VL2019205002696 Reference Material I 31 Jan 2020 7 R LAB Normal Distribution
as
No ratings yet
NormalDistribution2012 PDF
Document29 pages
NormalDistribution2012 PDF
Elisa Dela Reyna Baladad
No ratings yet
Computational Techniques in Statistics: Exercise 1
Document5 pages
Computational Techniques in Statistics: Exercise 1
وجدان الشدادي
No ratings yet
Normal Distribution 2012
Document29 pages
Normal Distribution 2012
Gabriel Lloyd
No ratings yet
K Nearest Neighbours (KNN) : Short Intro To KNN
Document13 pages
K Nearest Neighbours (KNN) : Short Intro To KNN
Luka Filipovic
No ratings yet
Experiment No. 1: Objective: Write A MATLAB Program To Generate An Exponential Sequence X (N) (A)
Document53 pages
Experiment No. 1: Objective: Write A MATLAB Program To Generate An Exponential Sequence X (N) (A)
Shinibali Mandal
No ratings yet
QM2 Tutorial 3
Document26 pages
QM2 Tutorial 3
ducminhlniles
No ratings yet
8 Probability Distributions: 8.1 R As A Set of Statistical Tables
Document6 pages
8 Probability Distributions: 8.1 R As A Set of Statistical Tables
sansantosh
No ratings yet
Genetica Cuantitativa
Document120 pages
Genetica Cuantitativa
Alexis Josue Vallecillo Godoy
No ratings yet
R Commands
Document2 pages
R Commands
sowkya redd
No ratings yet
Document Stat
Document5 pages
Document Stat
Yohannes Alemu
No ratings yet
Prac 5 AP (Nikita)
Document3 pages
Prac 5 AP (Nikita)
Preet Jain
No ratings yet
02data Part2
Document34 pages
02data Part2
baigsalman251
No ratings yet
SB K49 Lecture3.2
Document34 pages
SB K49 Lecture3.2
hamoweyy
No ratings yet
Statistics Handbk Act08
Document12 pages
Statistics Handbk Act08
Yuni Wardani
No ratings yet
Tutprac 1
Document8 pages
Tutprac 1
Pham Truong Thinh Le
No ratings yet
ML Support Vector Machines 2
Document22 pages
ML Support Vector Machines 2
23mb0072
No ratings yet
All All: % (A) Construct Side-By-Side Stem-And-Leaf Plots
Document34 pages
All All: % (A) Construct Side-By-Side Stem-And-Leaf Plots
JASHWIN GAUTAM
No ratings yet
Unit3-Data Science
Document37 pages
Unit3-Data Science
DIVYANSH GAUR (RA2011027010090)
No ratings yet
STA101 Formula Sheet
Document4 pages
STA101 Formula Sheet
Conner Bieker
No ratings yet
R Assignment 8
Document9 pages
R Assignment 8
Stella Parker
No ratings yet
Homework 1: Statistics 109 Due February 17, 2019 at 11:59pm EST
Document23 pages
Homework 1: Statistics 109 Due February 17, 2019 at 11:59pm EST
BrianLe
No ratings yet
Lecture 2 - R Graphics PDF
Document68 pages
Lecture 2 - R Graphics PDF
AnsumanNath
No ratings yet
Fuzzy Decision Trees
Document9 pages
Fuzzy Decision Trees
ChakshuGrover
No ratings yet
Programming With R Test 2
Document5 pages
Programming With R Test 2
KamranKhan
50% (2)
Bioestadistica: Clara Carner 2023-05-29
Document4 pages
Bioestadistica: Clara Carner 2023-05-29
Clara Carner
No ratings yet
1 The Empirical Rule and Distribution
Document5 pages
1 The Empirical Rule and Distribution
Sarah Mendes
No ratings yet
Chapter 1 - Solutions of Exercises
Document4 pages
Chapter 1 - Solutions of Exercises
DanyValentin
No ratings yet
Week 2
Document7 pages
Week 2
Indri Br Situmorang
No ratings yet
04 Normal Approximation For Data and Binomial Distribution
Document24 pages
04 Normal Approximation For Data and Binomial Distribution
admirodebrito
No ratings yet
APL Assignment 3: Histogram
Document12 pages
APL Assignment 3: Histogram
Rituraj Chanda
No ratings yet
Industrial Statistics - A Computer Based Approach With Python
Document140 pages
Industrial Statistics - A Computer Based Approach With Python
htapiaq
No ratings yet
1 - Descriptive Statistics Part 3
Document41 pages
1 - Descriptive Statistics Part 3
Stephanie Cañete
No ratings yet
Project - P1: Simulation and Basic Inferential Data Analysis Project
Document6 pages
Project - P1: Simulation and Basic Inferential Data Analysis Project
Ougoust Drake
No ratings yet
Chapter 9 Cox Proportional Hazards: Library (Survival)
Document10 pages
Chapter 9 Cox Proportional Hazards: Library (Survival)
Yosef GUEVARA SALAMANCA
No ratings yet
String Functions: Extract 1st Word From String "Name"
Document28 pages
String Functions: Extract 1st Word From String "Name"
blakewil
No ratings yet
Normal Distribution 2012
Document29 pages
Normal Distribution 2012
kya karega
No ratings yet
Financial Accouting
Document7 pages
Financial Accouting
tinale1603
No ratings yet
Task 1
Document9 pages
Task 1
Dương Vũ Minh
No ratings yet
R Lecture#3
Document24 pages
R Lecture#3
Muhammad Hamdan
No ratings yet
Regression Analysis
Document280 pages
Regression Analysis
A.Benhari
100% (1)
Fundamental of Statistics 2
Document2 pages
Fundamental of Statistics 2
PrashansaBhatia
No ratings yet
Active Learning Task 8
Document7 pages
Active Learning Task 8
hapty
No ratings yet
Problem Set #1
Document6 pages
Problem Set #1
cflores48
No ratings yet
The Problem of Overfitting
Document40 pages
The Problem of Overfitting
Sahil Kaushish
No ratings yet
Activity
Document11 pages
Activity
kiyara
No ratings yet
APL Assignment 3: Histogram
Document11 pages
APL Assignment 3: Histogram
Balaji balu
No ratings yet
Implementing Custom Randomsearchcv: 'Red' 'Blue'
Document1 page
Implementing Custom Randomsearchcv: 'Red' 'Blue'
Tayub khan.A
No ratings yet
Lecture21 HypothesisTest1
Document53 pages
Lecture21 HypothesisTest1
Sonam Alvi
No ratings yet
Asim
Document27 pages
Asim
Muhammad Asim Muhammad Arshad
No ratings yet
Probability and Statistics With R For Engineers and Scientists 1St Edition Michael Akritas Solutions Manual Full Chapter PDF
Document35 pages
Probability and Statistics With R For Engineers and Scientists 1St Edition Michael Akritas Solutions Manual Full Chapter PDF
EarlCollinsmapcs
100% (8)
ECON1203 PASS Week 3
Document4 pages
ECON1203 PASS Week 3
mothermonk
No ratings yet
Chapter6 Stats
Document4 pages
Chapter6 Stats
Poonam Naidu
No ratings yet
STA101 Formula Sheet
Document4 pages
STA101 Formula Sheet
Olawale Aweda
No ratings yet
Vi. Standard Scores and The Normal Distribution
Document6 pages
Vi. Standard Scores and The Normal Distribution
carterrees
No ratings yet
Raksha Mantralaya Ministry of Defence
Document16 pages
Raksha Mantralaya Ministry of Defence
subhasmita sahu
No ratings yet
Daily Lesson Log Quarter 1 Week 1
Document5 pages
Daily Lesson Log Quarter 1 Week 1
John Patrick Famadulan
100% (1)
Reaserch On Effect of Social Media On Academic Performance: Study On The Students of University of Dhaka
Document27 pages
Reaserch On Effect of Social Media On Academic Performance: Study On The Students of University of Dhaka
Fatema Tuz Johoora
88% (114)
Marine Cargo Insurance
Document72 pages
Marine Cargo Insurance
Khanh Duyen Nguyen Huynh
No ratings yet
Honda Izy
Document16 pages
Honda Izy
Terry Ford
No ratings yet
Benedict Anderson, Imagined Communities
Document2 pages
Benedict Anderson, Imagined Communities
Monir Amine
0% (1)
Extract The .Msi Files
Document2 pages
Extract The .Msi Files
vladimir
No ratings yet
Flow of Food
Document2 pages
Flow of Food
Geneva
No ratings yet
Directorate of Technical Education, Admission Committee For Professional Courses (ACPC), Gujarat
Document2 pages
Directorate of Technical Education, Admission Committee For Professional Courses (ACPC), Gujarat
gamailkabaaaap
No ratings yet
LFF MG
Document260 pages
LFF MG
Rivo Roberalimanana
No ratings yet
IMDSI22
Document82 pages
IMDSI22
Dang Jinlong
No ratings yet
Veritas™ High Availability Agent For WebSphere MQ Installation and Configuration Guide / WebSphere MQ Installation
Document64 pages
Veritas™ High Availability Agent For WebSphere MQ Installation and Configuration Guide / WebSphere MQ Installation
karthickmsit
No ratings yet
Survivor's Guilt by Nancy Sherman
Document4 pages
Survivor's Guilt by Nancy Sherman
Ginnie Faustino-Galgana
No ratings yet
ST3 Manual
Document48 pages
ST3 Manual
Ron Foster
No ratings yet
Cpar Characteristics and Functions Week 3
Document128 pages
Cpar Characteristics and Functions Week 3
christianwood0117
No ratings yet
The Rock Reliefs of Ancient IranAuthor (
Document34 pages
The Rock Reliefs of Ancient IranAuthor (
mark_schwartz_41
No ratings yet
Outline Calculus3
Document20 pages
Outline Calculus3
Joel Curtis
No ratings yet
Lalit Resume-2023-Latest
Document2 pages
Lalit Resume-2023-Latest
Drew Ladlow
No ratings yet
IOT Questions and Answers - Solution
Document8 pages
IOT Questions and Answers - Solution
Omar Cheikhrouhou
No ratings yet
Refutation Essay
Document6 pages
Refutation Essay
api-314826327
No ratings yet
Toshiba Motors
Document16 pages
Toshiba Motors
Sergio Cabrera
100% (1)
Daewoo 710B PDF
Document59 pages
Daewoo 710B PDF
bgment
No ratings yet
Lect2 - 1151 - Grillage Analysis
Document31 pages
Lect2 - 1151 - Grillage Analysis
Cheong
100% (1)
Algorithms For Automatic Modulation Recognition of Communication Signals-Asoke K, Nandi, E.E Azzouz
Document6 pages
Algorithms For Automatic Modulation Recognition of Communication Signals-Asoke K, Nandi, E.E Azzouz
GONG
No ratings yet
Pelayo Pathopyhsiology
Document13 pages
Pelayo Pathopyhsiology
E.J. Pelayo
No ratings yet
Eapp Melc 12
Document31 pages
Eapp Melc 12
Christian Joseph Herrera
No ratings yet
Healthymagination at Ge Healthcare Systems
Document5 pages
Healthymagination at Ge Healthcare Systems
Prashant Pratap Singh
100% (1)
Zimbabwe - Medical - CPIN - v2.0 - GOV - UK
Document39 pages
Zimbabwe - Medical - CPIN - v2.0 - GOV - UK
sammy redganji
No ratings yet
Data Network Unit 6 - UC
Document15 pages
Data Network Unit 6 - UC
ANISHA DONDE
No ratings yet
21 Tara Mantra-Wps Office
Document25 pages
21 Tara Mantra-Wps Office
Alteo Falla
No ratings yet