Setia Pramana
2015
Computational Statistics 1
Course Outline
• Introduction
– Different Statistical Software
• Data Preparation, Management, Manipulation,
Summarization with:
– SPSS
– R (R Commander)
– Microsoft Excel
• Data Tabulation and Visualization
Course Outline
• Hypothesis testing: one- and two-sample t-tests (tests
for differences in means, proportions, and variances)
• Analysis of Variance (ANOVA): one-way and two-way
ANOVA.
• Introduction to Design of Experiments
• Final Project
Course Workload
Reference Books
• John Maindonald and W. John Braun. Data Analysis and
Graphics Using R: An Example-Based Approach. 3rd
Edition. Cambridge University Press: Cambridge. 2010.
• John Fox. The R Commander: A Basic-Statistics Graphical
User Interface to R. Journal of Statistical Software,
Volume 14, Issue 9, September 2005.
• Chris Beeley. Web Application Development with R
Using Shiny. Packt Publishing: Birmingham. 2013.
• SPSS Statistics Base User’s Guide 17.0. Polar
Engineering and Consulting: Chicago. 2007.
• Jurusan Komputasi Statistik (Department of Computational
Statistics). Modul Mata Kuliah Komputasi Statistik
(Computational Statistics Course Module). 2014.
• G. Jay Kerns. Introduction to Probability and Statistics
Using R. E-book. GNU Free Documentation License.
2010.
• Geof H. Givens and Jennifer A. Hoeting. Computational
Statistics, 2nd edition. John Wiley and Sons: New
Jersey. 2013.
• Jochen Voss. Statistical Computing. E-book. 2011.
• Brent B. Welch, Ken Jones and Jeffrey Hobbs. Practical
Programming in Tcl and Tk. 4th edition. Prentice Hall
PTR: New Jersey. 2003.
Other Materials
• https://sites.google.com/site/biostatinfocore/home/rworkshop
• https://sites.google.com/site/biostatinfocore/biostatistics-workshop
Introduction
Statistics?
What is Statistics?
Observations on the Bills of Mortality (1662)
Recorded plague-related deaths for 100 years
What is Statistics?
• Exploring data: Using graphical and numerical
techniques to study patterns and departures from
patterns (in order to interpret data).
• Sampling and experimentation: Clarifying the
question, deciding on methods of collection and analysis
to produce valid information.
• Anticipating patterns: Exploring random phenomena
using probability and simulation. Probability is our tool for
anticipating distributions.
• Statistical inference: Estimating population parameters
and testing hypotheses.
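The ideas in this list can be tried out with a small simulation. The sketch below is only an illustration: it is written in Python rather than in the packages used in this course, and the die-rolling setup, the seed, and the number of rolls are all assumptions chosen for the example. It summarizes simulated data, compares the summary with what probability theory predicts, and computes an approximate confidence interval for the mean.

```python
import random
import statistics

# Hypothetical setup: simulate rolling a fair six-sided die.
# The seed and the number of rolls are arbitrary illustrative choices.
rng = random.Random(2015)
rolls = [rng.randint(1, 6) for _ in range(10_000)]

# Exploring data: numerical summaries of the simulated rolls.
sample_mean = statistics.mean(rolls)
sample_sd = statistics.stdev(rolls)

# Anticipating patterns: probability theory gives the expected value
# of a fair die as (1 + 2 + ... + 6) / 6 = 3.5.
expected_mean = sum(range(1, 7)) / 6

# Statistical inference: an approximate 95% confidence interval for the
# mean, using the normal approximation (reasonable for a large sample).
standard_error = sample_sd / len(rolls) ** 0.5
ci_low = sample_mean - 1.96 * standard_error
ci_high = sample_mean + 1.96 * standard_error

print(f"sample mean = {sample_mean:.3f}, theory = {expected_mean}")
print(f"approximate 95% CI: ({ci_low:.3f}, {ci_high:.3f})")
```

With a large number of rolls the sample mean should land close to the theoretical value of 3.5, which is the sense in which probability lets us anticipate the pattern before collecting data.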
“Statistical thinking will one day be as
necessary for efficient citizenship as the
ability to read and write.” H. G. Wells
Areas of Statistics
Descriptive Statistics
– 40% in the group of drivers who did not wear seat belts
– 20% in the group of drivers who did wear seat belts
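Descriptive summaries like these two percentages are simply group-wise proportions. A minimal sketch in Python, assuming hypothetical counts (100 drivers per group) that were invented only so the output matches the 40% and 20% on the slide; the outcome being counted is not named in the source, so it is left generic here:

```python
# Hypothetical counts: (drivers with the outcome, drivers in the group).
# These numbers are invented for illustration to reproduce 40% and 20%.
groups = {
    "no seat belt": (40, 100),
    "seat belt": (20, 100),
}


def percentage(count, total):
    """Return count as a percentage of total."""
    return 100 * count / total


# One descriptive statistic per group.
summary = {name: percentage(c, n) for name, (c, n) in groups.items()}

for name, pct in summary.items():
    print(f"{name}: {pct:.0f}%")
```

The percentages only describe the drivers who were observed. Deciding whether the difference holds for drivers in general is a question for inferential statistics.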
Inferential Statistics
Available Statistical Packages
• SPSS
• R
• MINITAB
• CSPro
• SAS
• WinBUGS
• Stata
• Epi Info
• Many more…
Microsoft Excel
Which one do you use?
Why?
Statistical Software Used
R is HOT!
Source: http://r4stats.com/articles/popularity/
What is R?
Why R?
But…
• R has no warranty.
• Command-line interface: difficult for some users.
• Users must learn a new way of thinking about data
and about the sequence of a data analysis.
• That’s all… I guess.
Companies using R in 2013
• The New York Times routinely uses R for interactive and print data
visualization.
• Google has more than 500 R users.
• The FDA supports the use of R for clinical trials of new drugs.
• The National Weather Service uses R to predict the extent of flooding
events.
• Zillow uses R to model housing prices.
• The Consumer Financial Protection Bureau uses R and other open
source tools.
• Twitter uses R for data science applications on the Twitter database.
• FourSquare uses R to develop its recommendation engine.
• Facebook uses R to model all sorts of user behaviour.
Source: Revolution Analytics
R Library/packages
• R Base Packages
• Add-on packages: IsoGene, nlme, lme4, foreign, survival, zoo,
reshape2, ggplot2
My R Packages
• IsoGene
• IsoGeneGUI
• nea
• neaGUI
• biclustGUI
• OCRME
• More details: http://setiopramono.wordpress.com/r-programming/
R for Cutting-Edge Technologies
R Graphics and Visualization
R Graphics
RCircos
Source: https://gjabel.wordpress.com/
R Graphical User Interfaces
• R uses a command-line interface, which advanced users
prefer: it allows direct control, is more accurate and
flexible, and makes the analysis reproducible.
• It requires good knowledge of the language, which makes
it difficult for beginners or infrequent users.
• R provides tools for building GUIs: RGUI.
R GUI Projects
• Python: OpenMeta-Analyst (Wallace et al., 2012)
• Java: JGR (Java GUI for R), Deducer (Fellows, 2012),
and Glotaran (Snellenburg, 2012)
• PHP: R-php (http://dssm.unipa.it/R-php/)
• Other extensions connect R to graphical toolkits for
developing menus and dialog boxes: Tcl/Tk, GTK.
RStudio
• Download from rstudio.com
• Powerful IDE (Integrated Development Environment) for R.
RGUI Developed using tcltk
RGUI: R Commander
• Rcommander.com
• Helpful for R beginners
• Installed from inside R
RGUI using C#: Wires
• Developed by STIS students
• For spatial data analysis
• Still under development…
RGUI: Web-Based App
WebBUGS
• Conducting Bayesian statistical analysis online
• Combines OpenBUGS and R
www.webbugs.psychstat.org
RGUI: Shiny
RGUI using Shiny: FAST
Dynamic Report Generation
• Sweave
• knitr
• markdown
Want to Learn R? Need Help?
Lots of self-learning resources:
http://www.rdatamining.com/resources/onlinedocs
Blogs:
Software   Number of Blogs   Blog Source
R          550               R-Bloggers.com
Python     60                SciPy.org
SAS        40                PROC-X.com, sasCommunity.org Planet
Stata      11                Stata-Bloggers.com
User groups: Stockholm R User Group, etc.
Indonesia/Jakarta?
https://sites.google.com/site/biostatinfocore/introduction-to-r
Need Help?