You are on page 1of 12

107.

258 Computerstatistik
General course information

Camilla Damian (material from Laura Vana Gür)

WS 2023
Course design
I Topics:
I Thorough introduction to the statistical software system in R:
data structures, data manipulation and visualization
I Statistical methods: hypothesis testing and regression

I Online sessions on zoom (link in TUWEL)


I Lecture – Thursdays, 13:00-16:00 (recordings from previous
semesters are available, lecture will serve as discussion and
Q&A). The exception (due to public holidays) is October 30,
2023 (a Monday)
I Practical – Mondays, 09:00-11:00

I Lecture: mix of slides, live coding in R and interactive exercises


(not graded)
I Practicals: student presentations of submitted homework
assignments (ca. 90 min)
Grading
I The grade consists of:
I 75 points homework assignments
I 25 points final project (case study on the statistical analysis of a
dataset)
I Requirements for a positive grade:
I ≥ 30% for each of 3 out of 6 homework assignments
I ≥ 50% for the final project

Grade Points
excellent (sehr gut) >= 91
good (gut) [81, 91)
satisfactory (befriedigend) [71, 81)
sufficient (genügend) [61, 71)
Deliverables: homework assignments

I 6 homework assignments to be handed in individually over


TUWEL
I To be submitted as an .Rmd and .PDF (created with R
Markdown)
I A new assignment will be available online each week after the
lecture (evening) and shall be handed in after 10 days (by
midnight). For example, if the lecture is on a Thursday, you will
need to upload the assignment by Sunday of the following week,
by midnight
I Best 5 out of 6 will enter the final grade
I During the practicals, students will be randomly asked to
present the solutions to the submitted assignment. If the
selected student is not present, the grade for the whole
assignment is set to 0
Deliverables: Final project

I A ca. 10 page report containing a full statistical analysis on a


data set which will be given to you in advance (one month
before submission deadline)
I Project should be completed individually
I Release of project: 10.11.2023
I Deadline for submission of the final project: 10.12.2023
(hopefully graded by Christmas)
Tentative schedule (I)
I Thursday, 05.10.2023 (Lecture): Intro to R and Rstudio, data
structures, subsetting. Release of Assignment 1
I Monday, 09.10.2023 (Practical): Initial session

I Thursday, 12.10.2023 (Lecture): R functions, basic statistics,


basic graphs. Release of Assignment 2
I Monday, 16.10.2023 (Practical): Student presentations of
Assignment 1 (due Sunday, 15.10.2023 at 23:59)
I Thursday, 19.10.2023 (Lecture): Data handling in R & Further
topics in R. Release of Assignment 3
I Monday, 23.10.2023 (Practical): Student presentations of
Assignment 2 (due Sunday, 22.10.2023 at 23:59)
I Monday, 30.10.2023 (Lecture): Further topics in R. Release of
Assignment 4
I Monday, 06.11.2023 (Practical): Student presentations of
Assignment 3 (due Sunday, 29.10.2023 at 23:59)
Tentative schedule (II)

I Thursday, 09.11.2023 (Lecture): Regression. Release of


Assignment 5
I Monday, 13.11.2023 (Practical): Student presentations of
Assignment 4 (due Thursday, 09.11.2023 at 23:59)
I Thursday, 16.11.2023 (Lecture): Linear algebra and
optimization. Release of Assignment 6
I Monday, 20.11.2023 (Practical): Student presentations of
Assignment 5 (due Sunday, 19.11.2023 at 23:59)
I Thursday, 23.11.2023 (Lecture): Testing
I Monday, 27.11.2023 (Practical): Student presentations of
Assignment 6 (due Sunday, 26.11.2023 at 23:59)
Software used in the course
In the course we will use the following software (note dependency on
operating system)
I R (version ≥ 4.1.0): https://www.r-project.org
I Windows: Miktex http://miktex.org
I macOS: MacTeX
https://tug.org/mactex/mactex-download.html
I Windows: Rtools
https://cran.r-project.org/bin/windows/Rtools
I Rstudio: https://www.rstudio.com/
Note that all this software is free and can be installed on your own
computer
As a hint - to make your life easier, install especially R and Rtools
such that no folder has a space in its name (e.g., ‘Program Files’)
Also, it is best install them in the order mentioned above
Course goals

After the course, I hope everyone will


I have a sound understanding of R
I be able to program in R
I be able to make high quality graphics using R
I have some experience analyzing data using R
Why R I
Among the most popular programming languages in 2021 according
to IEEE Spectrum:
Why R II

From a survey among “quantitative business professionals”:


Resources

I All materials used in the class (slides, R codes etc) can be


found in TUWEL
I Style and code conventions:
I https://google.github.io/styleguide/Rguide.html
I http://adv-r.had.co.nz/Style.html

I Info on installing R and Rstudio:


https://rstudio-education.github.io/hopr/starting.html
I Recommended books:
I A First Course in Statistical Programming with R by W. John
Braun and Duncan J. Murdoch.
I Hands-On Programming with R by Garrett Grolemund:
https://rstudio-education.github.io/hopr/
I Advanced R by Hadley Wickham https://adv-r.hadley.nz/

You might also like