Professional Documents
Culture Documents
Scatter Plot
Data frame for analysis
We will use a data frame (ÇETINKAYA-RUNDEL 2019) comprising information on movies. It contains 600
observations (rows), each representing a movie, and 31 variables (columns). Let us load this data frame by
using read.csv function. You can download this data frame from the Code Files link of this tutorial.
movies <- read.csv("moviesData.csv")
paste("Information on", dim(movies)[1], "movies loaded.")
1
Movies' IMDB rating
10
8
IMDB rating
6
4
2
0
Filly Brown
The Dish
Waiting for Guffman
The Age of Innocence
Malevolence
Old Partner
Lady Jane
Mad Dog Time
Beauty Is Embarrassing
The Snowtown Murders
Superman II
Leap of Faith
The Royal Tenenbaums
School for Scoundrels
Rhinestone
Burn After Reading
The Doors
The Wood
Jason X
Dragon Wars
Now, the names are not being truncated, as it is evident from the graph plotted above.
2
IMDB rating
0 2 4 6 8 10
Fi
lly
References
W T Bro
a h w
Th itin e D n
e g i
Ag for sh
e G
M of I uffm
al nn a
e
edu/~mc301/data/movies.html.
O vol oce n
ld en n
La Par ce ce
Be Ma dy tne
au d D Jan r
Th ty og e
e Is
Sn Em Tim
ow b e
Su tow arra
s
3
p
Th Le ermn M sin
e ap a rd g u
R
Sc oy of n II ers
ho al Fa
ol Ten ith
fo
R r en
Bu h Sc bau
Movies' IMDB rating
rn ine oun m
Af sto dr s
t n e
Th er R e ls
e e
Th Do adin
e ors g
W
Ja oo
Dr so d
ag n
on X
W
ar
s
ÇETINKAYA-RUNDEL, MINE. 2019. “movies.RData – Mine Çetinkaya-Rundel.” http://www2.stat.duke.