Professional Documents
Culture Documents
Nivaldo Torres
06/11/2020
R Markdown
This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and
MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.
When you click the Knit button a document will be generated that includes both content as well as the
output of any embedded R code chunks within the document. You can embed an R code chunk like this:
Questão 1 a)
bnames <- read.csv("bnames.csv")
births <- read.csv("births.csv")
head(bnames)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
1
bnames2 <- left_join(bnames, births, by = c("year","sex"))
bnames2 <- mutate(bnames2, n = prop * births)
bnames2 <- mutate(bnames2, n = round(prop * births))
meninas_2003 <- filter(bnames2,sex=="girl",year=="2003")
slice(meninas_2003, (1:10))
20000
name
Emily
n
Madison
10000
2
library(dplyr)
bnames2 <- left_join(bnames, births, by = c("year","sex"))
bnames2 <- mutate(bnames2, n = prop * births)
bnames2 <- mutate(bnames2, n = round(prop * births))
View(bnames2)
e) descreva sucintamente como usar a função “str_sub” do pacote “stringr” library(stingr) Esta função
extrai partes de um string. Assim como o string, str_sub () leva os argumentos de início e fim que
fornecem a (inclusive) posição do substring.
library(stringr)
x <- c("Apple", "Banana", "Pear")
str_sub(x, 1, 3)
g) Inicial mais popular para meninos e para meninas considerando os dados de todos os anos?
library(stringr)
library(reshape2)
library(dplyr)
inicial_masculino <- bnames3 %>% select (year,sex,n, initial) %>% filter(sex=="boy") %>% group_by(initi
h) representar graficamente as informações da letra g - letra J para menino e letra M para menina
library(ggplot2)
fig1 <- qplot(initial, frequencia, data = inicialmascfem, geom = "point", color = sexo)
fig1 + geom_point(aes(size = frequencia))
3
frequencia
2e+07
5.0e+06
1.0e+07
1.5e+07
frequencia
2.0e+07
2.5e+07
1e+07
sexo
feminino
masculino
0e+00
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
initial
options(scipen=999)
fig2 <- ggplot(data= inicialmascfem, aes(x=initial,y= frequencia, fill= sexo)) + geom_bar(stat="identity
fig2
4
20000000
frequencia
sexo
feminino
masculino
10000000
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
initial