FACULTY OF
DEPARTMENT OF
ASSIGNMENT REPORT
ANALYSIS USING R
Activity 1. Chicken Feed Data
Command:
data = read.csv("Data/chicken_feed-1.csv")
print(colnames(data))
head(data)
str(data)
summary(data)
Output:
Command:
data = subset(data,select=-c(X))
data = data[complete.cases(data),]
print(colnames(data))
head(data)
str(data)
summary(data)
Output:
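As an illustration of the cleaning step above, the following is a minimal sketch on a toy data frame. The values are invented; only the column names X, weight and feed come from the commands in this report. It shows that subset(..., select=-c(X)) drops the index column and that complete.cases() keeps only rows without missing values.

```r
# Toy data frame; the values are made up for illustration only.
toy <- data.frame(
  X      = 1:4,                        # row-index column, as often written by write.csv()
  weight = c(179, NA, 250, 203),
  feed   = c("horsebean", "linseed", NA, "casein")
)

# Drop the index column, then keep only rows that contain no NA.
toy <- subset(toy, select = -c(X))
rows_before <- nrow(toy)
toy <- toy[complete.cases(toy), ]

cat("Rows removed:", rows_before - nrow(toy), "\n")
print(toy)
```

Counting the removed rows is a cheap check that the cleaning step did not silently discard most of the data.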
Command:
data_no_factor = subset(data,select=-c(feed))
# assumed layout: one panel per column; par() returns the previous settings
op = par(mfrow=c(1,ncol(data_no_factor)))
lapply(seq(data_no_factor), function(x)
hist(x=data_no_factor[[x]], xlab=names(data_no_factor)[x],
main=paste("Histogram", names(data_no_factor)[x])))
par(op) # restore
########################################
Output:
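The par(op) call above restores graphics settings that were saved before plotting. The following is a minimal sketch of that save-and-restore pattern, assuming a 1 x 2 panel layout and random toy data. pdf(NULL) is used only so that the sketch runs without opening a plot window.

```r
pdf(NULL)                      # null device: nothing is written to disk

op <- par(mfrow = c(1, 2))     # par() returns the old values of what it changes
hist(rnorm(100), main = "Histogram a")
hist(runif(100), main = "Histogram b")
during <- par("mfrow")         # layout while plotting

par(op)                        # restore the saved layout
after <- par("mfrow")          # layout after restoring

invisible(dev.off())
print(during)
print(after)
```

Saving the return value of par() is what makes the later par(op) meaningful. Without it, op is undefined and the restore step fails.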
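Command:
# The plotting call is missing from the source; boxplot() is assumed here by analogy with Activity 2
op = par(mfrow=c(1,ncol(data_no_factor))) # assumed layout
lapply(seq(data_no_factor), function(x) boxplot(x=data_no_factor[[x]], main=paste("Boxplot", names(data_no_factor)[x])))
par(op) # restore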
Output:
Command:
pairs(data)
Output:
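A scatterplot matrix from pairs() is read by eye. A correlation matrix gives the same pairwise relationships as numbers. The following is a minimal sketch on synthetic data. The columns a, b, c and group are hypothetical and do not come from the assignment files.

```r
set.seed(1)

# Synthetic data: b depends strongly on a, c is independent noise.
toy <- data.frame(a = rnorm(50))
toy$b <- 2 * toy$a + rnorm(50, sd = 0.1)
toy$c <- rnorm(50)
toy$group <- factor(rep(c("g1", "g2"), 25))

# cor() needs numeric input, so keep only the numeric columns.
num_cols <- toy[sapply(toy, is.numeric)]
cor_mat <- cor(num_cols)

print(round(cor_mat, 2))
```

Pairs with a correlation close to 1 or -1 would show up in the pairs() output as points lying near a straight line.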
Command:
LinearRegressions = lm(weight~.,data=data)
summary(LinearRegressions)
Output:
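To help read a summary like the one above, the following is a minimal sketch of how lm(weight~.) handles a categorical predictor such as feed. The data is synthetic and the level names and values are invented. The point is only that R creates one coefficient per non-reference level in addition to the intercept.

```r
set.seed(42)

# Synthetic stand-in for the chicken feed data; values are invented.
toy <- data.frame(
  feed = factor(rep(c("casein", "linseed", "soybean"), each = 10))
)
toy$weight <- c(rnorm(10, 320, 20), rnorm(10, 220, 20), rnorm(10, 250, 20))

# "." means: use every other column of the data frame as a predictor.
fit <- lm(weight ~ ., data = toy)

# The intercept is the mean of the reference level (first level alphabetically);
# the other coefficients are differences from that reference level.
print(coef(fit))

# Share of the variance in weight explained by the model.
print(summary(fit)$r.squared)
```

With a single factor predictor, the intercept equals the mean weight of the reference group, which is an easy sanity check on the fitted model.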
Activity 2. Caffeine Data
Command:
data = read.csv("Data/caffeine.csv")
print(colnames(data))
str(data)
summary(data)
head(data)
Picture 2.7: Output showing the column that needs cleaning
Data that needs cleaning: column drink.
Command:
data = subset(data,select=-c(drink))
data_no_factor = subset(data,select=-c(type))
######### Draw histograms ########
# assumed layout: one panel per column; par() returns the previous settings
op = par(mfrow=c(1,ncol(data_no_factor)))
lapply(seq(data_no_factor), function(x)
hist(x=data_no_factor[[x]], xlab=names(data_no_factor)[x],
main=paste("Histogram", names(data_no_factor)[x])))
par(op) # restore
########################################
Output:
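Command:
# assumed layout: one panel per column; par() returns the previous settings
op = par(mfrow=c(1,ncol(data_no_factor)))
lapply(seq(data_no_factor), function(x)
boxplot(x=data_no_factor[[x]], xlab=names(data_no_factor)[x],
main=paste("Boxplot", names(data_no_factor)[x])))
par(op) # restore
########################################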
Output:
Command:
pairs(data)
Output:
Picture 2.10: Output Pairs of data
Comment: the data does not appear to be biased.
Command:
summary(flm_model)
Output:
Picture 2.11: Output Linear Regression
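The command that creates flm_model is not shown above. The following is a minimal sketch, assuming the model is fitted with lm() in the same way as in Activity 1. The response column caffeine, the predictor volume and all values are hypothetical. Only the column name type appears in the commands of this report.

```r
set.seed(7)

# Synthetic stand-in for the caffeine data; columns and values are invented.
toy <- data.frame(
  volume = runif(40, 100, 500),
  type   = factor(sample(c("coffee", "tea"), 40, replace = TRUE))
)
toy$caffeine <- 0.3 * toy$volume +
  ifelse(toy$type == "coffee", 40, 0) +
  rnorm(40, sd = 10)

# Fit the response on all remaining columns, mirroring lm(weight~., data=data).
flm_model <- lm(caffeine ~ ., data = toy)

print(summary(flm_model))
```

The real response variable has to be taken from the actual column names of caffeine.csv, which print(colnames(data)) lists.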