You are on page 1of 3

Content for Data Science with R programming

Data Science course in R helps you to to understand the frameworks, tools and techniques in end-to-end
process of data collection, data cleaning, data visualization, data modeling and comparison, model
evaluations, data orchestration. You will be able to understand business case studies and do a project
and skill yourself for the data scientist jobs.

1. Introduction to Data Analytics Installing and Using Packages

Applications/Use Cases R-List of Useful Packages
Stages of Business Analytics 3. Introduction to RDBMS
Sources of Data Anomalies and normalization
Different Kind Of Data Sources SQL overview
What is a Model? DDL,DML,SQL Statement Fundamentals
Steps in Problem Solving Aggregate Functions
Different tools used for analysis Joins, Sub queries,
Introduction to data science Advance SQL
Prerequisite for data science
4. Data Structures in R
2. Introduction to R Programming Vectors, Matrices, Data frames, Lists
R programming compare to other languages Arithmetic Operators
for analytics Logical Operators
application of R Relational operator
Installing R and R Studio on Desktop? Conditional operator
R Studio Screen Bitwise operator
Knowing R Studio
Knowing R Environment and History 5. Data Structures in R
Packages Vectors, Matrices, Data frames, Lists

1
Content for Data Science with R programming
Arithmetic Operators Types of Graph
Logical Operators Line Plots, Dot Plots
Relational operator Choosing the right visualisation
Conditional operator Visualization using ggplot
Bitwise operator Use of plotly for Visualization
Interactive Graphs
6. Data Types Pie Charts, Histograms, Scatterplots,
Sub-Setting Functions and Indexing Bar Plots
Control and Flow Operators, Loops
Data Acquisition (Import and Export) 12. Basics Of Statistics
Why Study Statistics?
7. Memory Management Types of Data
Memory Allocation Types of Statistics
String Manipulation Descriptive statistics, Inferential Statistics
Sorting/Merging/Cleaning Data Understanding Summary Statistics
Mean, median, mode, variance, Standard
8. Functions deviation
Writing Functions in R
Simple Functions in R 13. Probability
Complex Functions in R Random Variables
Built-in Functions Continuous Probability Distribution
Apply Family in R-Apply(), Apply(), Supply(), Two Variable Probability of Events
Apply() Base theorem
Conditional probability
9. Data Management
Data Manipulation using dplyr
Heavy Data Management using data.table 14. Types Of Distribution
Data Management using tidyr Normal Distribution
Text Manipulation using stringr Probability Distribution
Different sources of Binomial Distribution
data(Primary,Secondary,Tertiary) Expected Value
Frequency and Cross Tabs The Standard Normal Distribution
Summarize the Data Negative Binomial Distribution
Discrete Probability Distribution

10. Data Management continue 15. Hypothesis Testing

Use of readr library Central Limit Theorem. Steps of Hypothesis
Data Analysis using sqldf Testing
Use of tidyr library Practical
Working with lubridate library Date and Time Application of CLT
Management using How does CLT work
lubridate
Reshape for Data Management 16. T-test, Poison Distribution
Confidence Interval and Probability
11. Types of Visualization T-distribution,
Graphs in R P-value and Significance Level

2
Content for Data Science with R programming
Variance for Means Comparison
Z-test and Proportions test 19. Linear Regression Model
Chi-square Test Analysis of Assumptions of a Linear Regression
Covariance(ANOVA) Model Building
Validations of Assumptions,
17. Introduction Representation of Regression Results
Regression Analysis
Types of Regression 20. Multiple Linear Regression Models
Steps to Implement a Regression Model Multiple Linear Regression Intuition
Multiple Linear Regression In R
18. Machine Learning
Overview Of Machine Learning 21. Logistic Regression(Overview)
Deep Learning Concepts(Overview)
Natural Language Processing(Overview)