Professional Documents
Culture Documents
Data Science
06/30/2017
Introduction
2
Data Science - Specialties
Our data science curriculum is designed with our students in mind. For those
interested in a specific domain of tech, we offer three specialities which
students can also declare during phase 2 of the program
Week 2
Week 2
Statistics & Linear Algebra
Descriptive Statistics
Distributions & Histograms
Week 2 is dedicated to
creating a deep Cumulative Distribution Functions
understanding of Skewness
mathematical concepts
we’ll later see in topics like Conditional Probability
Bayes Theorem
machine learning and
Estimation
statistical analysis.
Contrary to the traditional Hypothesis Testing
mathematics course, Correlation
students will learn
statistics and linear algebra Vectors & Matrices
Matrix Operations
through a computational
lens.
Tools Utilized:
Numpy, SciPy
4
Week 3
Data Wrangling Week 3
Week 4 Week 4
Line and Scatter Plots
Data Visualization & Histograms
Exploratory Analysis Visualization Customization
5
Week 5
Week 5 - Regression Analysis
Regression Analysis
Intro to Machine Learning
Types of Learning & Data
Week 5 begins the official
start of the statistical Maximum Likelihood
analysis and prediction Linear Regression
portion of this course. We’ll Multiple Linear Regression
spend week 5 engaging
Non-Linear Regression
with the basics of machine
Logistic Regression
learning and work our way
towards learning and Time Series Analysis
implementing several Stepwise Regression
regression models.
Ridge & Lasso Regression
Exercises & Examination
Tools Utilized:
scikit-learn
6
Weeks 6-7
Review of Bayes
Week 6-7 Naive Bayes & Joint Models
7
Weeks 8-9
Regular Expressions
Week 8-9 Components of Speech
Text Normalization
Natural Language Word Tagging
Processing & Deep
Sentiment Analysis
Learning Information Extraction
Named Entity Extraction
Once again expanding on
the knowledge gained from Topic Modeling
weeks 5-7, we will enter the Summarization
realm of machine learning
Neural Networks
involving textual analyses
BackProp & Gradient Descent
and artificial neural
networks. Because the two Mini Project
can be complementary, we
will also engage with topics Feedforward Neural Networks
like word2vec and more. Recurrent Neural Networks
8
Week 10
Week 10 - Databases
Databases
Intro to Databases
Week 10 dives into SQL Basics
database systems headfirst. Database Modeling
During this week,
Advanced SQL
becoming fluent with SQL,
Database Design
NoSQL, and MySQL is a
must that will carry over NoSQL
into the rest of this course. MongoDB
Week 11
Week 11 - Big Data
Big Data
MapReduce
With the consistent Hadoop
growing of data every day, Spark
engineers are forced to Hadoop Ecosystem
become equipped to
handle, prepare, and Kafka
process this data in a Storm
computationally efficient
Amazon Web Services
manner. This week reviews Cloud Computing
the different big data
architecture tools available Project Day
in the data science industry
today.
Tools Utilized:
10
More information?
info@byteacademy.co
www.byteacademy.co
Licensed by NY State Dept of Education
11