1904607 DATA SCIENCE (L T P C: 3 0 0 3)
OBJECTIVES:
• To introduce the basics of big data.
• To impart the necessary knowledge of the mathematical foundations.
• To become familiar with the basic concepts of machine learning.
• To learn the different classification algorithms for appropriate decision making.
• To learn the tools used to implement data science and its applications.
UNIT I INTRODUCTION TO DATA SCIENCE 9
Introduction to Data Science - Concept of Data Science - Traits of Big Data - Web Scraping -
Analysis vs Reporting.

UNIT II MATHEMATICAL FOUNDATIONS 9
Linear Algebra: Vectors, Matrices - Statistics: Describing a Single Set of Data, Correlation,
Simpson's Paradox, Correlation and Causation - Probability: Dependence and Independence,
Conditional Probability, Bayes's Theorem, Random Variables, Continuous Distributions, The
Normal Distribution, The Central Limit Theorem.

UNIT III MACHINE LEARNING 9
Overview of Machine Learning Concepts - Types of Machine Learning - Linear Regression -
Model Assumptions - Classification and Regression Algorithms: Naïve Bayes, K-Nearest
Neighbors, Logistic Regression, Support Vector Machines (SVM), Decision Trees, and Random
Forest.

UNIT IV PROGRAMMING TOOLS FOR DATA SCIENCE 9
Introduction to Programming Tools for Data Science - Toolkits using Python: Matplotlib, NumPy,
Scikit-learn, NLTK - Visualizing Data: Bar Charts, Line Charts and Scatterplots - Working with
Data: Reading Files, Scraping the Web, Using APIs (Example: Using the Twitter APIs).

UNIT V CASE STUDIES OF DATA SCIENCE APPLICATIONS 9
Weather Forecasting - Stock Market Prediction - Object Recognition - Real-Time Sentiment
Analysis.

TOTAL: 45 PERIODS

OUTCOMES:
At the end of the course, the student should be able to:
• Explain the basic foundations of big data.
• Demonstrate an understanding of the mathematical foundations needed for data science.
• Implement models such as k-nearest neighbors, Naïve Bayes, linear and logistic
regression, and decision trees.
• Build data science applications using Python-based toolkits.
• Be familiar with data science applications and their implementation.

TEXT BOOKS:
1. Joel Grus, "Data Science from Scratch: First Principles with Python", O'Reilly Media,
First Edition (April 30, 2015).
2. Aurélien Géron, "Hands-On Machine Learning with Scikit-Learn and TensorFlow:
Concepts, Tools, and Techniques to Build Intelligent Systems", O'Reilly Media, First
Edition, 2017.
REFERENCES:
1. Stephen Marsland, "Machine Learning: An Algorithmic Perspective", CRC Press, Second
Edition, 2009.
2. G. Strang, "Introduction to Linear Algebra", Wellesley-Cambridge Press, USA, Fifth
Edition, 2016.
3. Ian Goodfellow, Yoshua Bengio and Aaron Courville, "Deep Learning", MIT Press, First
Edition (November 18, 2016).
4. D. C. Montgomery and G. C. Runger, "Applied Statistics and Probability for Engineers",
John Wiley & Sons, Inc., NY, USA, Fifth Edition, 2011.
