XLRI JAMSHEDPUR

BIG DATA ANALYTICS


COURSE DESCRIPTION AND SCHEDULE

-----------------------------------------------------------------------------------------------------------------------------

Objective:

We are living in an age of information overload. Huge amounts of data are generated by humans,
computers, and instruments of many kinds, and the biggest challenge is to analyze those data. Big
data comprises datasets whose size and type make them impractical to process and analyze with
traditional analytics tools. Applying advanced data mining, predictive modeling, forecasting, text
mining, and optimization to big data makes it possible to unearth hidden patterns in large amounts
of data. This course focuses on several key information technologies used in manipulating, storing,
and analyzing big data. It provides a brief overview of the technologies used in managing big data
and introduces Hadoop, MapReduce, and Spark. In addition, various business applications of big data
technology will be discussed. Concepts of Machine Learning, with a focus on Supervised and
Unsupervised Learning, will be illustrated with specific use cases in Python/R. Finally, Social
Media Analytics, Sentiment Analysis, Fraud Detection, and Relationship Mining concepts and their
applications will be illustrated using Python/R.

The basic objectives of this course are to:

• Apply appropriate analytic techniques and tools to analyze big data
• Build, interpret, and use machine learning models to gain insights into data and to make
predictions
• Solve specific business problems in different domains that involve managing large data volumes,
using various techniques of web analytics, social media analytics, and relationship mining

Books and Articles:
1. Bill Schmarzo, Big Data: Understanding How Data Powers Big Business, Wiley, 2014.
2. Bill Franks, Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams
with Advanced Analytics, Wiley, 2014.
3. Avinash Kaushik, Web Analytics 2.0: The Art of Online Accountability & Science of
Customer Centricity, SYBEX, 2014.
4. Michael Minelli, Michele Chambers and Ambiga Dhiraj, Big Data, Big Analytics:
Emerging Business Intelligence and Analytic Trends for Today’s Businesses, Wiley, 2013.
5. David Loshin, Big Data Analytics: From Strategic Planning to Enterprise Integration
with Tools, Techniques, NoSQL, and Graph, Morgan Kaufmann, 2013.
6. Andrew McAfee and Erik Brynjolfsson, Big Data: The Management Revolution, Harvard
Business Review, 2013.

Course Schedule
1. Overview of Big Data
o Introducing Technologies for Handling Big Data
o Understanding Hadoop, MapReduce and HBase
o Understanding Spark
o SQL and NoSQL

2. Exploiting the Use of Big Data in Business Context
o Business Intelligence
o Marketing
o Retail
o Insurance and Financial Sector

3. Concepts in Machine Learning
o Supervised Learning – Regression and Classification – Illustration Using
Python/R of a Business Use Case
o Unsupervised Learning – Clustering – Illustration Using Python/R of a
Business Use Case

4. Web Analytics
o Clickstream Analysis – Illustration Using R of a Business Use Case
o Social Media Mining and Sentiment Analysis – Illustration Using Python of a
Business Use Case

5. Fraud Analytics
o Concepts
o Illustration of a Credit Fraud Detection Example Using Python

6. Association Rule and Relationship Mining
o Concepts
o Illustration of a Business Use Case Using Python/R

7. Text Mining and Natural Language Processing
o Technology and Business Use Cases
