You are on page 1of 8

Master of Business Administration

Batch: 2020-2022 Term: V

Course Code and Title MFT5SEIM01 – Big Data


Credit Hours 3
Faculty Mr. Neil Harwani
E-mail ID Neilharwani@nirmauni.ac.in
Blog / Classroom https://classroom.google.com/c/MzQ2MjI2Nzg0NTc4?cjc=omzqpwb
Phone No. 9712965004
Office Hours Thursday & Saturday – 7 PM to 8 PM (Thu) & 10.10 to 12.20 PM (Sat)

****************************************************************************

I. Course Overview:

Big Data course intends to provide the students of MBA an overview of Big Data
ecosystem, concepts, architecture & application via case studies, study of trends and
fundamental concepts. Subject will be a mix of technology & management topics around
the ecosystem of Big Data. With large amounts of data getting generated in various areas
like HealthCare, BFSI, Manufacturing, Digital Marketing, Internet and so on, it becomes
important to study and understand the Big Data concepts and how they can be applied to
business and society overall. Many interfacing topics like security, privacy, evolution of
areas in Big Data will be studied as well in this course.

Topics will include overview of Cloud ecosystem, Kafka streaming, NoSQL and similar
evolving areas. Over the last few years post the primary research of Big Data by
companies like Yahoo, Google, IBM among others, many new areas like Big Data
ecosystem in Cloud, Stream processing, various new types of database implementations,
in-memory computing have come up. This has enabled a new area for various businesses
of large-scale processing of data with much faster turnaround compared to traditional
ways.

Recommendations over mobiles / smartphones, eCommerce, healthcare and financial


systems are getting enabled with Big Data in the backend. Large amount of data is stored
using Big Data and processed using Neural Networks / Machine Learning systems to
enable businesses to provide relevant content. This course will act as a foundation to
understand such changes to the Big Data ecosystem.

1
II. Course Learning Outcomes (CLO):

At the end of the course, the students will be able to:

1. Demonstrate an understanding of Big Data and its applications


2. Discover the scope of Big Data in Business
3. Explain Big Data Architecture and Technology

III. Text Book:


Maheshwari, A. (2019). Big Data. McGraw-Hill Education. 2nd Ed

IV. Assessment Components & Schedule:

Assessment Weightage Overall CLO


Schedule
Component % Weightage % Number
18th to 22nd Session. Students
Individual
20 will be submitting case study 20 1 and 2
Assignment
analysis.
Planned as quiz at 12th,14th &
Quizzes [2] 10 16th Session. Best two out of 20 2 and 3
three
17th Session – Students will be
provided with industry topics
which they need to analyse and
Group
20 provide a summary report along 20 1 and 2
Assignment
with recommendations for
improvements compared to
current situation.
End-term
40 40 1, 2 and 3
Examination

V. Session Plan
Session
Description
No.
Topic: Context & Overview of Big Data
Pedagogy: Lecture & discussion
Text Book: Chapter 1 (Page: 3 to 22)
1 Reading: -
Case /
-
Exercise:
CLO No: 1, 2
2 Topic: Big Data Sources and Applications
2
Session
Description
No.
Pedagogy: Lecture & discussion
Text Book: Chapter 2 (Page 23 to 36)
Reading: -
Case /
-
Exercise:
CLO No: 1, 2
Topic: Big Data Architecture
Pedagogy: Lecture & discussion
Text Book: Chapter 3 (Page 37 to 58)
3 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Distributed Computing Using Hadoop
Pedagogy: Lecture & discussion
Text Book: Chapter 4 (Page 59 to 70)
4 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Parallel Processing with Map Reduce
Pedagogy: Lecture & discussion
Text Book: Chapter 5 (Page 71 to 84)
5 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: NoSQL Databases – 1
Pedagogy: Lecture & discussion
Text Book: Chapter 6 (Page 85 to 102)
6 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: NoSQL Databases – 2
Pedagogy: Lecture & discussion
Text Book: Chapter 6 (Page 85 to 102)
7 Reading: -
Case /
-
Exercise:
CLO No: 3

3
Session
Description
No.
Topic: Stream Processing with Spark
Pedagogy: Lecture & discussion
Text Book: Chapter 7 (Page 103 to 114)
8 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Data Ingest with Kafka
Pedagogy: Lecture & discussion
Text Book: Chapter 8 (Page 115 to 124)
9 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Cloud Computing – 1
Pedagogy: Lecture & discussion
Text Book: Chapter 9 (Page 125 to 134)
10 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Cloud Computing – 2 – Chapter 9
Pedagogy: Lecture & discussion
Text Book: (Page 125 to 134)
11 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Big Data Programming Languages
Pedagogy: Lecture & discussion
Text Book: Chapter 10 (Page 135 to 152)
12 Reading: -
Case /
-
Exercise:
CLO No: 3
Topic: Web Log Analyzer Application Development
Pedagogy: Lecture & discussion
Text Book: Chapter 11 (Page 153 to 164)
13 Reading: -
Case /
-
Exercise:
CLO No: 1, 2
14 Topic: Data Modelling Primer
4
Session
Description
No.
Pedagogy: Lecture & discussion
Text Book: Chapter 12 (Page 165 to 172)
Reading: -
Case /
-
Exercise:
CLO No: 1, 2
Topic: Data Analytics Primer
Pedagogy: Lecture & discussion
Text Book: Chapter 13 (Page 173 to 194)
15 Reading: -
Case /
-
Exercise:
CLO No: 1, 2
Topic: Artificial Intelligence for Big Data Primer 1
Pedagogy: Lecture & discussion
Text Book: Chapter 14 (Page 195 to 202)
16 Reading: -
Case /
-
Exercise:
CLO No: 2, 3
Topic: Artificial Intelligence for Big Data Primer 2
Pedagogy: Lecture & discussion
Text Book: Chapter 14 (Page 195 to 202)
17 Reading: -
Case /
1.
Exercise:
CLO No: 2, 3
Big data and creating an enhanced digital connect with
Topic:
customers.
Pedagogy: Case study discussion
Text Book: -
18
Reading: -
Case / Glossier: Co-Creating a Cult Brand with a Digital Community, Jill
Exercise: Avery [HBS Case part of course pack]
CLO No: 1, 2
Topic: Tailor a recommender system to a new context
Pedagogy: Case study discussion
Text Book: -
19 Reading: -
Case / Wattpad, John Deighton, Leora Kornfeld [HBS Case part of course
Exercise: pack]
CLO No: 1, 2
Topic: Data-driven approach to improve people management practices
Pedagogy: Case study discussion
20
Text Book: -
Reading: -

5
Session
Description
No.
Lojas Americanas: Project DNA and the "People Machine", Boris
Case /
Groysberg, Eric Lin, Sarah L. Abbott [HBS Case part of course
Exercise:
pack]
CLO No: 1, 2
Topic: Demand forecasting
Pedagogy: Case study discussion
Text Book: -
Reading: -
21
Komatsu Komtrax: Asset Tracking Meets Demand Forecasting
Case /
Willy Shih, Paul Hong, Young Won Park [HBS Case part of course
Exercise:
pack]
CLO No: 1, 2
Topic: Medicine via technology platforms
Pedagogy: Case study discussion
Text Book: -
22 Reading: -
Case / Maccabitech: The Promise of Israel's Healthcare Data Scott Duke
Exercise: Kominers, Carin-Isabel Knoop [HBS Case part of course pack]
CLO No: 1, 2
Trends in Big Data technology ecosystem around Apache & other
Topic:
open-source projects
Pedagogy: Class discussion
Text Book: -
23
Reading: R5, R6
Case /
-
Exercise:
CLO No: 1, 2
Topic: Trends in Security & Privacy, HealthCare for Big Data
Pedagogy: Class discussion
Text Book: -
24 Reading: R8, R9
Case /
-
Exercise:
CLO No: 1, 2
Topic: Trends in Manufacturing, BFSI, Cloud for Big Data
Pedagogy: Class discussion
Text Book: -
25 Reading: R10, 11, 12, 13, 14, 15
Case /
-
Exercise:
CLO No: 1, 2
Topic: Data Science Careers
Pedagogy: Lecture & discussion
Text Book: Chapter 15 (Page 203 to 206)
26
Reading: -
Case /
-
Exercise:
6
Session
Description
No.
CLO No: 1, 2
Topic: Practical on BigData – Hadoop, HDFS & NoSQL
Pedagogy: Discussion and Demonstration
Text Book: Handouts & blogs
27 Reading:
Case /
-
Exercise:
CLO No: 3
Topic: Practical on BigData – Hadoop, HDFS & NoSQL
Pedagogy: Discussion and Demonstration
Text Book: Handouts & blogs
28 Reading:
Case /
-
Exercise:
CLO No: 3
Topic: Industry Insights – Master Class
Pedagogy: Discussion
Text Book: Chapter 15 (Page 203 to 206)
29 Reading: -
Case /
-
Exercise:
CLO No: 2
Topic: Review of Techno-Managerial aspects of Big Data ecosystem
Pedagogy: Class discussion
Text Book: -
30 Reading: -
Case /
-
Exercise:
CLO No: 1, 2, 3

VI. Readings:

R1. MIT News Big data. (n.d.). https://news.mit.edu/topic/big-data


R2. Apache Projects Directory. (n.d.). https://projects.apache.org/projects.html?category
R3. MIT News Big data. (n.d.). https://news.mit.edu/topic/big-data
R4. Sicular, S. (2013). Gartner’s Big Data Definition Consists of Three Parts, Not to Be
Confused with Three “V”s. https://blogs.gartner.com/svetlana-sicular/gartners-big-data-
definition-consists-of-three-parts-not-to-be-confused-with-three-vs/
R5. From proprietary to open source: How a Firm data science tool became a trending
product. (n.d.). https://www.mckinsey.com/alumni/news-and-insights/global-news/firm-
news/kedro-from-proprietary-to-open-source
R6. Awesome Big Data. (n.d.). https://github.com/onurakpolat/awesome-bigdata
R7. Brown, S. (202 C.E.). 10 big data blunders businesses should avoid.
https://mitsloan.mit.edu/ideas-made-to-matter/10-big-data-blunders-businesses-should-
avoid

7
R8. José Parra-Moyano, Karl Schmedders, and A. “Sandy” P. (2020). What Managers Need
to Know About Data Exchanges. https://sloanreview.mit.edu/article/what-managers-
need-to-know-about-data-exchanges/
R9. Using data science to forecast clinical trial outcomes may help biomedical stakeholders
de-risk their portfolios. (n.d.). https://mitsloan.mit.edu/press/using-data-science-to-
forecast-clinical-trial-outcomes-may-help-biomedical-stakeholders-de-risk-their-
portfolios
R10. Digitalization of industrial production lines. (n.d.).
https://www2.deloitte.com/content/dam/Deloitte/de/Documents/technology/Digitalizati
on-Production-Lines-Big-Data-Technologies-Deloitte.pdf
R11. Gartner Top 10 Trends in Data and Analytics for 2020. (2020).
https://www.gartner.com/smarterwithgartner/gartner-top-10-trends-in-data-and-
analytics-for-2020/
R12. Clouds, big data, and smart assets: Ten tech-enabled business trends to watch.
(2020). https://www.mckinsey.com/industries/technology-media-and-
telecommunications/our-insights/clouds-big-data-and-smart-assets-ten-tech-enabled-
business-trends-to-watch
R13. Amit Garg, Davide Grande, Gloria Macías-Lizaso Miranda, Christoph
Sporleder, and E. W. (2017). Analytics in banking: Time to realize the value.
https://www.mckinsey.com/industries/financial-services/our-insights/analytics-in-
banking-time-to-realize-the-value
R14. Carlos Fernandez Naveira, Imke Jacob, Khaled Rifai, Pamela Simon, and E. W.
(2018). Smarter analytics for banks. https://www.mckinsey.com/industries/financial-
services/our-insights/smarter-analytics-for-banks
R15. Meet Kedro, McKinsey’s first open-source software tool. (2019).
https://www.mckinsey.com/about-us/new-at-mckinsey-blog/meet-kedro-mckinseys-
first-open-source-software-tool
R16. Meet Kedro, McKinsey’s first open-source software tool. (2019).
https://www.mckinsey.com/about-us/new-at-mckinsey-blog/meet-kedro-mckinseys-
first-open-source-software-tool
R17. James Manyika, Michael Chui, Brad Brown, Jacques Bughin, Richard Dobbs,
Charles Roxburgh, and A. H. B. (2011). Big data: The next frontier for innovation,
competition, and productivity. https://www.mckinsey.com/business-
functions/mckinsey-digital/our-insights/big-data-the-next-frontier-for-innovation
R18. Court, D. (2015). Getting big impact from big data.
https://www.mckinsey.com/business-functions/mckinsey-digital/our-insights/getting-
big-impact-from-big-data
R19. Nicolaus Henke, Ari Libarikian, and B. W. (2016). Straight talk about big data.
https://www.mckinsey.com/business-functions/mckinsey-digital/our-insights/straight-
talk-about-big-data

You might also like