Syllabus PDF

Uploaded by

Jv s

0% found this document useful (0 votes)

3 views2 pages

Original Title

Syllabus.pdf

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

3 views2 pages

Syllabus PDF

Uploaded by

Jv s

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

L T P C

3 0 0 3

Course Code: INT404R01

Semester: VII

BIG DATA ANALYTICS

Course Objective:

UNIT - I 12 Periods
Getting Started with Big Data: Grasping the Fundamentals of Big Data. An Operating System
for Big Data: Basic Concepts - Hadoop Architecture - Working with a Distributed File System -
Working with Distributed Computation - Submitting a MapReduce Job to YARN. A Framework
for Python and Hadoop Streaming: Hadoop Streaming - A Framework for MapReduce with
Python - Advanced MapReduce

UNIT - II 11 Periods
MapReduce and the New Software Stack: Map Reduce Algorithms - Communication Cost
Model. Mining Data Streams: Stream Data Model - Sampling Data in a Stream - Filtering
Streams - Counting Distinct elements in a Stream - Estimating moments - Counting ones in a
Window - Decaying Windows. Link Analysis: Pagerank Algorithm - Efficient Computation of
Pagerank - Topic-Sensitive PageRank - Link Spam.

UNIT - III 11 Periods

Recommendation Systems: A Model for Recommendation System - Content Based
Recommendations - Collaborative Filtering - Dimensionality Reduction. Mining Social Network
Graphs: Social Networks as Graphs - Clustering of Social Network Graphs - Direct Discovery of
Communities - Partitioning of Graphs - Finding overlapping communities - Simrank - Counting
Triangles.

UNIT - IV 11 Periods
Structured Data Queries with Hive: The Hive Command-Line Interface (CLI), Hive Query
Language, Data Analysis with Hive, HBase. Data Ingestion: Relational Data with Sqoop.
Analytics with Higher-Level APIs: -Level APIs. Machine Learning:
Scalable Machine Learning with Spark

TEXTBOOKS
1. Jure Leskovec, Anand Rajaraman, and Jeffrey D. Ullman, Mining of Massive Data Sets, 2019.
2. Benjamin Bengfort, and Jenny Kim, Data Analytics with Hadoop
3. Judith Hurwitz, Alan Nugent, Dr. Fern Halper, and Marcia Kaufman, Big Data for Dummies,
John - Wiley and Sons, 2013.
REFERENCES
1. Jimmy Lin, Chris Dyer, Data-Intensive Text Processing with MapReduce, Morgan & Claypool
Publishers, 2010.
2. Paul C. Zikopoulos, Chris Eaton, and Dirk deRoos, Thomas Deutsch, George Lapis,
Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data, The
McGraw-Hill Companies, 2012.
3. Srinath Perera, and Thilina Gunarathne, Hadoop MapReduce Cookbook, Packt Publishing,
2013.
4. Tim McGovern, Big Data Now: 2014 Edition
5. Jason Venner, Pro Hadoop, Apress, 2009.

UNITWISE LEARNING OUTCOMES

Upon successful completion of each unit, the learner will be able to
Unit I Discuss the fundamental concepts of big data and Hadoop - MapReduce
framework
Demonstrate the MapReduce algorithms using python and Hadoop Streaming
Unit II Explain the methods of processing and filtering of data streams
Apply link analysis for page rank computation
Unit III Describe the methods for developing a model for recommendation system.
Explain dimensionality reduction and its implementation
Solve community detection problems in social network graphs by applying
algorithms for clustering and partitioning
Unit IV Illustrate the different programming languages in Hadoop such as Hive,
HBase, Pig, Spark

COURSE LEARNING OUTCOMES

Upon successful completion of this course, the learner will be able to

Knowledge
CO No. Course Outcomes
Level
Discuss the fundamental concepts of big data and Hadoop -
1 K3
MapReduce framework
Demonstrate the MapReduce algorithms using python and
2 K4
Hadoop Streaming
Explain the methods of processing and filtering of data streams
3 K3
and page rank algorithms
Describe the methods for developing a model for recommendation
4 K3
system with dimensionality reduction
Solve community detection problems in social network graphs by
5 K3
employing algorithms for clustering and partitioning
Illustrate the different programming languages in Hadoop such as
6 K3
Hive, HBase, Pig, Spark

Relational Databases: State of the Art Report 14:5
From Everand
Relational Databases: State of the Art Report 14:5
D A Bell
No ratings yet
DATA ANALYTICS Lab
Document3 pages
DATA ANALYTICS Lab
Boopathi kumar
No ratings yet
Mastering Hadoop
From Everand
Mastering Hadoop
Sandeep Karanth
No ratings yet
Big Data Syllabus For Theory and Lab
Document4 pages
Big Data Syllabus For Theory and Lab
chetana tukkoji
No ratings yet
Ccs334 - Big Data Analytics
Document2 pages
Ccs334 - Big Data Analytics
silambarasan
No ratings yet
Big Data Syllabus
Document2 pages
Big Data Syllabus
Avinash Bommina
No ratings yet
A Comparative Study On Apache Spark and Map Reduce With Performance Analysis Using KNN and Page Rank Algorithm
Document6 pages
A Comparative Study On Apache Spark and Map Reduce With Performance Analysis Using KNN and Page Rank Algorithm
Editor IJTSRD
No ratings yet
19CS4701D
Document2 pages
19CS4701D
Soundarrajan OGP
No ratings yet
Data Science With Python - Lesson 12 - Python Integration With Hadoop
Document53 pages
Data Science With Python - Lesson 12 - Python Integration With Hadoop
Zozer Mbula Lwanga
No ratings yet
Bigdata Syllabus
Document3 pages
Bigdata Syllabus
Sankar Terli
No ratings yet
Big Data Analytics (R18a0529)
Document134 pages
Big Data Analytics (R18a0529)
NAVANEETH 09
No ratings yet
CC ZG522 Course Handout
Document6 pages
CC ZG522 Course Handout
2023mt03587
No ratings yet
Handout MCAIIISem CS508 BigData ManishaJailia JulyDec2022
Document3 pages
Handout MCAIIISem CS508 BigData ManishaJailia JulyDec2022
Sanjali Pandey
No ratings yet
Big Data Theory
Document3 pages
Big Data Theory
lathasivasankari.v
No ratings yet
(R17a0528) Big Data Analytics
Document131 pages
(R17a0528) Big Data Analytics
Keerthana K
No ratings yet
Big Data Analytics Digital Notes
Document119 pages
Big Data Analytics Digital Notes
infweb4735
No ratings yet
Big Data Analytics Syllabus
Document2 pages
Big Data Analytics Syllabus
Saiyed Faiayaz Waris
No ratings yet
Big Data Syllabus
Document4 pages
Big Data Syllabus
JAJULAHARIBABU
No ratings yet
Big Data
Document2 pages
Big Data
POLU SRINIVASA REDDY
No ratings yet
Big Data Analytics With Lab
Document3 pages
Big Data Analytics With Lab
Keerthana K
No ratings yet
20IT503 - Big Data Analytics - Unit4
Document73 pages
20IT503 - Big Data Analytics - Unit4
5023-Monish Kumar K
No ratings yet
Lab Manual BDA
Document36 pages
Lab Manual BDA
hemalata jangale
No ratings yet
30 Comparative Performance Analysis of Apache Spark and Map Reduce Using K-Means E
Document7 pages
30 Comparative Performance Analysis of Apache Spark and Map Reduce Using K-Means E
jefferyleclerc
No ratings yet
AIADS 7th Sem Syllabus Signed
Document19 pages
AIADS 7th Sem Syllabus Signed
himanshu24ai018
No ratings yet
Unit3 BD
Document104 pages
Unit3 BD
Hirdesh Sharma
100% (1)
Big Data and Analytics Syllabus 2021
Document3 pages
Big Data and Analytics Syllabus 2021
rashmi
No ratings yet
A Comparison of Big Data Analytics Approaches Based On Hadoop Mapreduce
Document9 pages
A Comparison of Big Data Analytics Approaches Based On Hadoop Mapreduce
Kyar Nyo Aye
No ratings yet
2021 22 4th Year
Document8 pages
2021 22 4th Year
Abhay tyagi
No ratings yet
Performance of Graph Query Languages: Comparison of Cypher, Gremlin and Native Access in Neo4j
Document10 pages
Performance of Graph Query Languages: Comparison of Cypher, Gremlin and Native Access in Neo4j
Nicholas Tonna
No ratings yet
Big Data Analytics-Syllabus
Document3 pages
Big Data Analytics-Syllabus
Dual Dave
No ratings yet
Big Data
Document4 pages
Big Data
aryan kothambia
No ratings yet
Hadoop
Document14 pages
Hadoop
srcesrbije.online
No ratings yet
Datascience and Machine Learning
Document8 pages
Datascience and Machine Learning
suman
No ratings yet
cp5293 Big Data Analytics Question Bank
Document13 pages
cp5293 Big Data Analytics Question Bank
Sanguine Shereen
0% (1)
Cp5293 Big Data Analytics Question Bank
Document13 pages
Cp5293 Big Data Analytics Question Bank
Sanguine Shereen
0% (1)
A Comparison of Big Remote Sensing Data Processing With Hadoop MspReduce and Spark
Document4 pages
A Comparison of Big Remote Sensing Data Processing With Hadoop MspReduce and Spark
Sara sherif
No ratings yet
Building A Big Data Platform With The Hadoop Ecosystem
Document53 pages
Building A Big Data Platform With The Hadoop Ecosystem
Gregg Barrett
No ratings yet
Hadoop 1 Ref
Document4 pages
Hadoop 1 Ref
Rahul Naik
No ratings yet
Big Data Storge Management
Document4 pages
Big Data Storge Management
V Harsha Shastri
No ratings yet
113 Ce 74
Document4 pages
113 Ce 74
boremshiva1201
No ratings yet
BDAA
Document4 pages
BDAA
Leela Rallapudi
No ratings yet
Big Data Analytics
Document3 pages
Big Data Analytics
tirth_diwani
No ratings yet
Irjet V4i4207 PDF
Document3 pages
Irjet V4i4207 PDF
Soham
No ratings yet
Data Analytics Course Plan 2016
Document7 pages
Data Analytics Course Plan 2016
Rïshí Rìshìkêsh
No ratings yet
Learn Well Technocraft: Hadoop/Big Data Syllabus
Document12 pages
Learn Well Technocraft: Hadoop/Big Data Syllabus
SONAL S.K
No ratings yet
Toward Scalable Internet Traffic Measure
Document8 pages
Toward Scalable Internet Traffic Measure
Eder
No ratings yet
3rd Sem Syllabus
Document13 pages
3rd Sem Syllabus
iShamirali
No ratings yet
Machine Learning and Cloud Computing: Survey of Distributed and Saas Solutions
Document13 pages
Machine Learning and Cloud Computing: Survey of Distributed and Saas Solutions
sfaritha
No ratings yet
Bigdata
Document2 pages
Bigdata
Deepak Pardhi
No ratings yet
Certified Hadoop and Spark Course Curriculum
Document9 pages
Certified Hadoop and Spark Course Curriculum
mano555
No ratings yet
CIT-650 Introduction To Big Data, Developing With Spark and Hadoop
Document4 pages
CIT-650 Introduction To Big Data, Developing With Spark and Hadoop
Amgad Alsisi
No ratings yet
Big Data Management Syllabus
Document5 pages
Big Data Management Syllabus
ANOOPATICS Y
No ratings yet
Pig Mix Benchmark
Document6 pages
Pig Mix Benchmark
abhimonica
No ratings yet
Big Data Black Book PDF
Document2 pages
Big Data Black Book PDF
Achintya Kumar
15% (20)
Hadoop
Document58 pages
Hadoop
duggy
No ratings yet
Spark Streaming Research
Document6 pages
Spark Streaming Research
reshmashaik4656
No ratings yet
Big Data Analytics Comp Syllabus Sem7
Document4 pages
Big Data Analytics Comp Syllabus Sem7
Saprativa Bhattacharjee
No ratings yet
Beginning Database Design
Document2 pages
Beginning Database Design
I Made Putrama
No ratings yet
Effective Query Processing For Web-Scale RDF Data Using Hadoop Components
Document7 pages
Effective Query Processing For Web-Scale RDF Data Using Hadoop Components
Lakshmi Srinivasulu
100% (1)
No SQL Database in Bda
Document84 pages
No SQL Database in Bda
prabha gau
No ratings yet
ACRS SRCCampus Allotted List 21032022 PDF
Document3 pages
ACRS SRCCampus Allotted List 21032022 PDF
Jv s
No ratings yet
Unit4 PDF
Document2 pages
Unit4 PDF
Jv s
No ratings yet
Syllabus
Document2 pages
Syllabus
Jv s
No ratings yet
Alphabets - Dipthongs
Document43 pages
Alphabets - Dipthongs
Jv s
No ratings yet
Unit IV Part III
Document27 pages
Unit IV Part III
Jv s
No ratings yet
Discrete Mathematics
Document3 pages
Discrete Mathematics
Jv s
No ratings yet
1.muscular System - Dipita Guha
Document3 pages
1.muscular System - Dipita Guha
Jv s
No ratings yet
B. Tech. / M. Tech. (Integrated) First Year
Document2 pages
B. Tech. / M. Tech. (Integrated) First Year
Jv s
No ratings yet