Welcome to Scribd!

Skip carousel

INT313

Uploaded by

th vsdv

0% found this document useful (0 votes)

28 views3 pages

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

28 views3 pages

INT313

Uploaded by

th vsdv

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

INT313: BIG DATA PROCESSING FRAMEWORK

L:2 T:0 P:2 Credits:3

Unit I

Introduction to Big Data Processing Frameworks : Introduction to Processing Engines and

Processing Frameworks, Introduction to Batch Processing Systems, Introduction to Stream only

frameworks, Introduction to Hybrid frameworks, Comparison of frameworks

Unit II

Working with Hybrid Framework: Apache Spark : Introduction to Apache Spark, Features of

Apache Spark, Components of Apache Spark, Sentiment Analysis using Apache Spark, Installation of

Apache Spark, Working with Apache Spark using Scala, Introduction to Apache Spark Programming

Unit III

Working with Hybrid Framework: Apache Flink : Introduction to Apache Flink, Working with

Apache Flink Ecosystem, Features of Apache Flink, Apache Flink Architecture, Installation and

Configuration of Apache Flink on Ubuntu, Apache Flink Shell Commands

Unit IV

Working with Stream-only framework: Apache Storm : Introduction to Apache Storm, Building

blocks of Storm Topologies, Apache Storm - Cluster Architecture, Apache Storm - Workflow, Apache

Storm Installation, Possible Use Cases of Apache Storm

Unit V

Working with Stream-only framework: Apache Samza : Introduction to Apache Samza, Apache

Samza Architecture : Streaming Layer, Apache Samza Architecture : Execution Layer, Apache Samza

Architecture : Processing Layer, Introduction to hello-samza (starter project for Apache Samza jobs)

Unit VI

Apache Samza: Working with Apache Kafka: Apache Kafka - Cluster Architecture, Apache Kafka – Basic
Operations, Apache Kafka - Simple Producer Example, Apache Kafka - Consumer Group Example, Apache
Kafka - Integration With Storm, Apache Kafka - Integration With Spark, Real Time Application (Twitter)

List of Practicals/Experiments

1. Setting Up Eclipse for Apache Storm and making it ready for first program.
2. Setting up Maven Project for demonstration of spouts and bolts using Apache Storm.
3. Sentiment Analysis using Apache Spark
4. Demonstrate the use of Mini Reducer i.e. combiner in Apache Hadoop Map Reduce.
5. Demonstrate the use of GraphX in Apache Spark.
6. Demonstrate the use of Spark Streaming in Apache Spark.
7. Demonstrate the use of Producer and Consumer in Apache Kafka.
8. Create an Apache Flink project in Eclipse
9. Implementing different types of Joins in Apache Spark.
10. Apache Flink - Running a Flink Program
References:

1. Big Data Simplified by Sourabh Mukherjee, Amit Kumar Das, Sayan Goswami, Pearson, India

2. Data Analytics with Spark Using Python by Jeffrey Aven, Pearson, India

3. Big Data Fundamentals by Thomas Erl, Pearson, India

Spark Overview: Security
Document4 pages
Spark Overview: Security
gathorsfx
No ratings yet
Apache Spark Tutorial (Fast Data Architecture Series) - DZone Big Data
Document5 pages
Apache Spark Tutorial (Fast Data Architecture Series) - DZone Big Data
Ricardo Cardoso
No ratings yet
Apache Kafka in Spring Boot Application
Document8 pages
Apache Kafka in Spring Boot Application
Phong Nguyen
No ratings yet
PySpark Tutorial For Beginners - Python Examples - Spark by (Examples)
Document19 pages
PySpark Tutorial For Beginners - Python Examples - Spark by (Examples)
pysparkv
No ratings yet
Getting Started With Apache Kafka in Python - Towards Data Science PDF
Document17 pages
Getting Started With Apache Kafka in Python - Towards Data Science PDF
Deven Mali
No ratings yet
Apache Spark Tutorial
Document6 pages
Apache Spark Tutorial
abhimanyu thakur
100% (1)
Apache Kafka Tutorial
Document6 pages
Apache Kafka Tutorial
varam10
No ratings yet
Getting Started With The Alfresco Maven SDK
Document12 pages
Getting Started With The Alfresco Maven SDK
vesar6
No ratings yet
Exp 12
Document8 pages
Exp 12
Smaranika Patil
No ratings yet
Apache Kafka Cookbook - Sample Chapter
Document14 pages
Apache Kafka Cookbook - Sample Chapter
Packt Publishing
100% (1)
Apache Spark Tutorial
Document36 pages
Apache Spark Tutorial
vietpine
100% (3)
Scala Data Analysis Cookbook - Sample Chapter
Document37 pages
Scala Data Analysis Cookbook - Sample Chapter
Packt Publishing
100% (1)
Introducing .NET for Apache Spark: Distributed Processing for Massive Datasets
From Everand
Introducing .NET for Apache Spark: Distributed Processing for Massive Datasets
Ed Elliott
No ratings yet
Introduction To Spark For Data Engineers / Data Scientists
Document100 pages
Introduction To Spark For Data Engineers / Data Scientists
Gabriel Vieira
100% (1)
Apache Spark
Document100 pages
Apache Spark
Tuấn Đặng
No ratings yet
Spark Project Report: Streaming
Document22 pages
Spark Project Report: Streaming
testyy testt
No ratings yet
Mastering Apache Spark - Sample Chapter
Document24 pages
Mastering Apache Spark - Sample Chapter
Packt Publishing
No ratings yet
Introduction To Apache Ka Ka For Python Programmers: Installation
Document8 pages
Introduction To Apache Ka Ka For Python Programmers: Installation
inc0gnit0
No ratings yet
Name: Wable Snehal Mahesh Subject:-Scala & Spark Div: - Mba Ii Roll No: - 57 Guidence Name: - Prof. Archana Suryawanshi - Kadam
Document11 pages
Name: Wable Snehal Mahesh Subject:-Scala & Spark Div: - Mba Ii Roll No: - 57 Guidence Name: - Prof. Archana Suryawanshi - Kadam
Snehal Mahesh Wable
No ratings yet
Apache Kafka Tutorial
Document3 pages
Apache Kafka Tutorial
Mario Soares
No ratings yet
Databricks Spark Reference Applications
Document37 pages
Databricks Spark Reference Applications
jose
No ratings yet
Os Eclipse Soatptuscany PDF
Document34 pages
Os Eclipse Soatptuscany PDF
Billy Henry Ochingwa
No ratings yet
Async Python Service With FastAPI & SQLAlchemy
Document7 pages
Async Python Service With FastAPI & SQLAlchemy
Leon
No ratings yet
Apache Kafka Course Curriculum
Document5 pages
Apache Kafka Course Curriculum
Vinicius Gonçalves
No ratings yet
Top 9 Asynchronous Web Frameworks For Python
Document10 pages
Top 9 Asynchronous Web Frameworks For Python
Leon
No ratings yet
Kafka
Document50 pages
Kafka
Emanuele Parente
No ratings yet
Mastering Apache Spark 2.0
Document62 pages
Mastering Apache Spark 2.0
Cesar Celis
No ratings yet
Kafka Up and Running for Network DevOps: Set Your Network Data in Motion
From Everand
Kafka Up and Running for Network DevOps: Set Your Network Data in Motion
Eric Chou
No ratings yet
Spark Databricks Summary
Document100 pages
Spark Databricks Summary
Yolanda De la Hoz Simon
75% (4)
Fast Data Processing Systems with SMACK Stack
From Everand
Fast Data Processing Systems with SMACK Stack
Raúl Estrada
No ratings yet
Learning Real-Time Processing With Spark Streaming - Sample Chapter
Document30 pages
Learning Real-Time Processing With Spark Streaming - Sample Chapter
Packt Publishing
No ratings yet
What Is Apache Spark - Azure Synapse Analytics - Microsoft Docs
Document6 pages
What Is Apache Spark - Azure Synapse Analytics - Microsoft Docs
demetrius albuquerque
No ratings yet
Native Docker Clustering with Swarm
From Everand
Native Docker Clustering with Swarm
Fabrizio Soppelsa
No ratings yet
BDA Lab A7
Document10 pages
BDA Lab A7
the.quote.villa
No ratings yet
A concise guide to PHP MySQL and Apache
From Everand
A concise guide to PHP MySQL and Apache
alasdair gilchrist
Rating: 4 out of 5 stars
4/5 (2)
DevOps с Laravel 3. Kubernetes
Document92 pages
DevOps с Laravel 3. Kubernetes
agris.markus
No ratings yet
Axis The New Incarnation of Apache SOAP
Document0 pages
Axis The New Incarnation of Apache SOAP
murthy_oct24
No ratings yet
Apache Storm Thesis
Document7 pages
Apache Storm Thesis
juliemaypeoria
100% (2)
Apache Spark
Document22 pages
Apache Spark
abhishek63489551
No ratings yet
Apache Mahout
Document4 pages
Apache Mahout
levin696
No ratings yet
Spark Introduction
Document25 pages
Spark Introduction
sr_saurab8511
No ratings yet
Spark Introduction
Document4 pages
Spark Introduction
VIKAS YADAV
No ratings yet
Web Services - Axis
Document28 pages
Web Services - Axis
Kim Hoàng Dương
No ratings yet
Spark Tutorial
Document8 pages
Spark Tutorial
Dukool Sharma
No ratings yet
Interview Question
Document24 pages
Interview Question
Anil Yarlagadda
No ratings yet
Pyspark Tutorial
Document27 pages
Pyspark Tutorial
balha
100% (1)
Swagger Annotations: Openapi
Document2 pages
Swagger Annotations: Openapi
Srinu S
No ratings yet
Anynchronous APIs Whitepaper
Document6 pages
Anynchronous APIs Whitepaper
Alejandro
No ratings yet
Learning Apache Kafka - Second Edition - Sample Chapter
Document12 pages
Learning Apache Kafka - Second Edition - Sample Chapter
Packt Publishing
No ratings yet
Bigdata Notes
Document26 pages
Bigdata Notes
Anil Yarlagadda
No ratings yet
Apache Flink
Document116 pages
Apache Flink
Aylin Koroglu
No ratings yet
Lambda - A Modern Big Data Architecture 5 - 12 PDF
Document128 pages
Lambda - A Modern Big Data Architecture 5 - 12 PDF
Harnoor Sachdeva
No ratings yet
Key Features: General-Purpose Fast Cluster Computing Platform
Document16 pages
Key Features: General-Purpose Fast Cluster Computing Platform
Mahesh VP
No ratings yet
Webservices - Axis: 1.1. Table of Contents
Document35 pages
Webservices - Axis: 1.1. Table of Contents
api-19917789
No ratings yet
Apache Spark: Dhineshkumar S K
Document31 pages
Apache Spark: Dhineshkumar S K
PREM KUMAR M
No ratings yet
7 Steps For A Developer To Learn Apache Spark
Document30 pages
7 Steps For A Developer To Learn Apache Spark
wisepaladin9706
No ratings yet
7 Steps For A Developer To Learn Apache Spark
Document30 pages
7 Steps For A Developer To Learn Apache Spark
Anubhav Sinha
No ratings yet
Apache Kafka Confluent Enterprise Ref Architecture
Document17 pages
Apache Kafka Confluent Enterprise Ref Architecture
Mouhamadou Naby DIA
No ratings yet
Data Engineering Assignment Report
Document9 pages
Data Engineering Assignment Report
Ranjita Mishra
No ratings yet
Prasanth Kothuri, Danilo Piparo, Enric Tejedor Saavedra, Diogo Castro Cern It and Ep-Sft
Document22 pages
Prasanth Kothuri, Danilo Piparo, Enric Tejedor Saavedra, Diogo Castro Cern It and Ep-Sft
Ade Rahman
No ratings yet
Normalization With Examples
Document21 pages
Normalization With Examples
rosalia
No ratings yet
DoDAF V2 - Volume 3
Document13 pages
DoDAF V2 - Volume 3
scranidi
No ratings yet
Week Five Assignment Database Modeling and Normalization
Document9 pages
Week Five Assignment Database Modeling and Normalization
Evans Oduor
No ratings yet
Lec 7 Notes
Document2 pages
Lec 7 Notes
Hii There
No ratings yet
5f62f4d4a52ca Bibhuranjan Sahoo
Document1 page
5f62f4d4a52ca Bibhuranjan Sahoo
Kunal Nag
No ratings yet
Penyerahan Dan Penilaian Tugasan CBDB4103 Intermediate Database MAY 2023
Document11 pages
Penyerahan Dan Penilaian Tugasan CBDB4103 Intermediate Database MAY 2023
suria
No ratings yet
Dimensional Modeling: Prof. Sunita Sahu
Document50 pages
Dimensional Modeling: Prof. Sunita Sahu
Nirav Rana
No ratings yet
Relational Databases
Document19 pages
Relational Databases
Michael
No ratings yet
Chapter 5 System Design
Document9 pages
Chapter 5 System Design
Jebaraj Jeeva
No ratings yet
SQL Commands
Document10 pages
SQL Commands
Rucha Gavaskar
No ratings yet
Lesson 3.1 The Three-Schema Architecture
Document7 pages
Lesson 3.1 The Three-Schema Architecture
Florence Britania-Reyes
No ratings yet
Relational DB Checklist
Document2 pages
Relational DB Checklist
GabrielHolandini
No ratings yet
Cheat Sheet: From Spark Data Sources SQL Queries
Document1 page
Cheat Sheet: From Spark Data Sources SQL Queries
anuja shinde
No ratings yet
Deber Consultas Distribuidas
Document1 page
Deber Consultas Distribuidas
Luis Chica Moncayo
No ratings yet
NoSQL Databases - Lecture 12 - Introduction To Databases (1007156ANR)
Document35 pages
NoSQL Databases - Lecture 12 - Introduction To Databases (1007156ANR)
Beat Signer
No ratings yet
Oracle SQL Revision Tour and Database Fundamentals
Document5 pages
Oracle SQL Revision Tour and Database Fundamentals
Aravind Maratha
No ratings yet
Class X: Unit-3 Relational Database Management Systems (Basic)
Document8 pages
Class X: Unit-3 Relational Database Management Systems (Basic)
kiran
No ratings yet
Handy Mysql Commands Description Command
Document3 pages
Handy Mysql Commands Description Command
sandeep reddy
No ratings yet
MySQL Presentation
Document39 pages
MySQL Presentation
harderharder
100% (2)
DBMS Fourth Chapter Part-1
Document165 pages
DBMS Fourth Chapter Part-1
Ravi Ramegowda
No ratings yet
CH-16 Tables and Integrity Constraints PDF
Document4 pages
CH-16 Tables and Integrity Constraints PDF
Jay Sanduke
No ratings yet
Web-Mca 1 - MCQ
Document8 pages
Web-Mca 1 - MCQ
Makeshifter Singh
No ratings yet
Name: Nikhitha Kasaraneni Email: Phone: (469) 983-8508
Document6 pages
Name: Nikhitha Kasaraneni Email: Phone: (469) 983-8508
kiran2710
No ratings yet
C. Management Information System
Document10 pages
C. Management Information System
MostafaElBaz
No ratings yet
DBMS Assignment 3
Document2 pages
DBMS Assignment 3
IT 04 ADITYA SAGAR PANDEY
No ratings yet
SQL Tutorial
Document41 pages
SQL Tutorial
Tsegaye Hailu
No ratings yet
Which of The Following Is A Challenge in A J2EE?: (A) Fault Tolerance (B) Durability (C) Scalability (D) Reliability
Document19 pages
Which of The Following Is A Challenge in A J2EE?: (A) Fault Tolerance (B) Durability (C) Scalability (D) Reliability
M Naveed Shakir
100% (1)
Unit - 3 RDBMS
Document51 pages
Unit - 3 RDBMS
Liyakath Ali
No ratings yet
Student Schema
Document1 page
Student Schema
Chuka Osemeka
No ratings yet
Phung Thuan
Document12 pages
Phung Thuan
36.Phùng Thuận
No ratings yet