Big Data

Uploaded by

sukhpreet singh

0% found this document useful (0 votes)

14 views7 pages

Original Title

big data

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

14 views7 pages

Big Data

Uploaded by

sukhpreet singh

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 7

Search inside document

Big Data Analytics: A

Comparative Evaluation
of Apache Hadoop and
Apache Spark
In this presentation, we'll be exploring the differences between two of
the most popular big data processing frameworks, Apache Hadoop
and Apache Spark.

by Sukhpreet Singh
What is Big Data Analytics?
1 Definition 2 Importance

Big Data Analytics refers Big Data Analytics enables

to the process of organizations to drive
extracting insights and innovation and make
valuable information from data-driven decisions that
large and complex can lead to greater
datasets. efficiency and
profitability.

3 Tools

There are various tools available for Big Data Analytics, but
Apache Hadoop and Apache Spark are two of the most widely
used platforms.
Overview of Apache Hadoop

What is Hadoop? How does it work?

Apache Hadoop is an open-source Big Data Hadoop stores data across multiple servers in
processing framework that allows distributed a distributed file system called Hadoop
storage and processing of large datasets across Distributed File System (HDFS). The processing
computing clusters. itself is done using a framework called
MapReduce.
Overview of Apache Spark
What is Spark? How does it work? Features

Apache Spark is an open- Spark uses a processing Spark includes a wide

source Big Data engine built on top of range of features,
processing engine that Hadoop's MapReduce including support for real-
allows fast and efficient framework, but with time stream processing,
processing of large some important machine learning, graph
datasets in a distributed modifications that allow processing, and more.
fashion. faster and more efficient
processing, including in-
memory processing and
caching.
Comparison between Hadoop and Spark
Applications

Both platforms can be used for a

wide range of Big Data
Scalability
processing applications, but
Both platforms are highly Spark is better suited for certain
scalable, but Spark tends to be types of processing, such as
more efficient due to its in- machine learning and real-time
memory processing capabilities. stream processing.

1 2 3 4

Speed Usability

Spark is generally faster than Hadoop can be more complex to

Hadoop, especially for iterative set up and use, while Spark has a
processing and real-time stream simpler and more user-friendly
processing. API.
Evaluation Criteria
Performance Scalability

How well does each platform handle large- How easy is it to scale each platform to
scale data processing? handle larger and more complex datasets?

Usability Features

How easy is it to use and learn each What are the key features of each platform,
platform? and how well do they meet the needs of
your specific use case?
Conclusion

Which is better? Final Thoughts

There is no clear answer to this question, as it Both Apache Hadoop and Apache Spark are
largely depends on your specific use case and powerful Big Data processing platforms that
requirements. can help organizations gain valuable insights
from their data.

Big Data Analytics
From Everand
Big Data Analytics
Venkat Ankam
No ratings yet
Hadoop vs. Spark: The New Age of Big Data
Document7 pages
Hadoop vs. Spark: The New Age of Big Data
adnanbw
No ratings yet
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Big Data Analytics: A Comparative Evaluation of Apache Hadoop and Apache Spark
Document8 pages
Big Data Analytics: A Comparative Evaluation of Apache Hadoop and Apache Spark
sukhpreet singh
No ratings yet
Introduction to Big Data Technologies
Document10 pages
Introduction to Big Data Technologies
indolent56
No ratings yet
Big Data Processing With Apache Spark - Infoqdotcom
Document16 pages
Big Data Processing With Apache Spark - Infoqdotcom
abhijitch
No ratings yet
Big Data Processing With Apache Spark
Document17 pages
Big Data Processing With Apache Spark
abhijitch
No ratings yet
Tech Seminar Report
Document5 pages
Tech Seminar Report
Saikumar Thurai
No ratings yet
Top Spark Interview Questions
Document32 pages
Top Spark Interview Questions
srinivas75k
No ratings yet
Hadoop Vs Spark Vs Kafka - Comparing Big Data & Distributed Streaming Tools
Document4 pages
Hadoop Vs Spark Vs Kafka - Comparing Big Data & Distributed Streaming Tools
adnanbw
No ratings yet
Shark
Document24 pages
Shark
kapilkashyap3105
No ratings yet
Spark Interview 4
Document10 pages
Spark Interview 4
consania
No ratings yet
Compare Hadoop vs. Spark vs. Kafka For Your Big Data Strategy
Document10 pages
Compare Hadoop vs. Spark vs. Kafka For Your Big Data Strategy
usman
No ratings yet
Homework 4 (24 11 2022)
Document2 pages
Homework 4 (24 11 2022)
Prabha K
No ratings yet
Module 3
Document51 pages
Module 3
sagarhn sagarhn
No ratings yet
Spark Interview Ques1
Document20 pages
Spark Interview Ques1
Nareshkumar Nakirikanti
No ratings yet
226 Unit-7
Document26 pages
226 Unit-7
shivam saxena
No ratings yet
Apache Spark Quick Guide
Document21 pages
Apache Spark Quick Guide
Oumaima Alfa
100% (1)
Learn Apache Spark
Document31 pages
Learn Apache Spark
abreddy2003
100% (1)
A Comparative Study On Apache Spark and Map Reduce With Performance Analysis Using KNN and Page Rank Algorithm
Document6 pages
A Comparative Study On Apache Spark and Map Reduce With Performance Analysis Using KNN and Page Rank Algorithm
Editor IJTSRD
No ratings yet
Spark: Prepared by Dulari Bhatt
Document19 pages
Spark: Prepared by Dulari Bhatt
Dulari Bosamiya Bhatt
No ratings yet
Unit 4
Document60 pages
Unit 4
Ramstage Testing
No ratings yet
Spark SQL
Document25 pages
Spark SQL
Rishi
No ratings yet
A Brief Introduction To Apache Spark
Document10 pages
A Brief Introduction To Apache Spark
Venkatesh Narisetty
No ratings yet
Apache Spark PDF
Document34 pages
Apache Spark PDF
sowjanya kandukuri
No ratings yet
Spark Interview Questions: Click Here
Document35 pages
Spark Interview Questions: Click Here
Keshav Krishna
No ratings yet
Hadoopvsspark 180108070838
Document17 pages
Hadoopvsspark 180108070838
salah Alswiay
No ratings yet
Apache Spark For Beginners
Document30 pages
Apache Spark For Beginners
ankesh patel
No ratings yet
Apache Spark
Document25 pages
Apache Spark
PhillipeSantos
No ratings yet
Introduction To Spark
Document23 pages
Introduction To Spark
Trần Nguyên Thái Bảo
No ratings yet
Compare Hadoop and Spark.: Table
Document10 pages
Compare Hadoop and Spark.: Table
consania
No ratings yet
Cloudera Developer Training for Spark & Hadoop (DSH
Document4 pages
Cloudera Developer Training for Spark & Hadoop (DSH
Aiswarya Nimmagadda
No ratings yet
Top Answers To Spark Interview Questions
Document4 pages
Top Answers To Spark Interview Questions
Ejaz Alam
No ratings yet
The Big Big Data' Question Hadoop or Spark
Document3 pages
The Big Big Data' Question Hadoop or Spark
Rajiv Nayan
No ratings yet
Spark Intreview FAQ
Document21 pages
Spark Intreview FAQ
haranadhc
100% (1)
Apache Spark
Document16 pages
Apache Spark
Kolariya Dheeraj
No ratings yet
Big Data Technologies
Document31 pages
Big Data Technologies
AdiTan00
No ratings yet
Apache Spark Interview Questions and Answers PDF
Document31 pages
Apache Spark Interview Questions and Answers PDF
Zyad Ahmed
No ratings yet
Big Data With Hadoop & Spark - Introduction
Document28 pages
Big Data With Hadoop & Spark - Introduction
Premjit Sengupta
No ratings yet
Introduction To Hadoop & Spark
Document28 pages
Introduction To Hadoop & Spark
Justin Talbot
No ratings yet
Apache Spark Interview Questions
Document12 pages
Apache Spark Interview Questions
varun3dec1
No ratings yet
Dataengineering - v2.0 - PDF - 2 - Batch Processing of Data With Spark and Hadoop On GCP - M2 - Executing Spark On Cloud Dataproc
Document67 pages
Dataengineering - v2.0 - PDF - 2 - Batch Processing of Data With Spark and Hadoop On GCP - M2 - Executing Spark On Cloud Dataproc
Edgar Sanchez
No ratings yet
Ace Your Apache Spark Interview
Document22 pages
Ace Your Apache Spark Interview
Venmo 6193
0% (1)
Chapter 2 Hadoop Eco System
Document34 pages
Chapter 2 Hadoop Eco System
lamisaldhamri237
No ratings yet
BDA-UNIT-6
Document14 pages
BDA-UNIT-6
belwalkarvarad
No ratings yet
Apache Spark Features
Document2 pages
Apache Spark Features
nitinlucky
No ratings yet
Spark In-Memory Cluster Computing
Document7 pages
Spark In-Memory Cluster Computing
Sailesh Chauhan
No ratings yet
Pyspark Modules&packages RDD
Document9 pages
Pyspark Modules&packages RDD
klogeswaran.it
No ratings yet
Apache Spark Primer 170303
Document8 pages
Apache Spark Primer 170303
selives
No ratings yet
Spark Interview Questions
Document19 pages
Spark Interview Questions
santosh kumar
No ratings yet
CC-KML051-Unit V
Document17 pages
CC-KML051-Unit V
Fdjs
No ratings yet
Apache Spark Interview Questions and Answers For 2020
Document8 pages
Apache Spark Interview Questions and Answers For 2020
Shashank Abhishek
No ratings yet
Spark Streaming Research
Document6 pages
Spark Streaming Research
reshmashaik4656
No ratings yet
Hadoop vs Apache Spark
Document6 pages
Hadoop vs Apache Spark
indolent56
No ratings yet
Performance Comparison of Apache Hadoop and Apache Spark
Document5 pages
Performance Comparison of Apache Hadoop and Apache Spark
salah Alswiay
No ratings yet
Unit-5 Spark
Document20 pages
Unit-5 Spark
Siva
No ratings yet
unit 4 spark cassendra
Document41 pages
unit 4 spark cassendra
downloadjain123
No ratings yet
Apache Spark Tutorial
Document6 pages
Apache Spark Tutorial
abhimanyu thakur
100% (1)
BDTools
Document15 pages
BDTools
Tanishq Upreti
No ratings yet
Big Data Hadoop Stack
Document52 pages
Big Data Hadoop Stack
Yaser Ali Tariq
No ratings yet
BCA System Analysis MCQs
Document38 pages
BCA System Analysis MCQs
Parmpreet Singh
100% (2)
III B.COM Core Principles of Multimedia MCQs
Document24 pages
III B.COM Core Principles of Multimedia MCQs
Pidoon Esm
76% (42)
Bcab - SC (It) Class List
Document1 page
Bcab - SC (It) Class List
sukhpreet singh
No ratings yet
Unit 2 Image Compression Multiple Choice
Document7 pages
Unit 2 Image Compression Multiple Choice
Navaneetha Krishnan
No ratings yet
Subsidized Solar Pumps for Farmers in Punjab Under MNRE Scheme
Document28 pages
Subsidized Solar Pumps for Farmers in Punjab Under MNRE Scheme
sukhpreet singh
No ratings yet
SIG
Document43 pages
SIG
sukhpreet singh
No ratings yet
Conference Guide
Document23 pages
Conference Guide
apple987
No ratings yet
Sapanjeet Kaur Sidhu Research Work
Document1 page
Sapanjeet Kaur Sidhu Research Work
sukhpreet singh
No ratings yet
Environmentppt 121202081136 Phpapp02
Document18 pages
Environmentppt 121202081136 Phpapp02
xallilax
No ratings yet
CG MSCIT4th
Document1 page
CG MSCIT4th
sukhpreet singh
No ratings yet
Aa
Document1 page
Aa
sukhpreet singh
No ratings yet
Conference Guide
Document23 pages
Conference Guide
apple987
No ratings yet
72 1 1 Part1 Compressed
Document18 pages
72 1 1 Part1 Compressed
sukhpreet singh
No ratings yet
Java Inheritance PDF
Document7 pages
Java Inheritance PDF
Mallikarjun Aradhya
No ratings yet
Java Inheritance PDF
Document7 pages
Java Inheritance PDF
Mallikarjun Aradhya
No ratings yet
MCQ Questions - On Enviromantal Studies
Document7 pages
MCQ Questions - On Enviromantal Studies
Mehul Joshie
No ratings yet
Notification Employees Provident Fund Organisation Assistant Posts PDF
Document40 pages
Notification Employees Provident Fund Organisation Assistant Posts PDF
Mahesh Pawar
No ratings yet
Global Mindset
Document1 page
Global Mindset
kong_yau_2
No ratings yet
Cambridge International AS & A Level Biology grade thresholds for June 2019
Document2 pages
Cambridge International AS & A Level Biology grade thresholds for June 2019
bloom
No ratings yet
156 - Tapoja Ray - Group 2 Assigned Leadership
Document3 pages
156 - Tapoja Ray - Group 2 Assigned Leadership
payal
No ratings yet
Campus Drive - Zieta Tech. Pvt. Ltd. Notice
Document1 page
Campus Drive - Zieta Tech. Pvt. Ltd. Notice
Bhabani Guddy
No ratings yet
What Are Phrasal Verbs?: Types
Document2 pages
What Are Phrasal Verbs?: Types
Heidy Teresa Morales
No ratings yet
Lesson Plan in Science 6
Document9 pages
Lesson Plan in Science 6
Alexander Pamulagan Baterna
No ratings yet
Black History Month Lesson
Document11 pages
Black History Month Lesson
J Johnson
No ratings yet
Clothing - Female - Stays - English & American
Document220 pages
Clothing - Female - Stays - English & American
The 18th Century Material Culture Resource Center
100% (28)
Mahinay Trinidad Videos Study PDF
Document9 pages
Mahinay Trinidad Videos Study PDF
Kristine Joy Dungganon
No ratings yet
ConvNet For The 2020s
Document12 pages
ConvNet For The 2020s
Hamza OKD
No ratings yet
Four Stage Phil Iri Administration 1
Document37 pages
Four Stage Phil Iri Administration 1
LOLITA DE LEON
No ratings yet
Best Beginner Guitar Books for Kids
Document3 pages
Best Beginner Guitar Books for Kids
tvis Music
No ratings yet
457-Article Text-2481-3-10-20220703
Document9 pages
457-Article Text-2481-3-10-20220703
ADE SRI RAHAYU
No ratings yet
CPL Procedure New
Document5 pages
CPL Procedure New
Shibin Johney
No ratings yet
9 Classroom Practices
Document4 pages
9 Classroom Practices
api-356378736
No ratings yet
BED1107 Introduction To Social Work and Social Welfare Resources
Document4 pages
BED1107 Introduction To Social Work and Social Welfare Resources
Ibrahim Mohamed Ibrahim
No ratings yet
Ejercicio de So and Neither
Document4 pages
Ejercicio de So and Neither
Cinthya Zamora
No ratings yet
Types of Assessment Instruments For English Language Learning
Document19 pages
Types of Assessment Instruments For English Language Learning
Shayra Castillo
No ratings yet
DepEd Order On Time Allotment Per Learning Areas - TeacherPH
Document1 page
DepEd Order On Time Allotment Per Learning Areas - TeacherPH
sjnhs SPA
No ratings yet
Howto 30 Ise Profiling
Document118 pages
Howto 30 Ise Profiling
Anonymous 0eCNZp2M
No ratings yet
Ophthalmology For The Primary Care Physician
Document393 pages
Ophthalmology For The Primary Care Physician
Catana Tudor
No ratings yet
Introduction To The EU-BIC Quality System
Document9 pages
Introduction To The EU-BIC Quality System
Enes Buladı
100% (1)
Kalkhaire Shubham V.
Document2 pages
Kalkhaire Shubham V.
Cĥĕťáń Ťĩķám
No ratings yet
Philippine Professional Standards for Teachers Guide
Document12 pages
Philippine Professional Standards for Teachers Guide
Ania
No ratings yet
Car Owners Lucknow
Document21 pages
Car Owners Lucknow
fun one
50% (2)
Gador-Whyte, Sarah Mellas, Andrew - Hymns, Homilies and Hermeneutics in Byzantium
Document5 pages
Gador-Whyte, Sarah Mellas, Andrew - Hymns, Homilies and Hermeneutics in Byzantium
Robert-Andrei Bălan
No ratings yet
Passive Voice Lesson Plan
Document2 pages
Passive Voice Lesson Plan
api-437615450
No ratings yet
Ethics and Accountability in Philippine
Document32 pages
Ethics and Accountability in Philippine
Melchor Padilla Dioso
No ratings yet
LESSON PLAN: Writing Effective Informal Emails
Document43 pages
LESSON PLAN: Writing Effective Informal Emails
Emily James
No ratings yet
Residential Nursing Home Proposal
Document22 pages
Residential Nursing Home Proposal
api-324136209
No ratings yet