
TRINITY INSTITUTE OF TECHNOLOGY & RESEARCH BHOPAL

SUBJECT – BIG DATA


Topic Name
HADOOP
Presented By:-
RAJNEESH KORI
Enrol No. 0198CS201068

Under the Guidance of:-
Prof. VANDNA GAUTAM
Prof. URMILA MAHOR, HOD (CS)
TRINITY, Bhopal
CONTENTS

• Introduction.
• What is Hadoop.
• History of Hadoop.
• Main components.
• Features of Hadoop.
• Hadoop ecosystem.
• Advantages.
• Disadvantages.
Introduction

• Hadoop is an open-source software framework that is used for storing and processing large amounts of data in a distributed computing environment.

• It is designed to handle big data and is based on the MapReduce programming model, which allows for the parallel processing of large datasets.
What is Hadoop
• Hadoop is an open-source software programming framework for storing large amounts of data and performing computations on it. The framework is based on Java, with some native code in C and shell scripts. A minimal word-count sketch of its MapReduce model is shown below.

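A minimal sketch, in Java, of the MapReduce model mentioned above: a word-count mapper emits a (word, 1) pair for every word in its input split, and a reducer sums the counts for each word; the framework runs many mapper tasks in parallel across the cluster. It assumes the standard Hadoop MapReduce API (org.apache.hadoop.mapreduce); the class names WordCountMapper and WordCountReducer are illustrative, not part of this presentation.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

// Mapper: runs in parallel on splits of the input and emits (word, 1) pairs.
class WordCountMapper extends Mapper<Object, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
        StringTokenizer tokens = new StringTokenizer(value.toString());
        while (tokens.hasMoreTokens()) {
            word.set(tokens.nextToken());
            context.write(word, ONE); // emit (word, 1)
        }
    }
}

// Reducer: receives all counts for one word and writes the total.
class WordCountReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable total = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        int sum = 0;
        for (IntWritable value : values) {
            sum += value.get();
        }
        total.set(sum);
        context.write(key, total); // emit (word, total count)
    }
}

A driver that configures and submits such a job is sketched later, in the Hadoop ecosystem section.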
History of Hadoop
• Hadoop was developed under the Apache Software Foundation, and its co-founders are Doug Cutting and Mike Cafarella. Doug Cutting named it after his son's toy elephant. In October 2003, Google released its Google File System paper. In January 2006, MapReduce development started on Apache Nutch, with around 6,000 lines of code for MapReduce and around 5,000 lines of code for HDFS. In April 2006, Hadoop 0.1.0 was released.

• Hadoop is an open-source software framework for storing and processing big data. It was created under the Apache Software Foundation in 2006, based on papers published by Google in 2003 and 2004 that described the Google File System (GFS) and the MapReduce programming model.
Hadoop has two main components:

• HDFS (Hadoop Distributed File System): This is the storage component of Hadoop, which allows for the storage of large amounts of data across multiple machines. It is designed to work with commodity hardware, which makes it cost-effective (see the sketch after this list).

• YARN (Yet Another Resource Negotiator): This is the resource management component of Hadoop, which manages the allocation of resources (such as CPU and memory) for processing the data stored in HDFS.
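As an illustration of the storage component, here is a minimal sketch, in Java, of writing a file to HDFS and reading it back through the Hadoop FileSystem API (org.apache.hadoop.fs). The NameNode address and the file path used below are assumptions for the example, not values from this presentation.

import java.nio.charset.StandardCharsets;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

class HdfsReadWriteExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Assumed NameNode address; normally this comes from core-site.xml.
        conf.set("fs.defaultFS", "hdfs://localhost:9000");
        FileSystem fs = FileSystem.get(conf);

        Path path = new Path("/user/demo/hello.txt"); // hypothetical path

        // Write: HDFS splits the file into blocks and replicates them across DataNodes.
        try (FSDataOutputStream out = fs.create(path, true)) {
            out.write("Hello, HDFS!".getBytes(StandardCharsets.UTF_8));
        }

        // Read the file back and copy its contents to standard output.
        try (FSDataInputStream in = fs.open(path)) {
            IOUtils.copyBytes(in, System.out, 4096, false);
        }
        fs.close();
    }
}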
Features of Hadoop

• It is fault tolerant.

• It is highly available.

• Its programming model is easy to use.

• It offers huge, flexible storage.

• It is low cost.
Hadoop ecosystem

• Introduction: The Hadoop ecosystem is a platform, or suite of tools, that provides various services to solve big data problems. It includes Apache projects as well as various commercial tools and solutions. There are four major elements of Hadoop: HDFS, MapReduce, YARN, and Hadoop Common. Most of the other tools or solutions are used to supplement or support these major elements. All of these tools work collectively to provide services such as ingestion, analysis, storage, and maintenance of data. The driver sketch below shows how the core elements come together in a single job.
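To show how these core elements work together, below is a minimal sketch, in Java, of a driver that submits the word-count job from the earlier slide: the input and output paths live in HDFS, the WordCountMapper and WordCountReducer classes implement MapReduce, and YARN allocates resources for the tasks when the job is submitted. The class names and command-line paths are illustrative assumptions.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

class WordCountDriver {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCountDriver.class);

        // MapReduce: the mapper and reducer sketched earlier.
        job.setMapperClass(WordCountMapper.class);
        job.setReducerClass(WordCountReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        // HDFS: input and output paths, e.g. /user/demo/input and /user/demo/output.
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        // YARN: allocates CPU and memory for the map and reduce tasks on submission.
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}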
Advantages

• Ability to store a large amount of data.
• High flexibility.
• Cost effective.
• High computational power.
• Tasks are independent.
• Linear scaling.
Disadvantages
• Not very effective for small data.
• Hard cluster management.
• Has stability issues.
• Security concerns.
• Complexity: Hadoop can be complex to set up and maintain, especially for
organizations without a dedicated team of experts.
• Latency: Hadoop is not well-suited for low-latency workloads and may not be the
best choice for real-time data processing.
• Limited Support for Real-time Processing: Hadoop’s batch-oriented nature makes it
less suited for real-time streaming or interactive data processing use cases.
THANK YOU
