1 &2 - The Big Data Technology Landscape Myppy

Uploaded by

Shushanth munna

0% found this document useful (0 votes)

12 views25 pages

Original Title

1 &2 . The Big Data Technology Landscape myppy

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

12 views25 pages

1 &2 - The Big Data Technology Landscape Myppy

Uploaded by

Shushanth munna

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 25

Search inside document

THE BIG DATA

TECHNOLOGY
LANDSCAPE
By:Syed Nawaz
Asst. Professor
SREC
CSE
Topics To Learn
 What is NOSQL
 Where it is Used
 Types of NOSQL Databases
 Why NoSQL
 Advantages of noSQL
 What we miss with NoSQL
 Difference between SQL and noSQL
What is noSQL
 Not only SQL
 Non-relational, opensource, distributed DB’s which
can handle rich variety of data
(struc,semi,unstructured).
Features of NoSQL
 NoSQL DB’s are non-relational:Store data in the
form of key-value pairs,document-oreinted or
column-oriented or graph-based DB’s.
 Distributed
 No support for ACID: Follow CAP thereom
 No fixed Schema:flexible schema
Types of NoSQL Databases
 Key-value
 Document
 Column
 Graph
Key-value
 It maintains a big hash table of keys and values.
 Ex: Dynamo,Redis,Riak etc

Key Value
First name Sai
Last name kumar
Document Database
 It maintains data in collections constituted of
documents.
 Ex: MongoDB, Apache CouchDB, Couchbase etc…
 Sample document database:
 {
 “Book name”: “BDA”,
 “publication”: “wiley India”
 “Year of Publication”: “2011”
 }
Column
 Each storage block has data from only one column.
 Ex: Cassandra, HBase etc…
Graph
 Also called as Network database.
 Graph stores data in nodes.
 Ex: Neo4j,HyperGraphDB…
Why NoSQL
 Scale Out architecture
 Store variety of data
 Dynamic Schema
 Auto Sharding: Spread data across different nodes
 Replication: Availabity,Fault tolerance and
recovery
Advantages of Nosql
What we miss with NoSQL
 Joins
 GroupBy
 ACID properties
 SQL
 Easy integration with other applications that
support SQL.
Nosql in industry
SQL vs NoSQL
Hadoop
 It is an open source framework given by apache
software foundation for storing and processing
huge datasets with a cluster of commodity
hardware.
HDFS Architecture
Features of hadoop
 Optimized to handle massive quantities of data.
 Shared nothing architecture
 Data replication
 High throughput
 Complements OLTP and OLAP
 NOT good when work cannot be parallelized
 NOT good for processing small files
Key advantages of Hadoop
 Stores data in native format
 Scalable
 Cost-effective
 Resilient to failure
 Flexible
 Fast
Versions of Hadoop
Hadoop Ecosystem
Hadoop Ecosystem
 HDFS: It simply stores data files .
 Hbase: Hadoop’s database. It supports structured data storage for large
databases.
 Hive: Similar to ANSI SQL.
 Pig: data flow language. Pig scripts are automatically converted to map
reduce programs by pig interpreter.
 Zookeeper: coordination service for distributed applications.
 Oozie: workflow schedular to manage hadoop jobs.
 Mahout : scalable machine learning and data mining library.
 Chukwa: data collection system for managing large distributed systems.
 Sqoop: data transfer between RDBMS and hadoop.
 Ambari : web based tool for provisioninig,managing and monitoring hadoop
cluster.
Anatomy of File read in hadoop
Anatomy of File write in hadoop
Working with HDFS commands
 Hadoop fs –ls /
 Hadoop fs –ls –R /
 Hadoop fs –mkdir /sample
 Hadoop fs –put /root/sample/test.txt /sample/test.txt
 Hadoop fs –get /sample/test.txt /root/sample/testsample.txt
 Hadoop fs –copyfromlocal /roor/sample/test.txt
/sample/testsample.txt
 Hadoop fs –cat /sample/test.txt
 Hadoop fs –cp /sample/test.txt /sample1
 Hadoop fs –rm-r /sample1
 Successfully compiled ur drivercode,maper code and reducer code
 Export your jar files
 WordCount.jar
 $hadoop jar WordCount.jar /packdecmo/WordCount
/sample/test.txt /sample/wordcountoutput
 Wordcountoutput

 Hadoop fs –cat /wordcountoutput/part-r-00000

 Hi,2
Part-r-00000
 Cse,1
_sucess
 Student,1 s

Hadoop Ecosystem
Document56 pages
Hadoop Ecosystem
RUGAL NEEMA MBA 2021-23 (Delhi)
No ratings yet
The Big Data Technology Landscape
Document36 pages
The Big Data Technology Landscape
Ponnusamy S Pichaimuthu
No ratings yet
BDA Unit 2 Q&A
Document14 pages
BDA Unit 2 Q&A
viswakranthipalagiri
No ratings yet
Introduction to NOSQL Databases
Document40 pages
Introduction to NOSQL Databases
Irfan Pinjari
No ratings yet
NoSQL Database
Document5 pages
NoSQL Database
Sindhu Wardhana
No ratings yet
Hadoop Ecosystem
Document55 pages
Hadoop Ecosystem
nehal
No ratings yet
Ibm Hadoop
Document4 pages
Ibm Hadoop
4022 MALISHWARAN M
No ratings yet
Hortonworks Data Platform (HDP)
Document56 pages
Hortonworks Data Platform (HDP)
Harshit Bansal
100% (1)
The Hadoop Ecosystem Explained
Document55 pages
The Hadoop Ecosystem Explained
Rishabh Gupta
No ratings yet
Hadoop Ecosystem PDF
Document55 pages
Hadoop Ecosystem PDF
Rishabh Gupta
No ratings yet
Apache HIVE
Document105 pages
Apache HIVE
hemanth kumar p
100% (1)
Learn Hbase in 24 Hours
From Everand
Learn Hbase in 24 Hours
Alex Nordeen
No ratings yet
Unit_2_notes (1)
Document15 pages
Unit_2_notes (1)
sanchitghare
No ratings yet
Introduction To Hadoop
Document5 pages
Introduction To Hadoop
Hanumanthu Gouthami
No ratings yet
2 Hadoop
Document20 pages
2 Hadoop
YASH PRAJAPATI
No ratings yet
Hadoop Unit-4
Document44 pages
Hadoop Unit-4
Kishore Parimi
No ratings yet
A Review Paper On Big Data Database'S: Cassandra, Hbase, Hive
Document6 pages
A Review Paper On Big Data Database'S: Cassandra, Hbase, Hive
shani thakur
No ratings yet
BDA Presentations Unit-4 - Hadoop, Ecosystem
Document25 pages
BDA Presentations Unit-4 - Hadoop, Ecosystem
Ashish Chauhan
No ratings yet
Guided By:-Prof. K. Kakwani: Payal M. Wadhwani
Document24 pages
Guided By:-Prof. K. Kakwani: Payal M. Wadhwani
Ravi Joshi
No ratings yet
Big Data Technology Stack
Document12 pages
Big Data Technology Stack
Khalid Imran
No ratings yet
Assignment 6
Document12 pages
Assignment 6
Pujan Patel
No ratings yet
Top 18 Free and Widely Used, Open Source NoSQL Databases
Document4 pages
Top 18 Free and Widely Used, Open Source NoSQL Databases
dchandra15
No ratings yet
6 H Data With Hive Big Data Analytics B.tech. Final Year
Document24 pages
6 H Data With Hive Big Data Analytics B.tech. Final Year
RISHIKA ARORA
No ratings yet
Comparing Database Systems
Document77 pages
Comparing Database Systems
Sarvesh Dharme
No ratings yet
Hadoop Ecosystem Components
Document6 pages
Hadoop Ecosystem Components
Kittu
No ratings yet
S - Hadoop Ecosystem
Document14 pages
S - Hadoop Ecosystem
trancongquang2002
No ratings yet
BDA Lab Assignment 3 PDF
Document17 pages
BDA Lab Assignment 3 PDF
parth shah
No ratings yet
SQL and Nosql Programming With Spark
Document63 pages
SQL and Nosql Programming With Spark
Huy Nguyễn
No ratings yet
BigData Unit 2
Document15 pages
BigData Unit 2
Sreedhar Arikatla
No ratings yet
Big Data Analytics: A Comparative Evaluation of Apache Hadoop and Apache Spark
Document8 pages
Big Data Analytics: A Comparative Evaluation of Apache Hadoop and Apache Spark
sukhpreet singh
No ratings yet
Certified Hadoop and Spark Course Curriculum
Document9 pages
Certified Hadoop and Spark Course Curriculum
mano555
No ratings yet
h13999 Hadoop Ecs Data Services WP
Document9 pages
h13999 Hadoop Ecs Data Services WP
Vijay Reddy
No ratings yet
HADOOP
Document40 pages
HADOOP
saadiaiftikhar123
No ratings yet
No SQL
Document19 pages
No SQL
Dileep Singh
No ratings yet
Beginner's Guide to Hadoop Fundamentals
Document3 pages
Beginner's Guide to Hadoop Fundamentals
Sundaram yadav
No ratings yet
NoSQL DB
Document33 pages
NoSQL DB
AKSHAY Kumar
No ratings yet
Report On Hive of Apache
Document3 pages
Report On Hive of Apache
Gsoft Labs
No ratings yet
Bda - 10
Document7 pages
Bda - 10
deshpande.pxresh
No ratings yet
Bda Unit 5 Notes
Document23 pages
Bda Unit 5 Notes
Aishwarya Rayasam
No ratings yet
Gold Video Task Complted
Document31 pages
Gold Video Task Complted
srinivas75k
No ratings yet
Big Data Module 2
Document23 pages
Big Data Module 2
Srikanth M
No ratings yet
Big Data and Hadoop Overview
Document17 pages
Big Data and Hadoop Overview
Shreekanth Vankamamidi, PMP
100% (1)
Hadoop
Document6 pages
Hadoop
Vikas Sinha
No ratings yet
hive
Document2 pages
hive
scribd.unguided000
No ratings yet
Experiment No - 01
Document14 pages
Experiment No - 01
AYAAN Satkut
No ratings yet
FULL STACK-UNIT-III
Document56 pages
FULL STACK-UNIT-III
vedha0118
No ratings yet
What is Apache Hadoop? A guide to its core components and features
Document85 pages
What is Apache Hadoop? A guide to its core components and features
mvdurgadevi
No ratings yet
Hadoop Development Download Syllabus PDF
Document5 pages
Hadoop Development Download Syllabus PDF
shubham phulari
No ratings yet
NOSQL
Document6 pages
NOSQL
AKSHAY Kumar
No ratings yet
Practise Quiz Ccd-470 Exam (05-2014) - Cloudera Quiz Learning
Document74 pages
Practise Quiz Ccd-470 Exam (05-2014) - Cloudera Quiz Learning
ratneshkumarg
No ratings yet
What Is The Hadoop Ecosystem?
Document4 pages
What Is The Hadoop Ecosystem?
Maanit Singal
No ratings yet
Cloud Computing - Unit 3
Document38 pages
Cloud Computing - Unit 3
lightfreezzer
No ratings yet
Hadoop Interview1
Document27 pages
Hadoop Interview1
paramreddy2000
No ratings yet
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
Document47 pages
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
Ashita Punjabi
No ratings yet
Introduction to Big Data and Hadoop Framework
Document29 pages
Introduction to Big Data and Hadoop Framework
Manoj K Upadhyaya
100% (1)
SDL Module-No SQL Module Assignment No. 2: Q1 What Is Hadoop and Need For It? Discuss It's Architecture
Document6 pages
SDL Module-No SQL Module Assignment No. 2: Q1 What Is Hadoop and Need For It? Discuss It's Architecture
asdfasdf
No ratings yet
Cassandra Vs MongoDB Vs CouchDB Vs Redis Vs Riak Vs HBase Vs Couchbase Vs Hypertable Vs ElasticSearch Vs Accumulo Vs VoltDB Vs Scalaris Comparison - Software Architect Kristof Kovacs
Document11 pages
Cassandra Vs MongoDB Vs CouchDB Vs Redis Vs Riak Vs HBase Vs Couchbase Vs Hypertable Vs ElasticSearch Vs Accumulo Vs VoltDB Vs Scalaris Comparison - Software Architect Kristof Kovacs
irobot143
No ratings yet
2 BDA A6515 Hadoop
Document55 pages
2 BDA A6515 Hadoop
Sheshikanth Don
No ratings yet
Hadoop Interview Questions New
Document9 pages
Hadoop Interview Questions New
Rupali Shetty
No ratings yet
CC - Unit - 4
Document2 pages
CC - Unit - 4
Faruk Mohamed
No ratings yet
Introduction To Map Reduce Programming: By: Syed Nawaz Pasha Course Name: Big Data Analytics
Document12 pages
Introduction To Map Reduce Programming: By: Syed Nawaz Pasha Course Name: Big Data Analytics
Shushanth munna
No ratings yet
Major Project ppt1
Document11 pages
Major Project ppt1
Shushanth munna
No ratings yet
Unit Iii Data Structure
Document43 pages
Unit Iii Data Structure
Shushanth munna
No ratings yet
Big Data Analytics: By: Syed Nawaz Pasha at SR Univeristy Professional Elective-5 B.Tech Iv-Ii Sem
Document31 pages
Big Data Analytics: By: Syed Nawaz Pasha at SR Univeristy Professional Elective-5 B.Tech Iv-Ii Sem
Shushanth munna
100% (1)
(PE-V: Big Data and Analytics) : Apache Sqoop and Its Features
Document18 pages
(PE-V: Big Data and Analytics) : Apache Sqoop and Its Features
Shushanth munna
No ratings yet
File 2
Document44 pages
File 2
Shushanth munna
No ratings yet
Introduction To Map Reduce Programming: By: Syed Nawaz Pasha Course Name: Big Data Analytics
Document12 pages
Introduction To Map Reduce Programming: By: Syed Nawaz Pasha Course Name: Big Data Analytics
Shushanth munna
No ratings yet
IMP Questions
Document6 pages
IMP Questions
Shushanth munna
No ratings yet
Online House Rental System Connecting Owners and Tenants
Document5 pages
Online House Rental System Connecting Owners and Tenants
Shushanth munna
No ratings yet
Item normalization and indexing process
Document7 pages
Item normalization and indexing process
Shushanth munna
No ratings yet
Tutorial 1 - Exploring Arcgis: Objectives
Document11 pages
Tutorial 1 - Exploring Arcgis: Objectives
Moses Kaswa
No ratings yet
Gek 34124G
Document24 pages
Gek 34124G
gusgif
No ratings yet
UPS5000-S-1200 KVA Quick Guide
Document20 pages
UPS5000-S-1200 KVA Quick Guide
nobita3
No ratings yet
To Study About Various Types of Mode of
Document20 pages
To Study About Various Types of Mode of
Vikas Agrawal
No ratings yet
Problem Statements PDF
Document2 pages
Problem Statements PDF
adama tharun
No ratings yet
ServiceNow Sample Resume 3
Document7 pages
ServiceNow Sample Resume 3
Chiranjeevi Ch
No ratings yet
Ilovepdf Merged
Document10 pages
Ilovepdf Merged
Anisha Sapra
No ratings yet
WRAN CME V100R008 Feature Description
Document53 pages
WRAN CME V100R008 Feature Description
Sameer Ibraimo
No ratings yet
Presenting ServiceNow Data
Document76 pages
Presenting ServiceNow Data
Giriprasad Gunalan
No ratings yet
2012 01 20 - Twogether32 E
Document76 pages
2012 01 20 - Twogether32 E
nivan009sku9645
No ratings yet
Jonathan F. Quiles Ii - Bsit - D Human Computer Interaction 2 Assignment #1
Document6 pages
Jonathan F. Quiles Ii - Bsit - D Human Computer Interaction 2 Assignment #1
Jonathan Quiles
No ratings yet
History of Suspension Systesm
Document6 pages
History of Suspension Systesm
mangutkar_amit
67% (3)
Screenshot 2023-07-15 at 6.43.36 PM
Document1 page
Screenshot 2023-07-15 at 6.43.36 PM
Taniel Carter
No ratings yet
Pelton Turbine Operation and Design
Document22 pages
Pelton Turbine Operation and Design
Manuel Caipo
No ratings yet
Collimator Instruction Manuals PDF
Document160 pages
Collimator Instruction Manuals PDF
Nauman
100% (1)
Multiple Vacancies With MENAISCO in Jazan - Saudi Arabia
Document5 pages
Multiple Vacancies With MENAISCO in Jazan - Saudi Arabia
Viiq Corpse Grinder
No ratings yet
CN CS203 Lab Manual
Document36 pages
CN CS203 Lab Manual
Sarthak Singh Chandel
No ratings yet
A Process Story in Firozabad Cluster
Document257 pages
A Process Story in Firozabad Cluster
sudshk
No ratings yet
Directional Driller X CV
Document2 pages
Directional Driller X CV
Mino Mino
No ratings yet
Fourth Edition: Descriptive Analytics II: Business Intelligence and Data Warehousing
Document61 pages
Fourth Edition: Descriptive Analytics II: Business Intelligence and Data Warehousing
ramhan
No ratings yet
Classifications of Air Conditioning System: Based On Major Function
Document67 pages
Classifications of Air Conditioning System: Based On Major Function
jet latorre
No ratings yet
Wangkheirakpam2021 Article LinearityPerformanceAndIntermo
Document9 pages
Wangkheirakpam2021 Article LinearityPerformanceAndIntermo
sharmasamriti27
No ratings yet
Nptel Gis Ce
Document1 page
Nptel Gis Ce
kkeyan8080
No ratings yet
Civil Engineering in Indoor Substation
Document12 pages
Civil Engineering in Indoor Substation
farhan
No ratings yet
Guidelines For Passenger Services at European Airports
Document120 pages
Guidelines For Passenger Services at European Airports
JPFJ12
100% (2)
CM6650 User Manual
Document254 pages
CM6650 User Manual
acjp1979
No ratings yet
Draft SemestralWorK Aircraft2
Document7 pages
Draft SemestralWorK Aircraft2
Filip Skultety
No ratings yet
A Presentation On: Status of Construction Procedures of Nepal & E-Bidding For Contract Documentation
Document27 pages
A Presentation On: Status of Construction Procedures of Nepal & E-Bidding For Contract Documentation
Shankar Khanal
No ratings yet
Building Management System (BMS) Basic Trunkline Schematic Diagram 2
Document3 pages
Building Management System (BMS) Basic Trunkline Schematic Diagram 2
Anonymous NcB95G6Xw
No ratings yet
Advantages and Disadvantages of Technology
Document2 pages
Advantages and Disadvantages of Technology
Arim Arim
100% (1)