232056-Homework 5

Uploaded by

Prabha K

QUERYING DATA USING HIVE

01. What is the metastore in Hive?

The Hive metastore is the component of Hive that stores the
system catalog: the metadata about Hive tables, their columns,
and their partitions. This metadata is usually stored in a
traditional RDBMS.

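02. Where does the data of a Hive table get stored?

The data of a Hive table is stored in HDFS, by default under
the warehouse directory set by hive.metastore.warehouse.dir
(/user/hive/warehouse unless it has been changed). Only the
database and table metadata is kept in the metastore, which is a
database-backed or file-backed store that enables easy data
abstraction and discovery.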
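
To check where a particular table's data actually lives, the table's metadata can be inspected from the Hive shell. The following is a minimal sketch in HiveQL; the table name transaction_details is only an example and is assumed to exist already.

```sql
-- Assumption: a table named transaction_details already exists.
-- The "Location:" row of the output shows the HDFS directory that holds its data.
DESCRIBE FORMATTED transaction_details;

-- Print the configured warehouse root directory for managed tables.
SET hive.metastore.warehouse.dir;
```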

03. Why does Hive not store metadata in HDFS?

Hive stores metadata in the metastore using an RDBMS
instead of HDFS. An RDBMS is chosen to achieve low latency,
because HDFS read/write operations are time-consuming.

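04. What is the difference between local and remote metastore?

In local mode, the metastore service still runs in the same JVM
as the Hive service, but it connects to a database running in a
separate JVM (on the same or another machine). In remote mode,
the metastore service runs in its own JVM, separate from the
Hive service, and Hive clients connect to it over Thrift.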

05. What is the default database provided by Apache Hive for the metastore?

By default, the metastore service runs in the same JVM as the
Hive service and uses an embedded Derby database stored on the
local file system. In this mode both the metastore service and
the Hive service run in the same JVM and share the embedded
Derby database.

06. What is the difference between an external table and a managed table?

Managed tables are Hive-owned tables: the entire lifecycle of
the table's data is managed and controlled by Hive, so dropping
a managed table deletes both its metadata and its data. External
tables are tables where Hive is only loosely coupled to the
data: dropping an external table removes the metadata but leaves
the data files in place. All write operations to managed tables
are performed using Hive SQL commands.
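
The two kinds of table differ only slightly at creation time. The sketch below is illustrative: the table names, columns, and the HDFS path /data/sales are hypothetical, not taken from the assignment.

```sql
-- Managed table: Hive owns the data files.
-- DROP TABLE managed_sales would remove both the metadata and the data.
CREATE TABLE managed_sales (id INT, amount FLOAT)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

-- External table: Hive only records where the data lives.
-- DROP TABLE external_sales would remove the metadata and leave /data/sales untouched.
CREATE EXTERNAL TABLE external_sales (id INT, amount FLOAT)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/data/sales';
```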

07. Is it possible to change the default location of a managed table?

Yes. The default location of a managed table can be changed
by specifying the clause LOCATION '<hdfs_path>' when the table
is created.
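
A minimal sketch of the LOCATION clause follows. The table name, the paths, and the namenode address are hypothetical, and whether a custom location is accepted for a managed table can depend on the Hive version and its configuration.

```sql
-- Create a managed table whose data lives outside the default warehouse directory.
CREATE TABLE managed_orders (id INT, amount FLOAT)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/custom/warehouse/managed_orders';

-- Point an existing table at a different directory.
-- This only updates the metadata; it does not move existing data files.
ALTER TABLE managed_orders
SET LOCATION 'hdfs://namenode:8020/new/path/managed_orders';
```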

08. What is a partition in Hive?

Partitioning in Hive means dividing a table into parts based
on the values of a particular column, such as date, course,
city, or country.

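09. Why do we perform partitioning in Hive?

We partition tables in Hive because the data is then stored in
slices, one per partition value. A query that filters on the
partition column reads only the relevant slices, so the query
response time becomes faster.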
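
As an illustration of why partitioned data is faster to query, the sketch below uses a hypothetical table sales_by_country partitioned by country. The table and the value 'IN' are assumptions made for the example.

```sql
-- Each distinct country value gets its own subdirectory under the table's location.
CREATE TABLE sales_by_country (id INT, amount FLOAT)
PARTITIONED BY (country STRING);

-- List the partitions that currently exist.
SHOW PARTITIONS sales_by_country;

-- Filtering on the partition column lets Hive read only the matching
-- partition directory instead of scanning the whole table.
SELECT SUM(amount) FROM sales_by_country WHERE country = 'IN';
```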

10. What is dynamic partitioning and when is it used?

With dynamic partitioning, a single INSERT loads data into a
partitioned table and Hive decides which partition each row
belongs to from the value of the partition column, instead of
the partition being named in the statement. It is used when
loading data from an existing non-partitioned table into a
partitioned one, which improves sampling and therefore
decreases query latency.
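
For contrast, this is roughly what loading looks like without dynamic partitioning: every partition value has to be written out in its own statement (a static partition insert). The tables raw_sales and sales_by_country and the country codes are hypothetical.

```sql
-- Static partitioning: the partition value is fixed in the statement,
-- so one INSERT is needed per partition.
INSERT OVERWRITE TABLE sales_by_country PARTITION (country = 'IN')
SELECT id, amount FROM raw_sales WHERE country = 'IN';

INSERT OVERWRITE TABLE sales_by_country PARTITION (country = 'US')
SELECT id, amount FROM raw_sales WHERE country = 'US';
```

With many distinct values this quickly becomes impractical, which is the case dynamic partitioning is meant for.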

11. Suppose you create a table that contains details of all the
transactions done by customers in the year 2022:
CREATE TABLE transaction_details (cust_id INT, amount
FLOAT, month STRING, country STRING) ROW
FORMAT DELIMITED FIELDS TERMINATED BY ',';
After inserting 50,000 tuples into this table, you want to
know the total revenue generated for each month, but Hive
is taking too much time to process the query. List the steps
that you would take to solve this.
1. Create a partitioned table, say partitioned_transaction:

CREATE TABLE partitioned_transaction (cust_id INT,
amount FLOAT, country STRING) PARTITIONED BY (month
STRING) ROW FORMAT DELIMITED FIELDS TERMINATED
BY ',';

2. Enable dynamic partitioning in Hive:

SET hive.exec.dynamic.partition = true;

SET hive.exec.dynamic.partition.mode = nonstrict;

3. Transfer the data from the non-partitioned table into the
newly created partitioned table:

INSERT OVERWRITE TABLE partitioned_transaction
PARTITION (month) SELECT cust_id, amount, country,
month FROM transaction_details;
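
Once the data has been loaded into the partitioned table, the monthly revenue can be computed against it. The sketch below assumes the partitioned_transaction table from the steps above; the alias total_revenue is only illustrative.

```sql
-- Optional check: one partition per month should now exist.
SHOW PARTITIONS partitioned_transaction;

-- Total revenue generated for each month.
SELECT month, SUM(amount) AS total_revenue
FROM partitioned_transaction
GROUP BY month;
```

A query for a single month, for example with a WHERE filter on month, would then read only that month's partition.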
