Professional Documents
Culture Documents
Aim:
Introduction to Hive:
Hive is a data warehouse infrastructure tool to process structured data in Hadoop. It resides on
top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
Initially Hive was developed by Facebook, later the Apache Software Foundation took it up and
developed it further as an open source under the name Apache Hive. It is used by different
companies. For example, Amazon uses it in Amazon Elastic MapReduce.
Hive is not
o A relational database
o A design for On-line Transaction Processing (OLTP)
o A language for real-time queries and row-level updates
Installation of Hive:
wget https://archive.apache.org/dist/hive/hive-0.13.1/apache-hive-0.13.1-bin.tar.gz
STEP 2: Extract the downloaded Hive-0.13.1. files using the code given below.
STEP 4: Change the mode of Hive-0.13.1. using the command given below.
STEP 5: Open. bashrc file using vi editor (or) nano editor using the command given below.
STEP 6: Copy the export 2 line commands and paste in the .bashrc file at the end.
vi $HOME/.bashrc
export HIVE_PREFIX=/usr/hive
export PATH=$PATH:$HIVE_PREFIX/bin
exec bash
Running of Hive:
Conclusion:
Thus, Hive platform is successfully installed over hadoop and successfully executed in
Ubuntu.