Hadoop installation guide

This tutorial covers installing Hadoop and setting up a single-node cluster on Ubuntu Linux. It is a short, easy walkthrough compared with the hefty tutorials already on the web. Anyway, here we go...

What we want to do

1. Install Sun Java JDK
2. Install hadoop-0.20
3. Install Hue (a file browser and job designer on HDFS)

Prerequisites

The following process was tested on Ubuntu 10.04 LTS and also on 11.04, but for stability I would personally prefer 10.04 LTS. Download 10.04 or 11.04 from this link: http://www.ubuntu.com/download/ubuntu/download

Once Ubuntu is installed, Sun Java must be installed. Let's get into it...

Open a terminal (Applications > Terminal). Copy the following commands one at a time, paste each into the terminal, and hit Enter after every step.

Step 1: Add the Canonical partner repository. To do so, paste the command below into the terminal.

    sudo add-apt-repository "deb http://archive.canonical.com/ lucid partner"

Step 2: Update the source list.

    sudo apt-get update

Step 3: Install sun-java6-jdk.

    sudo apt-get install sun-java6-jdk

Step 4: Make a quick check that Sun's JDK is correctly set up.

    java -version

Now we head to the installation of Hadoop.

Step 1: Ubuntu 10.04 is a lucid system. Open the link below and save the file in the Downloads folder.

    http://archive.cloudera.com/one-click-install/lucid/cdh3-repository_1.0_all.deb

Step 2: Install the downloaded package with the following command.

    sudo dpkg -i Downloads/cdh3-repository_1.0_all.deb

Step 3: Update the package list so the new repository is picked up.

    sudo apt-get update

Step 4: Now for the final Hadoop installation.

    sudo apt-get install hadoop-0.20-conf-pseudo

We are done with the installation of Hadoop. Now we start the daemons and verify that all the components are working fine, with the command below.

    for service in /etc/init.d/hadoop-0.20-*; do sudo $service start; done
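The start loop above can be wrapped in a small function so that the same code handles start, stop and status. This is only a sketch: the function name hadoop_services, the optional directory argument and the DRY_RUN variable are my own additions for illustration and are not part of the Cloudera packages.

```shell
#!/bin/sh
# Run one action (start, stop, status ...) on every Hadoop 0.20 init script.
# hadoop_services, its directory argument and DRY_RUN are illustrative names,
# not something shipped with CDH.
hadoop_services() {
    action="$1"
    init_dir="${2:-/etc/init.d}"
    for service in "$init_dir"/hadoop-0.20-*; do
        # Skip the unexpanded pattern when no init script matches.
        [ -e "$service" ] || continue
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Only print what would be run.
            echo "sudo $service $action"
        else
            sudo "$service" "$action"
        fi
    done
}
```

Running DRY_RUN=1 hadoop_services status only prints the commands, which is a safe way to see which daemons would be touched before running hadoop_services start for real.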

Now we can say Hadoop accomplished. If you are well used to the Hadoop commands, you can start working with Hadoop right away. If you want to say adios to the command line, or if you are a newbie to Hadoop, I would recommend using Hue, a file browser and job designer on HDFS.

Now we head to the installation of Hue (Hadoop User Environment).

Step 1: Install Cloudera Hue.

    sudo apt-get install hue

Once done with the installation of Hue, follow the next steps.

Step 2: Open a new terminal and type the command

    sudo gedit

Step 3: Now head to the file location Places > Computer > File System > etc > hive > conf > hive-site.xml (to edit, drag the file into the gedit window), then save it after editing it as described below.

Change the location of the tables by changing the value of the javax.jdo.option.ConnectionURL property:

    <property>
      <name>javax.jdo.option.ConnectionURL</name>
      <value>jdbc:derby:;databaseName=/usr/share/hue/metastore_db;create=true</value>
      <description>JDBC connect string for a JDBC metastore</description>
    </property>

Step 4: Open the gedit editor once again to provide a secret key.

    sudo gedit

Step 5: Open the /etc/hue/hue.ini configuration file (head to the location and drag the file into the gedit window). In the [desktop] section, enter a long series of random characters (30 to 60 characters is recommended) and save the file after editing it as in the code below.

    [desktop]
    secret_key=jFE93j;2[290-eiw.KEiwN2s3['d;/.q[eIW^y#e=+Iei*@Mn<qW5o

Step 6: Start Hue by running

    sudo /etc/init.d/hue start

Step 7: To use Hue, open a web browser and go to: http://localhost:8088/

If an error occurs, stop Hue and then stop the Hadoop nodes:

    sudo /etc/init.d/hue stop

Now stop the Hadoop daemons (an error occurs here, but we will rectify the misconfiguration in a while):

    for service in /etc/init.d/hadoop-0.20-*; do sudo $service stop; done

Now restart the Hadoop daemons:

    for service in /etc/init.d/hadoop-0.20-*; do sudo $service start; done

Restart Hue:

    sudo /etc/init.d/hue start

The following plugins must be installed to avoid misconfiguration in the Hue browser.

Step 8: Install Flume.

    sudo apt-get install flume

Step 9: Install HBase.

    sudo apt-get install hadoop-hbase

Step 10: Install Hadoop Pig.

    sudo apt-get install hadoop-pig
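For the secret key in step 5, you do not have to type the random characters by hand. Here is a minimal sketch, assuming /dev/urandom is available (it is on Ubuntu). The function name make_secret_key, the default length of 50 and the restriction to letters and digits are my own choices for illustration; the length sits within the 30 to 60 range recommended above.

```shell
#!/bin/sh
# Print a random string of letters and digits for the secret_key entry in hue.ini.
# make_secret_key is an illustrative helper, not something shipped with Hue.
make_secret_key() {
    length="${1:-50}"
    # Read a fixed chunk of random bytes, keep only letters and digits,
    # then cut the result down to the requested length.
    # 4096 bytes leave roughly 1000 usable characters, plenty for 30 to 60.
    head -c 4096 /dev/urandom | LC_ALL=C tr -dc 'A-Za-z0-9' | cut -c1-"$length"
}

echo "secret_key=$(make_secret_key)"
```

The printed line can be pasted under [desktop] in /etc/hue/hue.ini in place of a hand-typed key.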

Now we cross-check whether the entire configuration is working well. Open Hue in a browser at http://localhost:8088/. To check whether all the daemons are working fine, open the file browser and the job browser from the dock provided. If it works well, then we are done.

----THE END----
-Sriram