Professional Documents
Culture Documents
https://drive.google.com/drive/folders/1F0CJw-UQNZGzQ3igbO_
mvOUFVhn-s721?usp=sharing
Java Installation
Step 1:
Install java jdk files in default location, just press the
next button.
Step 2:
To Install the jre files make sure you have to create a
new folder in C drive.
Step 3:
Now create a new folder named “Java” by clicking on
“make new folder”
Step 4 :
By clicking on next you can see that the installation
process has been Started
Step 5:
Now click on program files
Step 6:
Now move this jdk file to folder you have created in C
drive named java
Step 7:
Step 11:
Now let's check java is functioning correctly
Open cmd
Step 1:
Step 2:
Next step is to set the environment variable for
hadoop. Before that it is need to set the configuration
of hadoop
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
</property>
Here We have added one property which is the file
location that is fs.default file system
And the local host location that is 9000
Step 4:
Now edit mapred-site.xml
Here we also need to add some properties
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapreduce.application.classpath</name>
<value>%HADOOP_HOME%/share/hadoop/mapreduce/*,%HADOOP_HO
ME%/share/hadoop/mapreduce/lib/*,%HADOOP_HOME%/share/had
oop/common/*,%HADOOP_HOME%/share/hadoop/common/lib/*,%HA
DOOP_HOME%/share/hadoop/yarn/*,%HADOOP_HOME%/share/hadoo
p/yarn/lib/*,%HADOOP_HOME%/share/hadoop/hdfs/*,%HADOOP_H
OME%/share/hadoop/hdfs/lib/*</value>
</property>
Step 5:
Now update yarn-site.xml
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class
</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value>localhost:8030</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value>localhost:8032</value>
</property>
<property>
<name>yarn.resourcemanager.webapp.address</name>
<value>localhost:8088</value>
</property>
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value>localhost:8031</value>
</property>
<property>
<name>yarn.resourcemanager.admin.address</name>
<value>localhost:8033</value>
</property>
<property>
<name>yarn.nodemanager.env-whitelist</name>
<value>JAVA_HOME,HADOOP_COMMON_HOME,HADOOP_HDFS_HOME,HADOOP
_CONF_DIR,CLASSPATH_PREPEND_DISTCACHE,HADOOP_YARN_HOME,HADO
OP_MAPRED_HOME</value>
</property>
Step 6:
Step 7:
Now copy the location of namenode and datanode
datanode location is C:\hadoop\data\datanode
Namenode location is C:\hadoop\data\namenode
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<!-- <value>file:///DIRECTORY 1 HERE</value> -->
<value>C:\hadoop\data\namenode</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<!-- <value>file:///DIRECTORY 2 HERE</value> -->
<value>C:\hadoop\data\datanode</value>
</property>
Step 8:
Step 9:
Now go to environment variables for setting home
and path for hadoop
Step: 11