EX1-Installation of Hadoop
EX1-Installation of Hadoop
Algorithm
Step 1: Download latest version of java from following website.
https://www.oracle.com/in/java/technologies/downloads/
Step 2: Install the java in C folder and set following PATH and Environmental variable.
Go to setting -> search-> System environment variable.
PATH
C:\jdk-20.0.2\bin
JAVA_HOME
C:\jdk-20.0.2\bin
Step 3: Restart the system. After restarting the system type following command in the
command prompt to check whether java is properly installed in your system.
Java -version
Javac -version
Step 4: Download latest version of version of Hadoop. Latest version of hadoop can be
downloaded from the following official website.
https://hadoop.apache.org/releases.html
Step 5: Install Hadoop in C folder. Now you can see following folder will be installed in
C drive.
C:\hadoop-3.3.6
Step 6: Now open the hadoop-3.3.6 folder and you can see following folders inside the
hadoop-3.3.6 folder.
Step 7: Create a folder data inside hadoop-3.3.6 folder. Inside folder data create two
folders namenode and datanode.
<property>
<name>fs.defaultersFS</name>
<value>hdfs://localhost:900</value>
</property>
ii) Open mapred-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>
<property>
<name>mapreduce.framework.name</name>
<value> yarn </value>
</property>
iii) Open yarn-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value> mapreduce_shuffle</value>
<property>
<property>
<name>yarn.nodemanager.auxservices.mapreduce.shuffle</name>
<value>org.apache.hadoop.mapred.shufflehandler</value>
<property>
iv) Open hdfs-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name> dfs.namenode.name.dir</name>
<value>C:\hadoop-3.3.6\data\namenode</value>
</property>
<property>
<name> dfs.datanode.name.dir</name>
<value>C:\hadoop-3.3.6\data\datanode</value>
</property>
iv) Open hadoop-env.cmd file in Notepad++ and changed following script as follows.
JAVA_HOME=%JAVA_HOME%
Can be changed as
JAVA_HOME=C:\jdk-20.0.2
Step 10: Create following paths and environmental variables for hadoop.
PATH
C:\hadoop-3.3.6\bin
C:\hadoop-3.3.6\sbin
HADOOP_HOME
C:\hadoop-3.3.6\bin
Step 11: Restart the system. After restarting the system type following command to check
whether hadoop is installed correctly.
Step 11: Open command prompt. In command prompt change directory to Hadoop 3.3.6/sbin
Type Start-all.cmd in command prompt and Appearance of following screen shows hadoop
was installed successfully.