0% found this document useful (0 votes)
117 views

EX1-Installation of Hadoop

The document provides steps to install Hadoop Apache in standalone mode. It involves downloading Java and the latest version of Hadoop, setting environment variables and PATH, creating folders for namenode and datanode data, editing configuration files like core-site.xml and hdfs-site.xml, and starting all processes using start-all.cmd. Successful startup of all Hadoop processes confirms that Hadoop is installed correctly.

Uploaded by

anand.avcs088
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
117 views

EX1-Installation of Hadoop

The document provides steps to install Hadoop Apache in standalone mode. It involves downloading Java and the latest version of Hadoop, setting environment variables and PATH, creating folders for namenode and datanode data, editing configuration files like core-site.xml and hdfs-site.xml, and starting all processes using start-all.cmd. Successful startup of all Hadoop processes confirms that Hadoop is installed correctly.

Uploaded by

anand.avcs088
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

Installation of Hadoop Apache

Aim: To install Hadoop apache in standalone mode.

Algorithm
Step 1: Download latest version of java from following website.
https://www.oracle.com/in/java/technologies/downloads/
Step 2: Install the java in C folder and set following PATH and Environmental variable.
Go to setting -> search-> System environment variable.

PATH
C:\jdk-20.0.2\bin
JAVA_HOME
C:\jdk-20.0.2\bin

Step 3: Restart the system. After restarting the system type following command in the
command prompt to check whether java is properly installed in your system.
Java -version
Javac -version

Step 4: Download latest version of version of Hadoop. Latest version of hadoop can be
downloaded from the following official website.
https://hadoop.apache.org/releases.html
Step 5: Install Hadoop in C folder. Now you can see following folder will be installed in
C drive.
C:\hadoop-3.3.6
Step 6: Now open the hadoop-3.3.6 folder and you can see following folders inside the
hadoop-3.3.6 folder.

Step 7: Create a folder data inside hadoop-3.3.6 folder. Inside folder data create two
folders namenode and datanode.

Step 8: Install notepad+ + from following link.


https://notepad-plus-plus.org/downloads/

Step 9: Go the following folder and edit following files as follows.


C:\hadoop-3.3.6\etc\hadoop
1.Core-site.xml
2. mapred-site.xml
3.yarn-site.xml
4. hdfs-site.xml
5.hadoop-env.cmd
i) Open Core-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>

<property>
<name>fs.defaultersFS</name>
<value>hdfs://localhost:900</value>
</property>

ii) Open mapred-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>
<property>
<name>mapreduce.framework.name</name>
<value> yarn </value>
</property>
iii) Open yarn-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value> mapreduce_shuffle</value>
<property>

<property>
<name>yarn.nodemanager.auxservices.mapreduce.shuffle</name>
<value>org.apache.hadoop.mapred.shufflehandler</value>
<property>
iv) Open hdfs-site.xml file in Notepad++ and insert following script between
<Configuration> </configuration>

<property>
<name>dfs.replication</name>
<value>1</value>
</property>

<property>
<name> dfs.namenode.name.dir</name>
<value>C:\hadoop-3.3.6\data\namenode</value>
</property>

<property>
<name> dfs.datanode.name.dir</name>
<value>C:\hadoop-3.3.6\data\datanode</value>
</property>
iv) Open hadoop-env.cmd file in Notepad++ and changed following script as follows.
JAVA_HOME=%JAVA_HOME%
Can be changed as
JAVA_HOME=C:\jdk-20.0.2
Step 10: Create following paths and environmental variables for hadoop.
PATH
C:\hadoop-3.3.6\bin
C:\hadoop-3.3.6\sbin

HADOOP_HOME
C:\hadoop-3.3.6\bin
Step 11: Restart the system. After restarting the system type following command to check
whether hadoop is installed correctly.

Step 11: Open command prompt. In command prompt change directory to Hadoop 3.3.6/sbin
Type Start-all.cmd in command prompt and Appearance of following screen shows hadoop
was installed successfully.

Result: Hadoop is installed successfully.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy