0% found this document useful (0 votes)

11 views8 pages

CC EXP 8 VBHV

Uploaded by

online compiler

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views8 pages

CC EXP 8 VBHV

Uploaded by

online compiler

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Vaibhav Gupta 21BCS3440

Aim: Install Hadoop single node cluster and run simple applications likeword
count.

Hadoop framework is well comportable in the Linux environment but for the users who
are not familiar with Linux environment but want to use the hadoop framework can be
make use of this article. This article is aim to Install hadoop single node cluster and run
simple application like wordcout.

Procedure:
1. Install Java
2. Configure and install hadoop
3. Test hadoop installation
4. Create wordcount program
5. Input file to mapreduce
6. Display the output

I. JAVA Installation
1. Go to official Java Downloading page
https://www.oracle.com/java/technologies/javase-jre8-downloads.html
1. After downloading java, run the jdk-8u241-windows-x64.exe file
2. Follow the instructions and click next.
3. After finishing the installation it is need to set Java environment variable
4. Go to Start->Edit the System environment variable->Environment
variable
5. Then Click new and enter variable name as “JAVA_HOME”
6. In the value field Enter the java path such as
“C:\Java\jdk1.8.0_241”(Consider your installation folder)

Fig-3.1

7. Go to path and click edit then type “%JAVA_HOME%\bin”

Fig-3.2
8 . Then click Ok and Go to Command Prompt
9. Type “Java -version”. If it prints the installed version of java, now java
successfully installed in your System.

Fig-3.3

II Configuring And Installing Hadoop

1. Download Hadoop 2.8.0
from http://archive.apache.org/dist/hadoop/core//hadoop-2.8.0/hadoop-2.8.0.tar.gz)
2. Extract the tar file ( in my case I used 7-zip to extract the file and I stored the
extracted file in the D:\hadoop)
3. After finishing the extraction it is need to set Hadoop environment variable
4. Go to Start->Edit the System environment variable->Environment variable
5. Then Click new and enter variable name as “HADOOP_HOME”
6. In the value field Enter the java path such as “D:\hadoop”(Consider your
installation folder)

Fig-3.4

7. Go to path and click edit then type “%HADOOP_HOME%\bin”

Fig-3.6

8. Now we have to configure the hadoop.

9. Go to D:/hadoop/etc/hadoop/.. folder, find the below mentioned files andpaste
the following.

i. core-site.xml
<configuration> <property> <name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value> </property>
</configuration>

ii. Rename " mapred- site. xml. template " to " mapred- site. xml " and
edit this fileD:/Hadoop/etc/hadoop/mapred-site.xml, paste below xml
paragraph and save this file.
<configuration> <property> &https://www.linkedin.com/redir/phishing-
page?url=lt%3Bname%26gt%3Bmapreduce%2eframework%2ename</name>
<value>yarn</value> </property>
</configuration>

iii. Create folder "data" under "D:\Hadoop"

 Create folder "datanode" under "D:\Hadoop\data"
 Create folder "namenode" under "D:\Hadoop\data" data

iv. hdfs-site.xml
<configuration> <property> <name>dfs.replication</name>
<value>1</value> </property> <property>
<name>dfs.namenode.name.dir</name>
<value>D:\hadoop\data\namenode</value> </property> <property>
<name>dfs.datanode.data.dir</name>
<value>D:\hadoop\data\datanode</value> </property>
</configuration>

v. yarn-site.xml
<configuration> <property> <name>yarn.nodemanager.auxservices</name>
<value>mapreduce_shuffle</value> </property>
<property>
<name>yarn.nodemanager.auxservices.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value> </property>
</configuration>

vi. Edit file D:\Hadoop\etc\hadoop\hadoop-env.cmd by closing the

command line "JAVA_HOME=%JAVA_HOME%" instead of set
"JAVA_HOME= C:\Java\jdk1.8.0_241" (if your java file in Program Files
the instead of give Progra~1 otherwise you will get JAVA_HOME
incorrectly set error)
vii. Download file Hadoop
Configuration.zip https://github.com/Prithiviraj2503/hadoop-installation-
windows

viii. Delete file bin on D:\Hadoop\bin and replace it by the bin file of
Downloaded configuration file (from Hadoop Configuration.zip).

ix. Open cmd and typing command "hdfs namenode – format " .You
will see through command prompt which tasks are processing, after
competeation you will get a massage like namenode format succesfully and
shutdown message

hdfs namenode –format

III. Testing Hadoop Installation

1. Open Cmd and type the following “Hadoop -version”

Fig-3.7
2. To start the hadoop locate to “D:\hadoop\sbin” via command prompt andpress
start-all.cmd

Fig-3.8
Now, you can see the namenode, datanode and yarn engines getting start,

Fig-3.9

3. Now type “jps”. JPS (Java Virtual Machine Process Status Tool) is a command is
used to check all the Hadoop daemons like NameNode, DataNode,
ResourceManager, NodeManager etc.

Fig-3.10
4. Open: http://localhost:8088 in any browser

Fig-3.11

5. Open: http://localhost:50070 in any browser

Fig-3.12

Now hadoop succesfully installed in your System.

IV. Simple WordCount Program

1) After successful hadoop installation we need to create an directory in the
hadoop file system

2) Start the hadoop via command prompt $ start-all.cmd

3) By using $jps command Ensure hadoop nodes are running

4) To create a directory, use: $ hadoop fs –mkdir /inputdir

5) To input a file within a directory, use: $ hadoop fs –

put D:/input_file.txt/inputdir
6) To ensure wether your file succesfully imported, use: $ hadoop fs –ls
/inputdir/

7) To view the content of the file, use: $ hadoop dfs –cat

/inputdir/input_file.txt
Link for input file : https://github.com/Prithiviraj2503/hadoop-installation-
windows

Fig-3.13

8) Now appy mapreduce program to the input file. We have

a mapReduceClient.jar which contain java mapper and reducer programs. After
applying the jar file you can see the task performed in the mapreduce phase.All the
resuts of completed tasks will be printed in the command prompt.
Link for mapReduceClient.jar : https://github.com/Prithiviraj2503/hadoop-
installation-windows
Fig-3.14
9) After completed the mapreduce tasks the output will be stored inthe
output_dir directory To see the output, use: $ hadoop dfs –cat
/output_dir/

Fig-3.15

10) To stop the hadoop type $stop-all.cmd

Now the hadoop single node cluster was installed succesfully and the simple
word count program were executed succesfully in your windows system.
Fig-3.16

Analysis:
This provides a clear, step-by-step guide for installing and configuring Hadoop on a
Windows system, along with running a basic WordCount program. It covers essential
tasks such as setting up Java, configuring Hadoop, testing the installation, and executing
the WordCount program. The instructions are detailed, including screenshots for clarity.
However, it could benefit from explanations of Hadoop concepts, troubleshooting tips,
and considerations for security. Overall, it's a useful resource for beginners aiming to set
up Hadoop on Windows.

Conclusion:
In this experiment, we installed and ran Hadoop on a Windows environment, complete
with executing a simple WordCount program. By following the detailed instructions
provided, users can successfully set up their Hadoop single-node cluster and perform
basic MapReduce tasks. While the guide covers essential steps and includes helpful
visuals, there's room for improvement in terms of explaining Hadoop concepts, offering
troubleshooting guidance, and addressing security considerations. Nonetheless, it serves
as a valuable resource for beginners seeking to explore Hadoop in a Windows setting.

Result:
Installed and ran Hadoop on Windows, including executing a WordCount program, and
explained in depth the concepts and addressing potential issues.

BDA Lab Manual R22
0% (1)
BDA Lab Manual R22
70 pages
Exp 5 - 9
No ratings yet
Exp 5 - 9
25 pages
Da Lab Record - Merged
No ratings yet
Da Lab Record - Merged
48 pages
Bda Exp1 Chinmay
No ratings yet
Bda Exp1 Chinmay
13 pages
Big Data Analytics lab-JD
No ratings yet
Big Data Analytics lab-JD
49 pages
DA Lab EXERCISE
No ratings yet
DA Lab EXERCISE
24 pages
Big Data Manual
No ratings yet
Big Data Manual
19 pages
Bda Lab Record
No ratings yet
Bda Lab Record
60 pages
BDA Lab Manual 2023-2024
No ratings yet
BDA Lab Manual 2023-2024
54 pages
Bda Lab Manual
No ratings yet
Bda Lab Manual
42 pages
Hadoop Installation
No ratings yet
Hadoop Installation
6 pages
Hadoopfile PP
No ratings yet
Hadoopfile PP
83 pages
Hadoop Installation For Windows
No ratings yet
Hadoop Installation For Windows
10 pages
Final Copy - BDA LAB Record
No ratings yet
Final Copy - BDA LAB Record
44 pages
Step 1: Download Binary Package
No ratings yet
Step 1: Download Binary Package
50 pages
Install Hadoop-2.6.0 On Windows10
No ratings yet
Install Hadoop-2.6.0 On Windows10
8 pages
Hadoop On Windows
No ratings yet
Hadoop On Windows
13 pages
Big Data
No ratings yet
Big Data
32 pages
Big Data Manual Ai
No ratings yet
Big Data Manual Ai
33 pages
Hadoop 1
No ratings yet
Hadoop 1
39 pages
Practical N0.2 AIM: Install Hadoop Hadoop Installation On Windows 10
No ratings yet
Practical N0.2 AIM: Install Hadoop Hadoop Installation On Windows 10
12 pages
HDFS Installation Steps
No ratings yet
HDFS Installation Steps
17 pages
BDA Lab Manual by T.Naga Praveena
No ratings yet
BDA Lab Manual by T.Naga Praveena
40 pages
Hadoop Installation
No ratings yet
Hadoop Installation
17 pages
Worksheet3.1 CC
No ratings yet
Worksheet3.1 CC
8 pages
Experiment No. 3.1: 1) JAVA-Java JDK 2) HADOOP-Hadoop Package - Step 1: Verify The Java Installed
No ratings yet
Experiment No. 3.1: 1) JAVA-Java JDK 2) HADOOP-Hadoop Package - Step 1: Verify The Java Installed
6 pages
Hadoop Installation Process
No ratings yet
Hadoop Installation Process
16 pages
Big Data & Analytics Lab Manual
No ratings yet
Big Data & Analytics Lab Manual
51 pages
Computer Science & Engineering: Department of
No ratings yet
Computer Science & Engineering: Department of
6 pages
Bda Record
No ratings yet
Bda Record
83 pages
A Report On Distributed Computing
No ratings yet
A Report On Distributed Computing
25 pages
New Bda Manual
No ratings yet
New Bda Manual
80 pages
Bda Manual
No ratings yet
Bda Manual
80 pages
BDA Lab Manual
No ratings yet
BDA Lab Manual
26 pages
Big Data
No ratings yet
Big Data
28 pages
Hadoop Record 2024-Final
No ratings yet
Hadoop Record 2024-Final
59 pages
Big Data Akshat
No ratings yet
Big Data Akshat
57 pages
Lab Manual
No ratings yet
Lab Manual
34 pages
Big Data File
No ratings yet
Big Data File
16 pages
Hbase Installationn
No ratings yet
Hbase Installationn
12 pages
BDA Lab Manual
No ratings yet
BDA Lab Manual
62 pages
Setup Hadoop On Windows 10 Machines
No ratings yet
Setup Hadoop On Windows 10 Machines
4 pages
Data Science
No ratings yet
Data Science
82 pages
Big Datalab
No ratings yet
Big Datalab
4 pages
Anushka Shetty 35
No ratings yet
Anushka Shetty 35
34 pages
Bda Manual
No ratings yet
Bda Manual
33 pages
Lab Manual
No ratings yet
Lab Manual
27 pages
HADOOP PPT
No ratings yet
HADOOP PPT
21 pages
Linux Notes For Professionals
100% (1)
Linux Notes For Professionals
65 pages
Hadoop Installation and Configuration
No ratings yet
Hadoop Installation and Configuration
16 pages
Hadoop Lab Notes: Nicola Tonellotto November 15, 2010
No ratings yet
Hadoop Lab Notes: Nicola Tonellotto November 15, 2010
9 pages
Bda 2
No ratings yet
Bda 2
25 pages
BIG Data File
No ratings yet
BIG Data File
28 pages
CCS334-BDA LAB MANUAL Final
No ratings yet
CCS334-BDA LAB MANUAL Final
46 pages
Amc Engineering College: Dept. of Computer Science and Engineering
No ratings yet
Amc Engineering College: Dept. of Computer Science and Engineering
6 pages
Big Data Lab Record
No ratings yet
Big Data Lab Record
30 pages
Bigdatamanual
No ratings yet
Bigdatamanual
45 pages
Survey Master Quick Guide 20160219
0% (1)
Survey Master Quick Guide 20160219
20 pages
Install and Run Hadoop On Windows
No ratings yet
Install and Run Hadoop On Windows
29 pages
ACP Users Guide
No ratings yet
ACP Users Guide
586 pages
Practical-1: Aim: Hadoop Configuration and Single Node Cluster Setup and Perform File Management Task in
No ratings yet
Practical-1: Aim: Hadoop Configuration and Single Node Cluster Setup and Perform File Management Task in
61 pages
11th CS ALP 100 MCQs Test KEY
No ratings yet
11th CS ALP 100 MCQs Test KEY
5 pages
Cadd Viva
No ratings yet
Cadd Viva
5 pages
CS9077
100% (1)
CS9077
2 pages
Usage Guide
No ratings yet
Usage Guide
6 pages
Install Hadoop-2.6.0 On Windows10
No ratings yet
Install Hadoop-2.6.0 On Windows10
8 pages
WOCAT FAO Tutorial QGIS
No ratings yet
WOCAT FAO Tutorial QGIS
51 pages
How To Guide DACIA MEDIA NAV TB4 v3 ENG
No ratings yet
How To Guide DACIA MEDIA NAV TB4 v3 ENG
13 pages
Resources PDF Trainings EC-2245-Mainframe-MVS REXX
No ratings yet
Resources PDF Trainings EC-2245-Mainframe-MVS REXX
4 pages
AICT (Outline)
No ratings yet
AICT (Outline)
5 pages
Spotlight Functionality Details
No ratings yet
Spotlight Functionality Details
7 pages
Performance and Workflow
No ratings yet
Performance and Workflow
12 pages
SC2006 Notes
No ratings yet
SC2006 Notes
75 pages
An Introduction To Firewalls
No ratings yet
An Introduction To Firewalls
21 pages
Exploring The Use of Metrics For Software Assurance
No ratings yet
Exploring The Use of Metrics For Software Assurance
69 pages
Bhagyesh 1
No ratings yet
Bhagyesh 1
6 pages
Final SAP Mcq2 (21june)
No ratings yet
Final SAP Mcq2 (21june)
54 pages
Thesis For Computer Engineering
100% (2)
Thesis For Computer Engineering
5 pages
Kali Linux 2.0 Top 10 Post Install Tips
No ratings yet
Kali Linux 2.0 Top 10 Post Install Tips
1 page
AhoogaModularDisplay - M02 Manual 2023
No ratings yet
AhoogaModularDisplay - M02 Manual 2023
8 pages
A Comparative Study of Mobile Phone's Operating Systems
No ratings yet
A Comparative Study of Mobile Phone's Operating Systems
7 pages
Microsoft MB 820 Vceexamstest Actual Questions by Lott 22 07 2024 7qa
No ratings yet
Microsoft MB 820 Vceexamstest Actual Questions by Lott 22 07 2024 7qa
12 pages
Session 22-Agent Determination Errors
No ratings yet
Session 22-Agent Determination Errors
21 pages
Supporting-The-Understanding-And-Comparison-Of-Low-Code-Development-Platforms
No ratings yet
Supporting-The-Understanding-And-Comparison-Of-Low-Code-Development-Platforms
8 pages
User Manual - IS - CDC - 2 - Operations and Commands (Guia)
No ratings yet
User Manual - IS - CDC - 2 - Operations and Commands (Guia)
61 pages
Migrate VOTE and OCR
No ratings yet
Migrate VOTE and OCR
9 pages
Cracking The Coding Interview-Gayle McDowell
No ratings yet
Cracking The Coding Interview-Gayle McDowell
1 page
BPI eADA Enrollment Guide
No ratings yet
BPI eADA Enrollment Guide
15 pages
2020 06 21 10.45.02
No ratings yet
2020 06 21 10.45.02
2 pages
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
From Everand
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
Dr. Hidaia Mahmood Alassouli
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

CC EXP 8 VBHV

Uploaded by

CC EXP 8 VBHV

Uploaded by

Vaibhav Gupta 21BCS3440

7. Go to path and click edit then type “%JAVA_HOME%\bin”

II Configuring And Installing Hadoop

7. Go to path and click edit then type “%HADOOP_HOME%\bin”

8. Now we have to configure the hadoop.

iii. Create folder "data" under "D:\Hadoop"

vi. Edit file D:\Hadoop\etc\hadoop\hadoop-env.cmd by closing the

hdfs namenode –format

III. Testing Hadoop Installation

5. Open: http://localhost:50070 in any browser

Now hadoop succesfully installed in your System.

IV. Simple WordCount Program

2) Start the hadoop via command prompt $ start-all.cmd

3) By using $jps command Ensure hadoop nodes are running

4) To create a directory, use: $ hadoop fs –mkdir /inputdir

5) To input a file within a directory, use: $ hadoop fs –

7) To view the content of the file, use: $ hadoop dfs –cat

8) Now appy mapreduce program to the input file. We have

10) To stop the hadoop type $stop-all.cmd

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.