Create A Multi-Node Cluster For Distributed Hadoop Environment
Software specifications
Ubuntu 18
Hadoop 2.5.0 (Cloudera CDH 5.3.2)
Java 11
****************************************************************
Step 2 Edit the /etc/hosts and /etc/hostname files (On all nodes)
Code
sudo nano /etc/hosts
sudo nano /etc/hostname
#(In /etc/hosts, add one line per node, IP address first and hostname second)
192.168.75.93 master
192.168.75.85 slave1
192.168.75.156 slave2
#(Remove the loopback address)
In /etc/hostname, change the name to master, slave1, or slave2 on the respective node.
(Restart the system)
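After the restart, it can help to confirm that every node name is present in the hosts file before moving on. The sketch below is only an illustration: the function name check_hosts_file is hypothetical, and it assumes the three node names used above (master, slave1, slave2).

```shell
# Hypothetical helper: report whether a hosts-format file contains
# an entry for each cluster node name (master, slave1, slave2).
check_hosts_file() {
    hosts_file="$1"
    status=0
    for name in master slave1 slave2; do
        # Strip comments, then look for the name in any field after the IP.
        if awk -v n="$name" '
            { sub(/#.*/, ""); for (i = 2; i <= NF; i++) if ($i == n) found = 1 }
            END { exit !found }' "$hosts_file"; then
            echo "ok: $name"
        else
            echo "missing: $name"
            status=1
        fi
    done
    return "$status"
}
```

Run it as check_hosts_file /etc/hosts on each node; a non-zero exit status means at least one name has no entry.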
****************************************************************
Step 3 Set up the SSH connection (On all nodes)
Code
sudo apt-get install openssh-server
ssh-keygen -b 4096
--------------------------------------------------------------
ssh-copy-id 192.168.75.156
ssh-copy-id 192.168.75.93
ssh-copy-id 192.168.75.85
-----------------------------------------------------------
#To connect to the other nodes:
ssh 192.168.75.85
ssh 192.168.75.156 (Press Ctrl+Shift+T to open multiple terminal tabs)
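The key-copy and login commands above can be wrapped in a loop so that no node is skipped. This is a minimal sketch, not a required part of the setup: the function name distribute_key and the DRY_RUN switch are hypothetical, and the node list assumes the three addresses from the hosts step.

```shell
# Node addresses from the hosts step (adjust to your own network).
NODES="192.168.75.93 192.168.75.85 192.168.75.156"

# Hypothetical helper: copy the public key to every node, then test
# key-based login. Set DRY_RUN=1 to print the commands without running them.
distribute_key() {
    for node in $NODES; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "ssh-copy-id $node"
            echo "ssh -o BatchMode=yes $node hostname"
        else
            ssh-copy-id "$node"
            # BatchMode=yes makes ssh fail instead of asking for a password,
            # so a failure here means the key was not installed correctly.
            ssh -o BatchMode=yes "$node" hostname
        fi
    done
}
```

Calling distribute_key with DRY_RUN=1 first shows what would be executed; without it, each node should print its hostname with no password prompt.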
Code
sudo wget http://archive /cloudera…………………..
sudo tar -xzvf hadoop-2.5.0-cdh5.3.2.tar.gz
sudo mv hadoop-2.5.0-cdh5.3.2 hadoop
ls
****************************************************************
Step 6 Configure Hadoop (On Master node only)
sudo nano ~/.bashrc
source ~/.bashrc
#(Copy the environment settings from the tutorial website linked below into ~/.bashrc and
change the path prefix to match your own username, for example “anju”)
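For orientation, the lines added to ~/.bashrc usually look something like the sketch below. This is an assumption, not a copy of the website's content: it assumes Hadoop was moved to /home/anju/hadoop and uses the standard Hadoop 2.x environment variable names, so the exact lines on the linked tutorial may differ.

```shell
# Assumed install location: the "hadoop" directory in the home of user "anju".
# Change this path to match your own username and directory.
export HADOOP_PREFIX="/home/anju/hadoop"
export HADOOP_COMMON_HOME="$HADOOP_PREFIX"
export HADOOP_HDFS_HOME="$HADOOP_PREFIX"
export HADOOP_MAPRED_HOME="$HADOOP_PREFIX"
export HADOOP_YARN_HOME="$HADOOP_PREFIX"
# Make the hadoop commands and the start/stop scripts available on the PATH.
export PATH="$PATH:$HADOOP_PREFIX/bin:$HADOOP_PREFIX/sbin"
```

After saving the file, source ~/.bashrc applies the settings to the current terminal.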
********
Use these links
https://data-flair.training/blogs/hadoop-2-6-multinode-cluster-setup/
http://archive.cloudera.com/cdh5/cdh/5/hadoop-2.5.0-cdh5.3.2.tar.gz