BDA Manual
AIM:
To download and install Hadoop; to understand the different Hadoop modes, startup scripts and configuration files.
PREREQUISITES:
• VIRTUALBOX (for Linux): used to install a Linux operating system as a virtual machine.
• OPERATING SYSTEM: Hadoop can be installed on Windows or Linux based operating systems; Ubuntu and CentOS are very commonly used.
• JAVA: the Java 8 JDK must be installed on your system.
• HADOOP: the latest stable Hadoop release (this manual uses Hadoop 3.3.0).
1. Install Java
• Java JDK download link:
https://www.oracle.com/java/technologies/javase-jdk8-downloads.html
• Extract and install Java in C:\Java
• Open cmd and run: javac -version
2. Download Hadoop
https://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-3.3.0/hadoop-3.3.0.tar.gz
• Extract the archive so that the Hadoop folder is at C:\Hadoop-3.3.0 (the path used in the later steps)
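Before editing the configuration files, Hadoop needs the JAVA_HOME and HADOOP_HOME environment variables. A minimal sketch for the install locations used above (the exact JDK folder name is illustrative):
setx JAVA_HOME "C:\Java\jdk1.8.0_202"
setx HADOOP_HOME "C:\Hadoop-3.3.0"
Then add %JAVA_HOME%\bin, %HADOOP_HOME%\bin and %HADOOP_HOME%\sbin to the Path variable (System Properties -> Environment Variables).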
Paste the following XML into core-site.xml (under C:\Hadoop-3.3.0\etc\hadoop) and save:
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
Similarly, hdfs-site.xml (in the same folder) holds the DataNode storage directory:
<configuration>
<property>
<name>dfs.datanode.data.dir</name>
<value>/hadoop-3.3.0/data/datanode</value>
</property>
</configuration>
6. Hadoop Configurations
Download:
https://github.com/brainmentorspvtltd/BigData_RDE/blob/master/Hadoop%20Configuration.zip
or (for Hadoop 3):
https://github.com/s911415/apache-hadoop-3.1.0-winutils
• Copy the bin folder from the download and replace the existing bin folder at C:\Hadoop-3.3.0\bin
• Format the NameNode
• Open cmd and type the command "hdfs namenode -format"
7. Testing
• Open cmd and change directory to C:\Hadoop-3.3.0\sbin
• Type start-all.cmd to start the Hadoop daemons
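To verify that the daemons have started, the jps command (shipped with the JDK) can be run in a new cmd window; for a default single-node setup it should list at least the NameNode, DataNode, ResourceManager and NodeManager processes:
jps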
EXP.NO:2 HADOOP IMPLEMENTATION OF FILE MANAGEMENT TASKS, SUCH AS ADDING FILES AND DIRECTORIES, RETRIEVING FILES AND DELETING FILES
DATE:
AIM:
To implement the following file management tasks in Hadoop:
1. Adding files and directories
2. Retrieving files
3. Deleting Files
1. Create a directory in HDFS at the given path(s).
Usage:
hadoop fs -mkdir <paths>
Example:
hadoop fs -mkdir /user/saurzcode/dir1 /user/saurzcode/dir2
2. Copy a file in HDFS from source to destination.
Usage:
hadoop fs -cp <source URI> <destination URI>
Example:
hadoop fs -cp /user/saurzcode/dir1/abc.txt /user/saurzcode/dir2
3. copyFromLocal
Usage:
hadoop fs -copyFromLocal <localsrc> URI
Example:
hadoop fs -copyFromLocal /home/saurzcode/abc.txt /user/saurzcode/abc.txt
Similar to the put command, except that the source is restricted to a local file reference.
copyToLocal
Usage:
hadoop fs -copyToLocal [-ignorecrc] [-crc] URI <localdst>
Similar to the get command, except that the destination is restricted to a local file reference.
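An example, following the same illustrative paths used above:
hadoop fs -copyToLocal /user/saurzcode/abc.txt /home/saurzcode/abc.txt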
4. Remove a file or directory in HDFS.
Removes the files specified as arguments. Deletes a directory only when it is empty.
Usage:
hadoop fs -rm <arg>
Example:
hadoop fs -rm /user/saurzcode/dir1/abc.txt
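If a directory is not empty, the recursive form removes it together with its contents (illustrative path):
hadoop fs -rm -r /user/saurzcode/dir2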
RESULT:
Thus the file management tasks in Hadoop (adding files and directories, retrieving files, and deleting files) were implemented successfully.
EXP.NO:3 IMPLEMENTATION OF MATRIX MULTIPLICATION WITH HADOOP MAP REDUCE
DATE:
AIM:
To write a Map Reduce Program that implements Matrix Multiplication.
ALGORITHM:
We assume that the input matrices are already stored in Hadoop Distributed File System
(HDFS) in a suitable format (e.g., CSV, TSV) where each row represents a matrix element. The
matrices are compatible for multiplication (the number of columns in the first matrix is equal
to the number of rows in the second matrix).
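For instance, a 2x2 matrix whose elements are stored one per line in row,column,value form (the format the mapper below parses) would look like:
0,0,1
0,1,2
1,0,3
1,1,4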
STEP 1: MAPPER
The mapper will take the input matrices and emit key-value pairs for each element in
the result matrix. The key will be the (row, column) index of the result element, and the value
will be the corresponding element value.
STEP 2: REDUCER
The reducer will take the key-value pairs emitted by the mapper and calculate the partial
sum for each element in the result matrix.
PROGRAM:
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.fs.Path;

// MatrixMultiplicationMapper.java
public class MatrixMultiplicationMapper extends Mapper<LongWritable, Text, Text, Text> {
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        // Parse the input line to get the row, column, and value of each element in the input matrices
        String[] elements = value.toString().split(",");
        int row = Integer.parseInt(elements[0]);
        int col = Integer.parseInt(elements[1]);
        int val = Integer.parseInt(elements[2]);
        // Emit key-value pairs where the key is the (row, column) index of the result element
        // and the value is the corresponding element value
        context.write(new Text(row + "," + col), new Text(String.valueOf(val)));
    }
}

// MatrixMultiplicationReducer.java
public class MatrixMultiplicationReducer extends Reducer<Text, Text, Text, IntWritable> {
    protected void reduce(Text key, Iterable<Text> values, Context context)
            throws IOException, InterruptedException {
        int result = 0;
        for (Text value : values) {
            // Accumulate the partial sum for the result element
            result += Integer.parseInt(value.toString());
        }
        // Emit the final result for the result element
        context.write(key, new IntWritable(result));
    }
}

// MatrixMultiplicationDriver.java
public class MatrixMultiplicationDriver {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "Matrix Multiplication");
        job.setJarByClass(MatrixMultiplicationDriver.class);
        job.setMapperClass(MatrixMultiplicationMapper.class);
        job.setReducerClass(MatrixMultiplicationReducer.class);
        // The mapper emits Text keys and values; the reducer emits Text keys and IntWritable values
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(Text.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
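A hedged example of how the job might be run, assuming the three classes above are packaged into a jar named matrixmultiplication.jar (the jar name and the HDFS paths are illustrative):
hadoop jar matrixmultiplication.jar MatrixMultiplicationDriver /user/saurzcode/matrix/input /user/saurzcode/matrix/output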
EXP.NO:4 RUN A BASIC WORD COUNT MAP REDUCE PROGRAM
TO UNDERSTAND MAP REDUCE PARADIGM
DATE:
AIM:
To write a Basic Word Count program to understand Map Reduce Paradigm.
ALGORITHM:
The entire MapReduce program can be fundamentally divided into three parts:
• Mapper Phase Code
• Reducer Phase Code
• Driver Code
Reducer Phase:
Input:
• The key is nothing but the unique words generated after the sorting and shuffling phase: Text
• The value is a list of integers corresponding to each key: IntWritable
• Example - Bear, [1, 1], etc.
Output:
• The key is all the unique words present in the input text file: Text
• The value is the number of occurrences of each unique word: IntWritable
• Example - Bear, 2; Car, 3, etc.
• The values in the list corresponding to each key are aggregated to produce the final answer.
• In general, a single reduce call is made for each unique key, but you can specify the number of reducers in mapred-site.xml.
PROGRAM:
import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.fs.Path;

public class WordCount {

    public static class Map extends Mapper<LongWritable, Text, Text, IntWritable> {
        public void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // Split the input line into individual words
            String line = value.toString();
            StringTokenizer tokenizer = new StringTokenizer(line);
            while (tokenizer.hasMoreTokens()) {
                // Emit each word with a count of 1
                value.set(tokenizer.nextToken());
                context.write(value, new IntWritable(1));
            }
        }
    }

    public static class Reduce extends Reducer<Text, IntWritable, Text, IntWritable> {
        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            // Sum up all the counts emitted for this word
            int sum = 0;
            for (IntWritable x : values) {
                sum += x.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "My Word Count Program");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(Map.class);
        job.setReducerClass(Reduce.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        job.setInputFormatClass(TextInputFormat.class);
        job.setOutputFormatClass(TextOutputFormat.class);
        Path outputPath = new Path(args[1]);
        // Configuring the input/output path from the file system into the job
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, outputPath);
        // Deleting the output path automatically from HDFS so that we don't have to delete it explicitly
        outputPath.getFileSystem(conf).delete(outputPath, true);
        // Exiting the job only if the flag value becomes false
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
Run the MapReduce code:
The command for running a MapReduce code is:
hadoop jar hadoop-mapreduce-example.jar WordCount /sample/input /sample/output
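Once the job completes, the word counts can be inspected from the reducer output file in HDFS (part-r-00000 is the standard name of the first reducer's output file):
hadoop fs -cat /sample/output/part-r-00000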
OUTPUT:
EXP.NO:5
INSTALLATION OF HIVE ALONG WITH PRACTICE EXAMPLES.
DATE:
AIM:
To install HIVE along with practice examples.
PREREQUISITES:
• Java Development Kit (JDK) installed and the JAVA_HOME environment variable
set.
• Hadoop installed and configured on your Windows system.
STEP-BY-STEP INSTALLATION:
1. Download HIVE:
Visit the Apache Hive website and download the latest stable version of Hive.
Official Apache Hive website: https://hive.apache.org/
2. Extract the Downloaded Hive Archive to a Directory on Your Windows Machine,
e.g., C:\hive.
3. Configure Hive:
• Open the Hive configuration file (hive-site.xml) located in the conf folder of the
extracted Hive directory.
• Set the necessary configurations, such as the Hive Metastore connection settings and Hadoop configurations, adjusting the paths for Windows as needed (a minimal sketch follows).
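A minimal sketch of such settings, assuming the embedded Derby Metastore and a local warehouse directory (both values are illustrative, not taken from this manual):
<configuration>
<property>
<name>javax.jdo.option.ConnectionURL</name>
<value>jdbc:derby:;databaseName=metastore_db;create=true</value>
</property>
<property>
<name>hive.metastore.warehouse.dir</name>
<value>/user/hive/warehouse</value>
</property>
</configuration>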
4. Environment Variables Setup:
• Add the Hive binary directory (C:\hive\bin in this example) to your PATH environment
variable.
• Set the HIVE_HOME environment variable to point to the Hive installation directory
(C:\hive in this example).
5. Start the Hive Metastore service:
To start the Hive Metastore service, you can use the schematool script:
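A hedged example, assuming the embedded Derby Metastore: the schema is initialized once with schematool, after which the Metastore service can be started:
schematool -dbType derby -initSchema
hive --service metastore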
6. Start Hive:
• Open a command prompt or terminal and navigate to the Hive installation directory.
• Execute the hive command to start the Hive shell.
EXAMPLES:
1. Create a Database:
To create a new database in HIVE, use the following syntax:
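CREATE DATABASE database_name;
Example (the name matches the database used in the next step):
CREATE DATABASE mydatabase;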
2. Use a Database:
To use a specific database in HIVE, use the following syntax:
USE database_name;
Example:
USE mydatabase;
3. Show Databases:
To display a list of available databases in HIVE, use the following syntax:
SHOW DATABASES;
4. Create a Table:
To create a table in HIVE, use the following syntax:
CREATE TABLE table_name (
column1 datatype,
column2 datatype
);
Example:
CREATE TABLE mytable
( id INT,
name STRING,
age INT
);
5. Show Tables:
To display a list of tables in the current database, use the following syntax:
SHOW TABLES;
6. Describe a Table:
To view the schema and details of a specific table, use the following syntax:
DESCRIBE table_name;
Example:
DESCRIBE mytable;
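7. Insert and Query Data:
As a further practice example, rows can be inserted into and queried from the table created above (the values are illustrative; INSERT ... VALUES requires Hive 0.14 or later):
INSERT INTO TABLE mytable VALUES (1, 'Alice', 25);
INSERT INTO TABLE mytable VALUES (2, 'Bob', 30);
SELECT * FROM mytable WHERE age > 26;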
RESULT:
Thus the Installation of HIVE was done successfully.
EXP.NO:6 INSTALLATION OF THRIFT
DATE:
AIM:
To install Apache thrift on Windows OS.
ALGORITHM:
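On Windows, Apache Thrift is typically installed from the prebuilt compiler binary; a minimal hedged sketch (the version number and folder are illustrative):
1. Download the prebuilt Windows compiler (thrift-0.x.y.exe) from https://thrift.apache.org/download
2. Rename the file to thrift.exe and place it in a folder that is on the PATH (for example C:\thrift).
3. Open cmd and verify the installation with: thrift -version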
EXP.NO:7 PRACTICE IMPORTING AND EXPORTING DATA FROM
VARIOUS DATABASES.
DATE:
AIM:
To import and export data from various Databases using SQOOP.
ALGORITHM:
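To import a table from a relational database into HDFS, a hedged sketch with placeholder values (every angle-bracketed token must be filled in for the target database):
sqoop import \
--connect <JDBC_CONNECTION_STRING> \
--username <DB_USER> \
--password <DB_PASSWORD> \
--table <TABLE_NAME> \
--target-dir <HDFS_IMPORT_DIR>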
To export data from HDFS into a database table:
sqoop export \
--connect <JDBC_CONNECTION_STRING> \
--username <DB_USER> \
--password <DB_PASSWORD> \
--table <TABLE_NAME> \
--export-dir <HDFS_EXPORT_DIR> \
--input-fields-terminated-by '<DELIMITER>'
RESULT:
Thus the import and export of data from various databases using SQOOP was implemented successfully.
EXP.NO:8 INSTALLATION OF HBASE ALONG WITH PRACTICE EXAMPLES
DATE:
AIM:
To install HBASE using Virtual Machine and perform some operations in HBASE.
ALGORITHM:
Step 1: Install a Virtual Machine
• Download and install a virtual machine software such as VirtualBox
(https://www.virtualbox.org/) or VMware (https://www.vmware.com/).
• Create a new virtual machine and install a Unix-based operating system like Ubuntu or
CentOS. You can download the ISO image of your desired Linux distribution from their
official websites.
Step 2: Download and Install HBase
• Move the extracted HBase directory to a desired location:
sudo mv <hbase_extracted_directory> /opt/hbase
• Replace <hbase_extracted_directory> with the actual name of the extracted HBase directory.
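Before tables can be created, HBase has to be started and its shell opened; a minimal sketch assuming the standalone installation at /opt/hbase from the step above:
/opt/hbase/bin/start-hbase.sh
/opt/hbase/bin/hbase shell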
Step 3: Create a Table
• In the HBase shell, you can create a table with column families.
• For example, let's create a table named "my_table" with a column family called "cf":
create 'my_table', 'cf'
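A few follow-up operations on the table just created, as a hedged sketch of further practice (the row key, column qualifier and values are illustrative):
put 'my_table', 'row1', 'cf:name', 'Alice'
get 'my_table', 'row1'
scan 'my_table'
list
disable 'my_table'
drop 'my_table'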
RESULT:
Thus the installation of HBase using Virtual Machine was done successfully.