HDFS (Hadoop Distributed File System)
History of Hadoop
Year  Event
2003  Google's papers on the Google File System (GFS) and, soon after, MapReduce lay the groundwork for Hadoop.
2005  Doug Cutting and Mike Cafarella's work on the Nutch project leads to Hadoop.
2006  Hadoop becomes a standalone project, named after Doug Cutting's son's toy elephant.
2008  Hadoop becomes a top-level project at the Apache Software Foundation.
2010  Hadoop 1.0.
2011  The ecosystem grows with Hive, Pig, HBase, and ZooKeeper.
2012  Hadoop becomes widely used for big data processing.
2014  Hadoop 2.0 introduces YARN and an improved HDFS.
2015  Cloud-based Hadoop services such as Amazon EMR, Google Dataproc, and Azure HDInsight.
Purpose                                 Primary Use Case
Manages structured data with queries    Applications needing tables/queries
General-purpose file system             Windows storage
General-purpose file system             Linux storage
Optimized for Apple devices             macOS/iOS storage
Distributed storage for big data        Big data analytics
Key Features:
Executes scripts in non-Java languages for batch processing.
Part of Hadoop's MapReduce ecosystem.
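The features above read like a description of Hadoop Streaming, which lets any executable that reads standard input and writes standard output act as a mapper or reducer. Assuming that is the tool meant, the following is a minimal word-count sketch in shell. The function names `mapper` and `reducer` are made up for this illustration, and the last line only imitates the map, sort, reduce flow locally; it does not contact a cluster.

```shell
# Word-count mapper: emit "word<TAB>1" for every whitespace-separated token.
mapper() {
  tr -s '[:space:]' '\n' | awk 'NF { print $0 "\t1" }'
}

# Word-count reducer: input arrives sorted by key, so equal words are adjacent.
# Sum the counts for each run of identical words.
reducer() {
  awk -F '\t' '
    $1 != prev { if (NR > 1) print prev "\t" sum; prev = $1; sum = 0 }
    { sum += $2 }
    END { if (NR > 0) print prev "\t" sum }
  '
}

# Local imitation of map -> shuffle/sort -> reduce, no Hadoop required.
printf 'big data\nbig cluster\n' | mapper | sort | reducer
```

On a real cluster the two functions would normally live in separate script files passed to the streaming jar through its `-mapper` and `-reducer` options, together with `-input` and `-output` paths. The location of that jar depends on the installation.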
HDFS COMMANDS
mkdir ls get
put cat
mkdir:
hdfs dfs -mkdir /path/to/directory
hdfs dfs -mkdir /user/hadoop/data
ls:
hdfs dfs -ls /path/to/directory
hdfs dfs -ls /user/hadoop
put:
hdfs dfs -put /local/path/file /hdfs/path
hdfs dfs -put /home/user/data.txt /user/hadoop/data
cat:
hdfs dfs -cat /path/to/file
hdfs dfs -cat /user/hadoop/data/data.txt
get:
hdfs dfs -get /hdfs/path/file /local/path
hdfs dfs -get /user/hadoop/data/data.txt /home/user
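The five commands above are often used together as an upload round trip. The sketch below chains them in one function. `DRY_RUN`, `run`, and `upload_and_verify` are names invented for this example, and the paths are the illustrative ones from the slides. With `DRY_RUN=1` (the default here) each command is only printed, so the sequence can be reviewed on a machine without a Hadoop client; set `DRY_RUN=0` to execute it against a real cluster.

```shell
# DRY_RUN=1 prints each command instead of running it (hypothetical helper).
DRY_RUN="${DRY_RUN:-1}"

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Upload a local file to HDFS, list and print it, then download it again.
upload_and_verify() {
  local_file="$1"     # e.g. /home/user/data.txt
  hdfs_dir="$2"       # e.g. /user/hadoop/data
  download_dir="$3"   # e.g. /home/user/restore
  name="$(basename "$local_file")"

  run hdfs dfs -mkdir -p "$hdfs_dir" &&
  run hdfs dfs -put "$local_file" "$hdfs_dir" &&
  run hdfs dfs -ls "$hdfs_dir" &&
  run hdfs dfs -cat "$hdfs_dir/$name" &&
  run hdfs dfs -get "$hdfs_dir/$name" "$download_dir"
}

upload_and_verify /home/user/data.txt /user/hadoop/data /home/user/restore
```

The `-p` flag on `-mkdir` creates missing parent directories and does not fail if the directory already exists, which makes the function safe to rerun up to the `-put` step. The `&&` chain stops at the first command that fails.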
HDFS COMMANDS
cp mv rm
chown touchz
cp:
hdfs dfs -cp /source/path /destination/path
hdfs dfs -cp /user/hadoop/data.txt /user/hadoop/backup/data.txt
chown:
hdfs dfs -chown [user]:[group] /path/to/file_or_directory
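The command overview for this group also lists mv, rm, and touchz, for which no syntax is shown. The sketch below uses all three on a marker file, assuming the standard `hdfs dfs` subcommands of the same names. The marker names `_STARTED` and `_DONE`, the helper `run`, the `DRY_RUN` switch, and the function `marker_lifecycle` are invented for illustration. With `DRY_RUN=1` (the default here) the commands are printed rather than executed.

```shell
# DRY_RUN=1 prints each command instead of running it (hypothetical helper).
DRY_RUN="${DRY_RUN:-1}"

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Create an empty marker file, rename it, then delete it.
marker_lifecycle() {
  dir="$1"   # e.g. /user/hadoop/data

  run hdfs dfs -touchz "$dir/_STARTED" &&        # create a zero-length file
  run hdfs dfs -mv "$dir/_STARTED" "$dir/_DONE" && # rename within HDFS
  run hdfs dfs -rm "$dir/_DONE"                  # delete the file
}

marker_lifecycle /user/hadoop/data
```

Deleting a directory rather than a single file needs `-rm -r`. Whether deleted files go to a trash directory first depends on the cluster configuration.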
HDFS COMMANDS
du df setrep
clear stat
du:
hdfs dfs -du [-s] [-h] /path
hdfs dfs -du /user/hadoop/project
setrep:
hdfs dfs -setrep -w [replication_factor] /path
hdfs dfs -setrep -w 3 /user/hadoop/project/data.txt
df:
hdfs dfs -df [path]
hdfs dfs -df /
Filesystem             Size    Used   Available  Use%
hdfs://localhost:9000  1000GB  400GB  600GB      40%
stat:
hdfs dfs -stat [format] /path
hdfs dfs -stat "%n %b %r %y" /user/hadoop/project/data.txt
clear: (a shell command that clears the terminal, not an hdfs dfs subcommand)
clear
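The stat and setrep commands combine naturally: `%r` in a stat format string prints a file's current replication factor, and setrep changes it. The sketch below only calls setrep when the factor differs from the wanted one. The function name `ensure_replication` is made up for this example, and it assumes the path is a file and that an `hdfs` client is on the PATH.

```shell
# Set the replication factor of an HDFS file only if it differs from the
# requested value. Usage: ensure_replication <hdfs_file> <factor>
ensure_replication() {
  path="$1"
  want="$2"

  # %r prints the current replication factor of the file.
  have="$(hdfs dfs -stat %r "$path")" || return 1

  if [ "$have" != "$want" ]; then
    # -w waits until the new replication factor has been reached.
    hdfs dfs -setrep -w "$want" "$path"
  fi
}

# Example call (commented out because it needs a running cluster):
# ensure_replication /user/hadoop/project/data.txt 3
```

Checking first avoids a `-setrep -w` call, which can block for a long time on large files, when nothing needs to change.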
Thank you