Core Utilities
Core Utilities
This article deals with so-called core utilities on a GNU/Linux system, such as less, ls, and grep. The
scope of this article includes, but is not limited to, those utilities included with the
GNU coreutils package. What follows are various tips and tricks and other helpful information
related to these utilities.
Most command-line interfaces are documented in man pages, GNU commands tend to be
documented in info(1) pages, some shells provide a help command for shell-builtin commands.
Additionally most commands print their usage when run with the --help argument.
Contents
[hide]
1File management
o 1.1ls
1.1.1Long format
1.1.2File names containing spaces enclosed in quotes
o 1.2cat
o 1.3less
1.3.1Vim as alternative pager
o 1.4mkdir
o 1.5mv
o 1.6rm
o 1.7chmod
o 1.8chown
o 1.9find
o 1.10locate
2Text streams
o 2.1grep
o 2.2sed
o 2.3awk
o 2.4pv
3System administration
o 3.1sudo
o 3.2which
o 3.3lsblk
o 3.4ip
o 3.5ss
4Miscellaneous
o 4.1dd
o 4.2iconv
4.2.1Convert a file in place
o 4.3od
o 4.4seq
o 4.5tar
o 4.6wipefs
5See also
File management
Manual
Command Description Example
page
mv ~/compressed.zip
mv Move a file mv(1)
~/archive/compressed2.zip
ls
ls lists directory contents.
See info ls or the online manual for more information.
exa is a modern, and more user friendly alternative to ls and tree, that has more features, such as
displaying Gitmodifications along with filenames, colouring differently each columnn in --
long mode, or displaying --long mode metadata along with a tree view. exa
Long format
The -l option displays some metadata, for example:
$ ls -l /path/to/directory
total 128
drwxr-xr-x 2 archie users 4096 Jul 5 21:03 Desktop
drwxr-xr-x 6 archie users 4096 Jul 5 17:37 Documents
drwxr-xr-x 2 archie users 4096 Jul 5 13:45 Downloads
-rw-rw-r-- 1 archie users 5120 Jun 27 08:28 customers.ods
-rw-r--r-- 1 archie users 3339 Jun 27 08:28 todo
-rwxr-xr-x 1 archie users 2048 Jul 6 12:56 myscript.sh
The total value represents the total disk allocation for the files in the directory, by default in
number of blocks.
Below, each file and subdirectory is represented by a line divided into 7 metadata fields, in the
following order:
Because cat is not built into the shell, on many occasions you may find it more convenient to use
a redirection, for example in scripts, or if you care a lot about performance. In fact < file does
the same as cat file.
If you need to list file lines in reverse order, there is a coreutil command
called tac (cat reversed).
less
less is a terminal pager program used to view the contents of a text file one screen at a time. Whilst
similar to other pagers such as more and the deprecated pg, less offers a more advanced interface
and complete feature-set.
~/.bashrc
alias less='/usr/share/vim/vim80/macros/less.sh'
There is also an alternative to the less.sh macro, which may work as the PAGER environment
variable. Install vimpagerand add the following to your shell configuration file:
~/.bashrc
export PAGER='vimpager'
alias less=$PAGER
Now programs that use the PAGER environment variable, like git, will use vim as pager.
mkdir
mkdir makes directories.
To create a directory and its whole hierarchy, the -p switch is used, otherwise an error is printed.
Changing mode of a just created directory using chmod is not necessary as the -m option lets you
define the access permissions.
Tip: If you just want a temporary directory, a better alternative may be mktemp: mktemp -d
mv
mv moves and renames files and directories.
Note: "Security aliases" are dangerous because you get used to them, resulting in potential data
loss when you use another system / user that doesn't have these aliases.
This alias asks for confirmation before overwriting any existing files and lists the operations in
progress.
rm
rm removes files or directories.
Note: "Security aliases" are dangerous because you get used to them, resulting in potential data
loss when you use another system / user that doesn't have these aliases.
One would probably expect a find command to take as argument a file name and search the
filesystem for files matching that name. For a program that does exactly that see #locate below.
Instead, find takes a set of directories and matches each file under them against a set of
expressions. This design allows for some very powerful "one-liners" that would not be possible using
the "intuitive" design described above. See GregsWiki:UsingFind for usage details.
locate
Install the mlocate package. The package contains an updatedb.timer unit, which invokes a
database update each day. The timer is enabled right after installation, start it manually if you want
to use it before reboot. You can also manually run updatedb as root at any time. By default, paths
such as /media and /mnt are ignored, so locate may not discover files on external devices.
See updatedb(8) for details.
The locate command is a common Unix tool for quickly finding files by name. It offers speed
improvements over the find tool by searching a pre-constructed database file, rather than the
filesystem directly. The downside of this approach is that changes made since the construction of
the database file cannot be detected by locate.
Before locate can be used, the database will need to be created. To do this, execute updatedb as
root.
See also How locate works and rewrite it in one minute.
Text streams
grep
grep is a command line text search utility originally written for Unix. The grep command searches
files or standard input for lines matching a given regular expression, and prints these lines to
standard output.
Remember that grep handles files, so a construct like cat file | grep pattern is
replaceable with grep pattern file
There are grep alternatives optimized for VCS source code, such
as ripgrep, the_silver_searcher, and ack.
To include file line numbers in the output, use the -n option.
grep can also be used for hexadecimal search in a binary file, to look for let say the A1
F2 sequence in a file, the command line is:
Note: Some commands send their output to stderr(3), and grep has no apparent effect. In this
case, redirect stderrto stdout with command 2>&1 | grep args or (for Bash 4) command |&
grep args. See also I/O Redirection.
awk
AWK is a pattern scanning and processing language. There are multiple implementations:
You can use pv to monitor the progress of data through a pipeline, for example:
# dd if=/source/filestream | pv -monitor_options -s size_of_file | d
d of=/destination/filestream
System administration
sudo
See Sudo.
which
which shows the full path of shell commands. In the following example the full path
of ssh is used as an argument for journalctl:
lsblk
lsblk(8) will show all available block devices along with their partitioning schemes,
for example:
$ lsblk -f
NAME FSTYPE LABEL UUID MOU
NTPOINT
sda
├─sda1 vfat C4DA-2C4D /bo
ot
├─sda2 swap 5b1564b2-2e2c-452c-bcfa-d1f572ae99f2 [SW
AP]
└─sda3 ext4 56adc99b-a61e-46af-aab7-a6d07e504652 /
The beginning of the device name specifies the type of block device. Most modern
storage devices (e.g. hard disks, SSDs and USB flash drives) are recognised as SCSI
disks (sd). The type is followed by a lower-case letter starting from a for the first device
(sda), b for the second device (sdb), and so on. Existing partitions on each device will
be listed with a number starting from 1 for the first partition (sda1), 2 for the second
(sda2), and so on. In the example above, only one device is available (sda), and that
device has three partitions (sda1 to sda3), each with a different file system.
Other common block device types include for example mmcblk for memory cards
and nvme for NVMe devices. Unknown types can be searched in the kernel
documentation.
ip
ip allows you to show information about network devices, IP addresses, routing tables,
and other objects in the Linux IP software stack. By appending various commands, you
can also manipulate or configure most of these objects.
Note: The ip utility is provided by the iproute2 package, which is included in
the base group.
The help command is available for all objects. For example, typing ip addr help will
show you the command syntax available for the address object. For advanced usage
see the iproute2 documentation.
The Network configuration article shows how the ip command is used in practice for
various common tasks.
Note: You might be familiar with the ifconfig command, which is deprecated in Arch
Linux; use ip instead.
ss
ss is a utility to investigate network ports and is part of the iproute2 package in
the base group. It has a similar functionality to the deprecated netstat utility.
Common usage includes:
Display all TCP Sockets with service names:
$ ss -at
$ ss -atn
$ ss -au
For more information see ss(8) or ss.html from the iproute2 package.
Miscellaneous
dd
dd is a utility for Unix and Unix-like operating systems whose primary purpose is to
convert and copy a file.
Similarly to cp, by default dd makes a bit-to-bit copy of the file, but with lower-level I/O
flow control features.
Some notable applications of dd are:
Binary file patching: let say one wants to replace offset 0x123AB of a file with the FF
C0 14 hexadecimal sequence, this can be done with the command line:
# printf '\xff\xc0\x14' | dd seek=$((0x123AB)) conv=notrunc bs=1
of=/path/to/file
Warning: One should be extremely cautious using dd, as with any command of this kind
it can destroy data irreversibly.
iconv
iconv converts the encoding of characters from one codeset to another.
The following command will convert the file foo from ISO-8859-15 to UTF-8, saving it
to foo.utf:
Unlike sed, iconv does not provide an option to convert a file in place.
However, sponge from the moreutils package can help:
The construction of some of these tar arguments may be considered legacy, but they
are still useful when performing specific operations. See tar(1) for details.
Note: Although GNU's tar is installed as the default tar program, official Arch Linux
projects like pacman and mkinitcpio use bsdtar from the libarchive package.
wipefs
wipefs can list or erase file system, RAID or partition-table signatures (magic strings)
from the specified device. It does not erase the file systems themselves nor any other
data from the device.
See wipefs(8) for more information.
For example, to erase all signatures from the device /dev/sdb and create a signature
backup ~/wipefs-sdb-offset.bakfile for each signature: