0% found this document useful (0 votes)

18 views107 pages

Linux Chapter With TOC OCR

The document provides a comprehensive overview of Linux fundamentals, including its history, command line importance, user management, file systems, and system initialization processes. It covers topics such as the differences between SysV init and systemd, user and group management, file permissions, and storage management techniques like LVM. Additionally, it emphasizes the significance of the command line interface and offers practical commands for managing services and file systems.

Uploaded by

pracheth sp

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views107 pages

Linux Chapter With TOC OCR

Uploaded by

pracheth sp

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 107

Linux Fundamentals .................................... 2

A Brief History Lesson .................................... 5

Why Command Line Matters .................................... 11

First Things First Hit That Power Button .................................... 12

SysV Init Traditional .................................... 18

Socket and Timer Units .................................... 23

Introduction to User Management .................................... 26

Creating and Managing Users .................................... 27

Group Management .................................... 28

Understanding File Permissions .................................... 29

Changing Permissions and Ownership .................................... 30

Special Permissions .................................... 31

Understanding Linux File Systems .................................... 36

Mounting and Unmounting File Systems .................................... 37

Managing Swap Space .................................... 40

Monitoring Storage Usage .................................... 41

File System Troubleshooting .................................... 42

Introduction to LVM Logical Volume .................................... 43

Creating and Managing Logical Volumes .................................... 44

Understanding Processes in Linux .................................... 47

Introduction to Shell Scripting .................................... 57

Using Variables in Shell Scripts .................................... 58

Conditionals in Shell Scripts .................................... 59

Loops in Shell Scripts .................................... 62

Realworld Example .................................... 64

Automating Tasks with Shell Scripts .................................... 68

Scheduling Jobs with Cron .................................... 72

1 Howcron Works .................................... 73

Understanding Load Averages and CPU .................................... 82

Commonly Adjusted ulimit Parameters .................................... 85

Commonly Adjusted sysctl Parameters .................................... 88

Making sysctl Changes Persistent .................................... 89

Optimizing System Performance with .................................... 91

Table of Contents (cont.)
Keep Logs Manageable .................................... 100

Diagnosing NetworkRelated Problems .................................... 102

RealWorld Use Case .................................... 103

The Path to Become an SRE
Engineer
Abe Bazouie
Linux Fundamentals
Linux, Unix or Minix. Wait … what???
What Is Linux, Really?

● Linux is an open-source operating system kernel that powers millions

of devices worldwide, from servers to smartphones.
● It’s a Unix-like system, inspired by its predecessors (Unix and Minix).
● Think of Linux as the foundation of a house—everything else (like your
applications) is built on top of it.
A Brief History Lesson

Minix, developed by
Andrew Tanenbaum
1987

1969 1987
Unix was created Linus Torvalds,
at Bell Labs by
built Linux as a
Dennis Ritchie
and Ken personal project.
Thompson
Why Linux Rocks

Why Should You Care About Linux?

● Powers 96.3% of the world’s top servers (including Google, Facebook,

and Netﬂix).
● Free and open-source (use it, modify it, share it).
● Highly reliable, secure, and customizable.
● Fun Fact: Even Android runs on Linux!
Unix vs. Linux vs. Minix

● Unix: The OG operating system, expensive and

proprietary.
● Minix: Unix’s lightweight teaching-oriented cousin.
● Linux: The open-source, community-driven, and inﬁnitely
customizable offspring.
The GNU Project – The Building Blocks of
Linux

● Founded in 1983 by Richard Stallman, the

GNU Project aimed to create a free
Unix-like operating system.
● GNU stands for “GNU's Not Unix” (a fun
recursive acronym!).
● Provided essential utilities like compilers,
editors, and shell programs—everything
except the kernel.
The Philosophy of GNU

● The GNU Project championed the idea of free software (freedom, not price).
● Created the GPL (GNU General Public License) to ensure software freedom.
● Inspired the open-source movement, which drives modern software
development.
What Makes Linux Tick? Think of it Like a Restaurant

● Kernel – The Chef

○ The kernel is like the chef in a restaurant.
○ It takes care of the ingredients (hardware like CPU, memory, and storage) and cooks (manages)
them to serve the dish (processes).
○ You never talk to the chef directly.

● Shell – The Waiter

○ The shell is like the waiter who takes your order (commands).
○ It listens to what you want, passes the request to the chef (kernel), and brings the results
(output) back to you.
○ Common shells: Bash, Zsh, etc.

● Userland – The Dining Room

○ The userland is like the dining area, where you sit, relax, and enjoy.
○ It includes the menu (applications), utilities (like salt and pepper shakers), and everything you
directly interact with.
○ Examples: Text editors, browsers, and system tools like ls.
Why Command Line Matters

The Power of the Command Line

● Direct communication with the system.

● Allows automation and scripting.
● Fun Fact: Command-line wizards are 50% cooler than GUI users (source: me :D).
First Things First: Hit That Power Button!

What happen once you push the Power button to turn on your computer?

● Boot sequence: BIOS → Bootloader → Kernel → Init.

● The init process is like the conductor of an orchestra—it starts and manages
all the system processes.
What’s Next?

● Deep dive into the init process: What it is, how it works, and why it matters.
● Understanding systemd, the modern init replacement.
Introduction to the init Process

● The init process is the ﬁrst process

started by the Linux kernel after booting.

● It has PID 1, meaning it's the parent of all

other processes on the system.

● Its job is to start system services and get

the system ready for use (logging in,
starting services like networking, etc.).

● Different types of init systems have

existed, such as:
○ SysV init (older method)
○ systemd (modern replacement)
Init1: parent of other processes
Init1: kill zombie processes
The Role of init in System Startup

How Does the init Process Work?

SysV init: Works by using runlevels to control which services start at boot. Each
runlevel represents a different system state (e.g., multi-user mode, single-user mode).

Init scripts: In SysV init, scripts located in /etc/init.d/ or /etc/rc.d/ deﬁne which
services start.

Runlevels:
● Runlevel 0: Halt (shuts down the system)
● Runlevel 1: Single-user mode (for system maintenance)
● Runlevel 3: Multi-user mode (text-based login)
● Runlevel 5: Multi-user mode (graphical login)
● Runlevel 6: Reboot
Systemd vs SysV Init: Modern Targets vs Traditional Runlevels

SysV Init (Traditional)

● Uses runlevels (0-6) to deﬁne system states.
Example:
○ Runlevel 3: Multi-user (text).
○ Runlevel 5: Multi-user (graphical).

Systemd (Modern)
● Replaces runlevels with target units.
Example Targets:
○ multi-user.target: Multi-user mode.
○ graphical.target: Graphical mode.

Key Improvement
● Targets are more descriptive and ﬂexible, enabling faster boot times and custom
system states.
Systemd vs SysV
SysV Runlevel systemd Target Description

Runlevel 0 poweroff.target Shuts down the system

Runlevel 1 rescue.target Single-user mode for

maintenance tasks

Runlevel 3 multi-user.target Multi-user mode with no

GUI (CLI only)

Runlevel 5 graphical.target Multi-user mode with

GUI (Graphical Login)

Runlevel 6 reboot.target Reboots the system

Introduction to systemd

systemd is the default init system in most modern Linux distributions.

Replaces older init systems like SysV init to manage services, processes, and
system boot.

It uses units to manage system components.

Managing Services with systemd

Start, stop, enable, or check the status of services using systemctl.

Example commands:

● systemctl start nginx.service: Start a service.

● systemctl status sshd.service: Check the status of a service.
● systemctl enable httpd.service: Enable a service to start on boot.
Target Units

Understanding Target Units:

Target units are used to group other units. For example:

● multi-user.target: Boots the system into multi-user mode (text-based).

● graphical.target: Boots the system into graphical mode (GUI).

Use systemctl isolate multi-user.target to switch between modes.

Socket and Timer Units

● Socket Units: Manage network connections (e.g., sshd.socket listens for

SSH connections).
● Timer Units: Schedule tasks based on time (e.g., logrotate.timer rotates
logs daily).

Examples:

● systemctl start sshd.socket: Start the socket for SSH connections.

● systemctl list-timers: View all active timers.
Journaling and Logs

systemd uses journald to manage logs for services and system processes.

View logs using journalctl:

● journalctl -b: View logs from the current boot.

● journalctl -u nginx.service: View logs for a speciﬁc service.
Introduction to User Management

● Each user is identiﬁed by a UID (User ID) and stored in /etc/passwd.

● Passwords are stored in hashed form in /etc/shadow.

● Users can be assigned a primary group and secondary groups.

Creating and Managing Users

useradd to create a new user.

usermod to modify user properties.

userdel to delete a user and their home directory.

Example commands:

● useradd abe
● usermod -aG sudo abe
● userdel -r abe
Group Management

● Groups provide a way to assign permissions to multiple users.

● Use groupadd, usermod -G, and groupdel for group management.

● Understanding primary groups and secondary group memberships.

Diagram ✅
Understanding File Permissions

Every ﬁle has read (r), write (w), and execute (x) permissions.

Permissions are assigned to three classes: owner, group, and others.

Example:

● rwxr-xr-- means:
○ rwx for the owner.
○ r-x for the group.
○ r-- for others.
Changing Permissions and Ownership

Use chmod to change ﬁle permissions.

chown and chgrp to change ﬁle and group ownership.

Example:

● chmod 755 ﬁlename

● chown abe:developers ﬁle.txt
Special Permissions

● Setuid (chmod u+s): Executes the file with the permissions of the file owner.
● Setgid (chmod g+s): Files created in a directory inherit the group of the
directory.
● Sticky Bit (chmod +t): Only the owner can delete or modify files in a directory.

Diagram ✅
Managing Privileges with sudo

● sudo (superuser do) allows users to execute commands with root

privileges without needing to log in as the root user.
● The sudoers file (located at /etc/sudoers) defines which users and groups
have sudo access.
● To edit the sudoers file, use the visudo command to avoid syntax errors.
● Best Practice: Avoid logging in as root directly. Instead, assign specific
commands to users via sudo for security.

Commands:

● sudo command: Run a command with root privileges.

● sudo -i: Start an interactive shell with root privileges.
● sudo visudo: Edit the sudoers ﬁle safely.
Conﬁguring sudo Permissions

sudoers ﬁle syntax:

● Format: user ALL=(ALL:ALL) ALL

● You can allow users to run speciﬁc commands by specifying them.
● Example: Allow a user to only run the systemctl command:
○ abe ALL=(ALL) /bin/systemctl

Using user groups to manage sudo access:

● Adding users to the sudo group grants them root privileges:

○ usermod -aG sudo username
Understanding the "ALL" in sudoers Syntax

user ALL=(ALL:ALL) ALL

1. user: The specific user account or group you’re granting sudo permissions to.
2. The first "ALL":
○ Meaning: This means that the user can run commands on all hosts (useful in
multi-host setups). If you’re only administering a single machine, this means "all
commands on this machine."
3. The (ALL:ALL) part:
○ The first "ALL" (inside the parentheses): This represents the target user. It means
the user can execute commands as any user on the system (including root).
○ The second "ALL" (inside the parentheses): This represents the target group. It
means the user can execute commands as any group.
4. The last "ALL":
○ Meaning: This means that the user can run all commands (as opposed to
specifying particular commands).
File Systems
and Storage
Management
Understanding Linux File Systems

A ﬁle system manages how data is stored and retrieved from a disk.

Common Linux ﬁle systems:

● ext4: The most common default ﬁle system.

● XFS: Known for handling large ﬁles eﬃciently.
● Btrfs: Offers advanced features like snapshots and pooling.

Key terms: Mounting, Partitions, Swap Space.

Mounting and Unmounting File Systems

● Mounting attaches a ﬁle system to a speciﬁc directory, making it accessible.

● Commands:
a. mount /dev/sda1 /mnt: Mount a file system.
b. umount /mnt: Unmount a file system.
● Persistent mounts are configured in /etc/fstab for automatic mounting at
boot.
Linux Filesystem Hierarchy

The Linux Filesystem Hierarchy Standard (FHS) deﬁnes the directory

structure and contents.

It starts with the root / directory, which contains other key directories like
/bin, /etc, /var, etc.

Each directory has a speciﬁc purpose:

● /bin: Essential command binaries (e.g., ls, cp).

● /etc: Configuration files for the system.
● /usr: User applications and files.
● /var: Variable files like logs and databases.
Understanding Key Directories in the
Filesystem
● /root: Home directory for the root user.
● /tmp: Temporary files.
● /var/log: System log files.
● /home: Home directories for non-root users.
● /mnt: Temporary mount points.
● /opt: Optional software.
● /dev: Device files, such as hard drives and USB devices.
● /proc: Information about running processes.

Diagram ✅
Managing Swap Space

Swap ????

● Swap space is used as virtual memory when the

system runs out of physical RAM.
● To create a swap file:
○ fallocate -l 1G /swapfile
○ mkswap /swapfile
○ swapon /swapfile
● Monitor swap usage using the free command.
Monitoring Storage Usage

Common tools:

● df: Shows disk space usage.

○ df -h: Shows human-readable disk space usage.
● du: Shows directory and ﬁle sizes.
○ du -sh /var/log: Shows the size of a directory.
● Quota: Disk space management for users.

Best practice: Regularly monitor and clean up disk space to avoid downtime.
File System Troubleshooting

● fsck: A tool for checking and repairing ﬁle system errors.

a. fsck /dev/sda1: Check and repair a ﬁle system.
● Use df and du for diagnosing disk space issues.
● Mounting options: Use options like ro (read-only) or noatime to control
mount behavior.
Introduction to LVM (Logical Volume
Management)

What is LVM?

● LVM allows you to manage and resize storage dynamically.

● LVM Structure:
○ Physical Volumes (PVs): The physical disks or partitions.
○ Volume Groups (VGs): Groups of physical volumes.
○ Logical Volumes (LVs): The storage units you create and manage.
● Key beneﬁt: You can resize, add, or remove volumes without rebooting the system.

Diagram ✅
Setting Up and Managing LVM

Creating and Managing Logical Volumes

● Example commands:
○ pvcreate /dev/sda1: Create a physical volume.
○ vgcreate myvg /dev/sda1: Create a volume group.
○ lvcreate -L 10G -n mylv myvg: Create a logical volume.
○ mkfs.ext4 /dev/myvg/mylv: Format the logical volume with a ﬁle system.
○ mount /dev/myvg/mylv /mnt: Mount the logical volume.
● Resizing logical volumes:
○ lvextend -L +5G /dev/myvg/mylv: Increase the size of the logical volume.
○ resize2fs /dev/myvg/mylv: Resize the ﬁle system to match the logical
volume.
Monitoring LVM

● Use vgdisplay to show information about volume groups.

● Use lvdisplay to check logical volume details.
● Example:
○ vgdisplay myvg
○ lvdisplay /dev/myvg/mylv
● Best practice: Regularly monitor volume groups and logical volumes to
ensure they have enough space.
Summary of File Systems and Storage
Management

You should now know how to:

● Mount and unmount ﬁle systems.

● Manage swap space.
● Monitor disk usage using df and du.
● Troubleshoot ﬁle system issues with fsck.
● Set up and manage LVM for dynamic storage allocation.

Best practice: Keep storage well-monitored to prevent performance issues.

Introduction to Process Management

Understanding Processes in Linux:

● A process is a running instance of a program.

● Processes can be in the foreground (interacting with users) or background
(running without user interaction).
● Processes have different priorities, which can be adjusted with nice and
renice.
● Key Concepts:
○ Foreground vs. Background processes.
○ Process ID (PID): Every process has a unique identiﬁer.
○ Parent and child processes: Processes can create other processes.

Diagram ✅
Managing Process Priorities with nice and
renice

nice: Adjusts the priority of a process when it starts.

● Lower nice value = higher priority.

renice: Changes the priority of an already running process.

Example commands:

● nice -n 10 myscript.sh: Start a script with lower priority.

● renice -n -5 1234: Change the priority of process 1234 to a higher priority.

Diagram ✅
Monitoring Processes with top and htop

Using top and htop to Monitor Processes:

● top: Displays real-time system summary, including CPU, memory usage,

and active processes.
○ Use top to identify resource-hungry processes.
● htop: An improved, interactive version of top with a more user-friendly
interface.
● Key commands:
○ top: Start the process monitor.
○ htop: Start an interactive process monitor.

Go deep into “top” …

Diagram ✅
Top, detail …
top, go deep to resources
Managing Processes with ps and kill

Viewing and Managing Processes with ps and kill

● ps: Lists processes running on the system. Use it to get details about
processes.
○ ps aux: List all running processes with details.
● kill: Sends signals to terminate or control processes.
○ kill -9 PID: Forcefully terminate a process.
● Key Concepts:
○ PID: Process ID, used to manage speciﬁc processes.
○ Signals: Control how processes are managed, such as termination
(SIGKILL) or stopping (SIGSTOP).

Diagram ✅
Managing Services with systemd

Conﬁguring and Monitoring Services with systemd:

● systemd is the modern init system used to manage services and

processes in Linux.
● Key commands:
○ systemctl start service: Start a service.
○ systemctl stop service: Stop a service.
○ systemctl status service: Check the status of a service.
● Use journalctl to view service logs:
○ journalctl -u service: View logs for a speciﬁc service.
Advanced Monitoring with strace and lsof

● strace: Monitors system calls made by a process. Useful for

troubleshooting:
○ strace -p PID: Monitor system calls for a specific process.
● lsof: Lists open files and network connections for a process:
○ lsof -p PID: List files opened by a process.
● Use cases:
○ strace helps identify why a process is stuck or misbehaving.
○ lsof helps track what files or sockets are being used by a process.

Diagram ✅
Summary of Process Management and
Monitoring

You should now understand:

● How to manage system processes with ps, kill, and systemctl.

● How to monitor processes using top, htop, strace, and lsof.
● The importance of adjusting process priorities with nice and renice.

Best Practice: Monitor critical processes regularly and use tools like strace and
lsof during incident response to pinpoint problems.
Shell Scripting
Introduction to Shell Scripting

● A shell script is a program written for the shell (command-line interpreter)

to automate repetitive tasks.
● Bash is the most common shell used for scripting in Linux.
● Advantages: Automating routine tasks, speeding up workﬂows, and
ensuring consistency.
● Key concepts in shell scripting:
○ Variables
○ Conditionals
○ Loops
Variables in Shell Scripts

Using Variables in Shell Scripts

Variables store data that can be reused in the script.

Deﬁning a variable:

● my_var="Hello, World!"
● Accessing the variable: echo $my_var

Variables can hold strings, numbers, and even command outputs.

Example:
name="Abe"
echo "Hello, $name!"

Diagram ✅
Conditionals in Shell Scripts

Conditionals allow the script to take different actions based on conditions.

Example using if statements:

if [ $age -ge 18 ]; then
echo "You are an adult."
else
echo "You are not an adult."
fi
Understanding “if” Statements

What is an if statement?:

● An if statement is used to execute commands based on conditions.

● The basic structure is:

if [ condition ]; then
# Commands to execute if condition is true
else
# Commands to execute if condition is false
fi
Understanding “if” Statements

Common conditions:
● Check if a ﬁle exists: [ -e /path/to/ﬁle ]
● Compare numbers: [ $var -eq 5 ]
● String comparisons: [ "$var" = "Hello" ]

Real-world Example:
● A script that checks if a directory exists before creating it:

if [ -d "/backup" ]; then
echo "Backup directory exists."
else
mkdir /backup
echo "Backup directory created."
fi
Loops in Shell Scripts
Understanding “for” Loops in Shell Scripting

What is a for loop?:

● A for loop iterates over a list of items and executes commands for each
item.
● The basic structure is:

for item in list; do

# Commands to execute for each item
done
Understanding “for” Loops in Shell Scripting

Real-world Example:

● A script that processes all .log ﬁles in a directory:

for file in /var/log/*.log; do

gzip "$file"
echo "$file has been compressed."
done

This loop iterates over each .log ﬁle in the /var/log directory and compresses it.
Understanding “while” Loops in Shell Scripting

What is a while loop?:

● A while loop repeats commands as long as a condition is true.

● The basic structure is:

while [ condition ]; do
# Commands to execute
done
Understanding “while” Loops in Shell Scripting

Real-world Example:

● A script that pings a server until it responds:

counter=1
while [ $counter -le 5 ]; do
echo "Counter: $counter"
((counter++))
done
Understanding “while” Loops in Shell Scripting

Another Real-world Example:

● A script that pings a server until it responds:

while ! ping -c 1 google.com &> /dev/null; do

echo "Waiting for the server to respond..."
sleep 5
done
echo "Server is up!"

This script repeatedly pings google.com until a response is received.

Automating Tasks with Shell Scripts

● Examples of tasks that can be automated:

○ Backups: Automate backing up important ﬁles or directories.
○ Log Rotation: Rotate logs periodically to prevent large log ﬁles.
○ System Monitoring: Automate resource monitoring and alerting.
● Example: Automating a backup task using tar:

tar -czf /backup/home_backup.tar.gz /home/user

● Use cron to schedule this script daily.

Scheduling Jobs with Cron

What is cron?:

● cron is a time-based job scheduler in Unix-like systems. It allows you to

schedule scripts or commands to run at speciﬁc intervals (e.g., hourly, daily).
● The cron daemon (crond) runs in the background and checks for scheduled
tasks.

What is crontab?:

● crontab is the cron table where you deﬁne the schedule for cron jobs.
● crontab ﬁle contains a list of cron jobs and their schedules for a user or
system.
Scheduling Jobs with Cron

● User crontabs: Each user can have their own crontab ﬁle, edited with
crontab -e. These are stored in /var/spool/cron/crontabs (exact location
may vary depending on the distribution).
● System-wide crontab: Located at /etc/crontab, this ﬁle is used for
scheduling system-wide tasks.
● Other cron directories:
○ /etc/cron.hourly, /etc/cron.daily, /etc/cron.weekly, and
/etc/cron.monthly allow for scheduling scripts to run at hourly, daily,
weekly, or monthly intervals.

Key Differences between cron and crontab:

● cron refers to the background daemon that runs scheduled tasks.

● crontab is the ﬁle (or command) where the scheduling of tasks is deﬁned.
Scheduling Jobs with Cron

Crontab Syntax:
● * * * * * /path/to/script.sh
○ Minute (0-59)
○ Hour (0-23)
○ Day of the Month (1-31)
○ Month (1-12)
○ Day of the Week (0-7, where 0 and 7 are both Sunday)
● Example: Run a backup script daily at midnight:
○ 0 0 * * * /home/user/backup.sh

Managing crontab:
● Edit crontab: crontab -e
● View crontab: crontab -l
● Remove crontab: crontab -r
Scheduling Jobs with Cron

Each user has their own crontab ﬁle, where scheduled jobs are deﬁned.

Crontab format:

● * * * * * /path/to/script.sh: Runs the script at a speciﬁc interval (e.g., every

minute).

Example to run a backup script daily at midnight:

0 0 * * * /home/user/backup.sh
Yeah … still Cron … but last one!

1. How cron Works:

○ The cron daemon (crond) reads all crontab ﬁles (user-speciﬁc and
system-wide) and cron directories.
○ It checks for tasks that match the current time and executes them.
2. User vs. System-wide Crontab:

○ User crontabs are speciﬁc to each user and are edited using crontab -e. These
tasks run with the user's permissions.
○ System-wide crontab (/etc/crontab) can include tasks that affect the whole
system and specify the user who should run the command.
3. Crontab Management:

○ Using crontab -e to edit the crontab file is safer than manually editing the file in
/var/spool/cron/crontabs.
○ The cron directories (/etc/cron.*) are often used for simple scripts that should
run on a regular basis without needing to edit the crontab file directly.
I lied … this is the last one :)

Recap the key points:

● cron is your go-to for automating tasks.

● Use crontab to deﬁne when and what tasks to run.
● Keep an eye on log ﬁles to ensure everything runs smoothly.

Reminder:

● Automate those repetitive tasks and make your life easier—just set it and
let cron handle the rest!
Scheduling One-Time Jobs with “at”

at schedules a one-time job to run in the future.

Example:

at 3:00 PM tomorrow
at> /path/to/script.sh

Use cases: Running tasks later without having them repeat, e.g., restarting a server, running maintenance scripts.

Use atq to view pending jobs and atrm to remove them.

System
Performance
Tuning
Introduction to System Performance Tuning

To understand how to monitor and optimize system performance for reliability.

Why It Matters:

● Proactive monitoring helps detect issues before they become critical.

● Optimization ensures eﬃcient use of resources, reducing costs and
increasing system stability.

Key Areas to Focus On:

● CPU Utilization
● Memory Usage
● Disk I/O
● System Limits
Monitoring CPU and Memory with “top” and “free”

top:

● Provides a real-time overview of CPU and memory usage.

● Key metrics to monitor:
○ %CPU: CPU usage of each process.
○ %MEM: Memory usage of each process.
○ Load Average: Represents the system load over 1, 5, and 15 minutes.
● Example: Use top to identify processes that are consuming the most
resources.

free:

● Displays total, used, free, and available RAM and swap space.
● Example: free -h gives a human-readable summary of memory usage.
Monitoring Disk I/O with “iostat”

Using iostat for Disk I/O Monitoring

iostat:

● Part of the sysstat package, it helps monitor disk I/O and CPU
performance.
● Provides statistics on disk reads/writes and CPU load.
● Example: iostat -x 5 provides extended I/O stats every 5 seconds.

Key Metrics:

● tps: Transactions per second.

● kB_read/s and kB_wrtn/s: Amount of data read and written per second.
● %util: Percentage of time the disk is busy. High values may indicate a
bottleneck.
System Resource Monitoring with “vmstat”
and “sar”
Using vmstat and sar for System Monitoring

vmstat:

● Reports information about processes, memory, paging, block I/O, and CPU
activity.
● Example: vmstat 5 displays system stats every 5 seconds.
● Key metrics:
○ r: Number of runnable processes (CPU queue length).
○ si/so: Swap-in and swap-out rates.
○ us/sy/id: CPU time spent in user/system/idle.
System Resource Monitoring with “vmstat”
and “sar”

Using vmstat and sar for System Monitoring

sar:

● Part of the sysstat package, it provides historical system performance

data.
● Example: sar -u 5 10 reports CPU usage every 5 seconds for 10 intervals.
● Historical analysis: Compare past performance trends to current data.
Understanding Load Averages and CPU
Utilization

Interpreting Load Averages and CPU Utilization

Load Average:

● Represents the average number of processes waiting for CPU time.

● Example Output: 0.50, 0.75, 1.25 for 1, 5, and 15 minutes.
● A load average of 1.0 means 1 process is waiting for CPU time on a
single-core system.
● For multi-core systems, divide by the number of cores to assess load.
Understanding Load Averages and CPU
Utilization

CPU Utilization:

● %us: User space processes.

● %sy: System/kernel processes.
● %wa: Time CPU spends waiting for I/O operations.
● %id: Idle time (low values can indicate a busy system).

Example: Compare load averages to CPU utilization to determine if the system

is CPU-bound.

Diagram ✅
Understanding ulimit

What is ulimit?
● ulimit is a shell command that allows you to control user-level resource limits on a
Linux system.
● These limits are essential for preventing resource exhaustion, such as excessive
CPU usage or too many open ﬁles, which can degrade system performance.

Why It Matters for SREs:

● Proper use of ulimit helps maintain system stability by capping resource usage for
user processes.
● Helps avoid scenarios where a misbehaving application consumes all system
resources, leading to a Denial of Service (DoS).

Basically it shows the max size/number of buffer size, core ﬁles, scheduling priority, ﬁle
locks, threads ...
Understanding ulimit

Commonly Adjusted ulimit Parameters:

● ulimit -n: Sets the maximum number of open ﬁle descriptors.

○ Example: ulimit -n 65535 increases the maximum number of open ﬁles.
○ Relevance: Increasing this limit is critical for high-traﬃc servers that handle many
simultaneous connections (e.g., web servers, databases).

● ulimit -u: Sets the maximum number of user processes.

○ Example: ulimit -u 2048 limits the user to 2048 processes.
○ Relevance: Prevents a user or application from creating too many processes, which
could overwhelm the system.

● ulimit -c: Controls the core dump size for debugging.

○ Example: ulimit -c unlimited allows core dumps of any size.
○ Relevance: Useful for troubleshooting application crashes by analyzing core dumps.
Understanding ulimit

Hands-On Example: Adjust the open ﬁle limit and apply it:

ulimit -n 10240 # Set maximum open files to 10,240

Pro Tip: Make changes permanent by editing conﬁguration ﬁles, like

/etc/security/limits.conf.
Understanding sysctl
What is sysctl?

● sysctl is a Linux tool that allows you to modify kernel parameters at

runtime.
● It is often used to tune network, memory, and security settings.
● Why It Matters for SREs:
○ Helps optimize kernel settings for high performance and low latency
environments.
○ Allows you to make ﬁne-tuned adjustments to the system to handle
production workloads.
Understanding sysctl

Commonly Adjusted sysctl Parameters:

● net.core.somaxconn: Sets the maximum number of queued connections.

○ Example: sysctl -w net.core.somaxconn=1024
○ Relevance: Important for web servers to handle a high number of incoming
connections without dropping packets.

● vm.swappiness: Controls swap usage.

○ Example: sysctl -w vm.swappiness=10
○ Relevance: Lowering this value reduces the system's tendency to swap memory,
which can improve performance for memory-intensive applications.

● fs.file-max: Sets the maximum number of file handles the kernel can allocate.
○ Example: sysctl -w fs.file-max=100000
○ Relevance: Essential for applications that need to open many files simultaneously,
like large databases or logging systems.
Understanding sysctl

Making sysctl Changes Persistent:

● Edit the ﬁle /etc/sysctl.conf and add the desired parameters:

net.core.somaxconn = 1024
vm.swappiness = 10
fs.file-max = 100000

● Apply changes with:

sysctl -p

Pro Tip: Always test sysctl changes in a staging environment before applying
them in production.
Best Practices for Using ulimit and sysctl in Production

Understand the Impact:

● Improperly conﬁguring ulimit or sysctl can negatively impact system stability.
● Always research the effects of each parameter before applying it.
Test in Staging:
● Test adjustments in a staging environment that mirrors production.
● Monitor for performance improvements and potential side effects.

Document Your Changes:

● Record any changes made to ulimit and sysctl settings.
● Keep notes on why the change was made and how it impacted performance.

Monitor After Applying Changes:

● Use tools like top, sar, and vmstat to monitor system resource usage after making
changes.
● Look for improvements in CPU utilization, memory usage, and network performance.
Optimizing System Performance with
“ulimit” and “sysctl”

Advanced Tuning with ulimit and sysctl for SREs

Understand how to use ulimit and sysctl to optimize system performance for
production environments.

But what is ulimit and sysctl ???

Troubleshooting
and Log
Management
Introduction to Troubleshooting and Log Management

Importance for SREs:

● Logs provide a record of system activities and errors, making them crucial
for diagnosing issues.
● Proactive log monitoring helps prevent issues from escalating into critical
incidents.

Focus Areas:

● Navigating logs in /var/log/.

● Using journalctl for systemd logs.
● Investigating boot failures, disk space issues, and system crashes.
● Diagnosing network issues with essential tools.
Navigating Logs in /var/log/

What is /var/log/?

● Directory where system logs are stored on Linux.

● Contains logs for system events, authentication, application errors, and more.

Key Log Files:

● /var/log/messages: General system logs (non-systemd).

● /var/log/syslog: System messages and logs from various services.
● /var/log/auth.log: Authentication and login attempts.
● /var/log/dmesg: Kernel ring buffer logs (hardware and boot messages).
Navigating Logs in /var/log/

Example:

● Use tail -f /var/log/syslog to monitor logs in real-time.

● Use grep to search for speciﬁc errors:

grep -i "error" /var/log/syslog

Using journalctl for Systemd Logs

What is journalctl?

● A command for querying and displaying logs managed by systemd's

journald.
● Allows ﬁltering logs by service, priority, date, and boot sessions.

Key Commands:

● View all logs: journalctl -xe (shows logs with extra detail).
● Filter logs by time: journalctl --since "2023-10-01" --until "2023-10-02"
● View logs for a speciﬁc service: journalctl -u nginx
● View logs from the previous boot: journalctl -b -1
Using journalctl for Systemd Logs

Example:

● Use journalctl -u sshd to troubleshoot SSH login issues.

● Filter logs for critical errors:

journalctl -p crit
Managing Logs with logrotate

Automating Log Management with logrotate

What is logrotate?
● A tool that automatically rotates, compresses, and deletes log files based on specified
criteria.
● Helps prevent log files from consuming too much disk space over time.
● Typically used for logs in the /var/log/ directory but can be configured for any log file.

Key Features:
● Rotation: Renames old log files and creates new ones (e.g., syslog becomes syslog.1).
● Compression: Compresses old logs to save space (e.g., .gz format).
● Retention: Keeps a specified number of old log files before deleting them.
● Custom Schedules: Rotate logs daily, weekly, monthly, or based on file size.
Managing Logs with logrotate

Configuration:
● Default configuration is in /etc/logrotate.conf.
● Custom configurations for specific services can be placed in
/etc/logrotate.d/.

Rotate a custom log ﬁle daily and keep 7 compressed backups:

/var/log/myapp.log {
Explanation:
● daily: Rotate logs every day. daily
● rotate 7: Keep 7 copies of old logs. rotate 7
● compress: Compress old logs. compress
● missingok: Skip rotation if the log ﬁle is missing. missingok
● notifempty: Don’t rotate if the log ﬁle is empty. notifempty
create 0640 root root
}
Best Practices for Using logrotate

Keep Logs Manageable:

● Rotate logs daily for high-activity logs (e.g., web server logs).
● Use weekly or monthly rotation for less active logs.

Use Compression:
● Compressing logs saves disk space, especially for logs that contain a lot of text data.
● Use compress in the conﬁguration to automatically gzip old logs.

Adjust Retention Based on Needs:

● For compliance: Retain logs for longer periods (e.g., rotate 30 for a month).
● For space management: Retain fewer logs to prevent disks from ﬁlling up.

Monitor Log Rotation:

● Review /var/lib/logrotate/status to see the status of rotated logs.
● Check logs for logrotate activity in /var/log/cron or /var/log/messages.
Investigating Common Issues

Boot Failures:

● Use dmesg and journalctl to check kernel logs for errors during boot.
● Look for error messages related to hardware or missing ﬁles.
● Example: journalctl -b to see logs from the latest boot.

Disk Space Issues:

● Use df -h to check available disk space:

○ Identify which partitions are filling up.
● Use du -sh /path to find large files or directories.

System Crashes:

● Check for OOM (Out of Memory) errors using dmesg or journalctl.

● Look in /var/log/messages or /var/log/syslog for panic or segfault entries.
Diagnosing Network-Related Problems

Essential Tools for Network Troubleshooting:

● ping: Test connectivity to a host.

○ Example: ping google.com to check internet connectivity.
● traceroute: Identify the path packets take to reach a host.
○ Example: traceroute 8.8.8.8 to see the route to Google's DNS server.
● netstat: Show network connections, routing tables, and interface statistics.
○ Example: netstat -tuln to display listening ports.
Diagnosing Network-Related Problems

Real-World Use Case:

● Use ping to check if a server is reachable during an incident.

● Use traceroute to identify where packets are getting delayed or dropped.
● Use netstat to ﬁnd which processes are using network ports, useful for
identifying rogue services.

Hands-On Example:

● Run traceroute and interpret the results to identify network bottlenecks.

● Use netstat to ﬁnd and terminate a problematic process:

netstat -tuln | grep ":80"

Real-World Log Analysis Scenario

Scenario: A web server is slow to respond. How do you diagnose the issue?

Step-by-step Analysis:

1. Check Web Server Logs: journalctl -u nginx for errors.

2. Check System Resource Usage: top and free -h to see if the system is under
stress.
3. Check Disk Space: df -h to ensure logs are not ﬁlling up the disk.
4. Check Network Activity: netstat to see if unusual connections are affecting
the server.

Outcome: Identify and resolve a misconﬁgured ﬁrewall that was slowing down the
server's response time.
Yes, …You did it!

Beautifully Deranged - Nova Black
No ratings yet
Beautifully Deranged - Nova Black
177 pages
LinuxAdministration Unlocked
No ratings yet
LinuxAdministration Unlocked
218 pages
Pages From 150 5300 13B Airport Design Taxiway Design
No ratings yet
Pages From 150 5300 13B Airport Design Taxiway Design
45 pages
Saep 394
No ratings yet
Saep 394
9 pages
Jacques Alain Miller Marginalia
100% (2)
Jacques Alain Miller Marginalia
22 pages
Grinding (Lecture 3)
No ratings yet
Grinding (Lecture 3)
27 pages
Laws of Physics
67% (3)
Laws of Physics
47 pages
Linux Commands Shell
No ratings yet
Linux Commands Shell
54 pages
Linux Shell Scripting
No ratings yet
Linux Shell Scripting
95 pages
Linux Notes
No ratings yet
Linux Notes
11 pages
Linux Chapter - Course Presentation
No ratings yet
Linux Chapter - Course Presentation
105 pages
Fransiskus Daud Try Surya A Bahasa Inggris PTK PPG DALJAB 2
No ratings yet
Fransiskus Daud Try Surya A Bahasa Inggris PTK PPG DALJAB 2
47 pages
GOOD Morning Wood
No ratings yet
GOOD Morning Wood
10 pages
1-Shell Scripting
No ratings yet
1-Shell Scripting
30 pages
Internship Report On Branding & Promotional Strategies of Ceylon Biscuits Bangladesh (PVT.) Limited - IUBAT - Sahadat Hossain
85% (13)
Internship Report On Branding & Promotional Strategies of Ceylon Biscuits Bangladesh (PVT.) Limited - IUBAT - Sahadat Hossain
92 pages
Water Management Plan
No ratings yet
Water Management Plan
110 pages
Linux Installation Configuration and Command Line Basics 1st Edition Nathan Clark PDF Download
No ratings yet
Linux Installation Configuration and Command Line Basics 1st Edition Nathan Clark PDF Download
70 pages
Linux Simple Notes
No ratings yet
Linux Simple Notes
24 pages
Startup and Shutdown PDF
No ratings yet
Startup and Shutdown PDF
16 pages
Parameters of Automated Cell Counter Automation in Hematology Laboratory and CBC Via Automated Blood Analyzer
100% (1)
Parameters of Automated Cell Counter Automation in Hematology Laboratory and CBC Via Automated Blood Analyzer
40 pages
Lecture 2
No ratings yet
Lecture 2
17 pages
Conf TOW2019 Seminar Horsley Sem1
No ratings yet
Conf TOW2019 Seminar Horsley Sem1
88 pages
Unit 6. Linux Operating System
No ratings yet
Unit 6. Linux Operating System
92 pages
Hafeez Contractor
No ratings yet
Hafeez Contractor
10 pages
Liberty Tax (TY2023) Textbook ch3 Filing Status
No ratings yet
Liberty Tax (TY2023) Textbook ch3 Filing Status
22 pages
Brain Herniation PDF
No ratings yet
Brain Herniation PDF
5 pages
Section 1
No ratings yet
Section 1
32 pages
Commodore Amiga 3000 Hardware Technical Notes Revision 2 (1998-07) (Tsang, Calum)
No ratings yet
Commodore Amiga 3000 Hardware Technical Notes Revision 2 (1998-07) (Tsang, Calum)
26 pages
Linux
No ratings yet
Linux
13 pages
LINUX COMMAND LINE An Introduction To Linux Command Line Environment
No ratings yet
LINUX COMMAND LINE An Introduction To Linux Command Line Environment
174 pages
Linux Runlevels StartupScripts
No ratings yet
Linux Runlevels StartupScripts
31 pages
Acute GlomeruloNephritis - AGN
No ratings yet
Acute GlomeruloNephritis - AGN
36 pages
Industrial Important Question Answer Linux & AWS Cloud
No ratings yet
Industrial Important Question Answer Linux & AWS Cloud
60 pages
Food Pyramid
No ratings yet
Food Pyramid
21 pages
Linux Basics - Linux Guide To Learn Linux C - Steven Landy
No ratings yet
Linux Basics - Linux Guide To Learn Linux C - Steven Landy
81 pages
Linux Basics Training
No ratings yet
Linux Basics Training
28 pages
Embedded Linux To Start With
100% (1)
Embedded Linux To Start With
70 pages
COMP 201 OpenSource-L5-Linux SysAdm
No ratings yet
COMP 201 OpenSource-L5-Linux SysAdm
77 pages
Ubuntu 14.04 - Installation of Ubuntu 12.04
No ratings yet
Ubuntu 14.04 - Installation of Ubuntu 12.04
9 pages
Unit 1 - Task 3 - Challenge Yourself Test - Evaluation Quiz - Revisión Del Intento2
No ratings yet
Unit 1 - Task 3 - Challenge Yourself Test - Evaluation Quiz - Revisión Del Intento2
11 pages
Linux Chapter 6
No ratings yet
Linux Chapter 6
45 pages
Redhat Mod-1&2
No ratings yet
Redhat Mod-1&2
10 pages
FUSE: A Microservice Approach To Cross-Domain Federation Using Docker Containers
No ratings yet
FUSE: A Microservice Approach To Cross-Domain Federation Using Docker Containers
10 pages
LINUX
No ratings yet
LINUX
23 pages
Linux Notes
No ratings yet
Linux Notes
13 pages
Heading / Description: 1 - Poultry Shed/Duck House 78.EW.2.1 - Earthwork in Excavation by Manual Means LIFT UP TO 1.5 METRE in
No ratings yet
Heading / Description: 1 - Poultry Shed/Duck House 78.EW.2.1 - Earthwork in Excavation by Manual Means LIFT UP TO 1.5 METRE in
10 pages
Sarbananda Sonowal
No ratings yet
Sarbananda Sonowal
9 pages
Linux OS Day 1 & 2
No ratings yet
Linux OS Day 1 & 2
169 pages
Unix
100% (1)
Unix
5 pages
Linux
No ratings yet
Linux
112 pages
HO3 Long 5 Pages
No ratings yet
HO3 Long 5 Pages
5 pages
C-Programming Part1 Upto Fucntions
No ratings yet
C-Programming Part1 Upto Fucntions
261 pages
Rhcsa 1
100% (2)
Rhcsa 1
152 pages
Lecture 3
No ratings yet
Lecture 3
10 pages
OS Lab Handout-Merged
No ratings yet
OS Lab Handout-Merged
128 pages
Get To Know Linux: The /etc/init.d Directory
No ratings yet
Get To Know Linux: The /etc/init.d Directory
2 pages
9.controlling Services and Daemons
No ratings yet
9.controlling Services and Daemons
38 pages
Linux Admin Presentation
No ratings yet
Linux Admin Presentation
34 pages
Linux Intro
No ratings yet
Linux Intro
82 pages
ICTNWK559 Assessment Task 1
No ratings yet
ICTNWK559 Assessment Task 1
15 pages
Boot - Process, Systemd - RUN Levels
No ratings yet
Boot - Process, Systemd - RUN Levels
18 pages
Shell
No ratings yet
Shell
69 pages
CSC329 Introduction To Linux Administration
No ratings yet
CSC329 Introduction To Linux Administration
22 pages
Linux 2
No ratings yet
Linux 2
22 pages
FiNC 401 Exam 2018
No ratings yet
FiNC 401 Exam 2018
7 pages
NAGARAJ CV 2024 - May
No ratings yet
NAGARAJ CV 2024 - May
3 pages
50-Gerund Infinitive Test 1
No ratings yet
50-Gerund Infinitive Test 1
3 pages
Linux Essentials - Introduction To Linux NTI
No ratings yet
Linux Essentials - Introduction To Linux NTI
24 pages
Linux Lecture5
No ratings yet
Linux Lecture5
15 pages
Linux Day2
No ratings yet
Linux Day2
18 pages
CH1 SNA Lecture
No ratings yet
CH1 SNA Lecture
79 pages
Basic Shell Commands in Linux
No ratings yet
Basic Shell Commands in Linux
53 pages
Operating Systems and Linux II
No ratings yet
Operating Systems and Linux II
25 pages
Linux Basic Commands 1748453726
No ratings yet
Linux Basic Commands 1748453726
8 pages
Linux L1
No ratings yet
Linux L1
13 pages
Mukta
No ratings yet
Mukta
1 page
Linux Free
No ratings yet
Linux Free
20 pages
Systemd Rajesh
No ratings yet
Systemd Rajesh
31 pages
6 Stages of Linux Boot Process
100% (1)
6 Stages of Linux Boot Process
9 pages
Intro To Linux - For Training - Odp
No ratings yet
Intro To Linux - For Training - Odp
62 pages
6 Stages of Linux Boot Process (Startup Sequence)
No ratings yet
6 Stages of Linux Boot Process (Startup Sequence)
11 pages
Senarai Skim Pelaburan Yang Diharamkan Bank Negara
No ratings yet
Senarai Skim Pelaburan Yang Diharamkan Bank Negara
2 pages
29 (Number) - Wikipedia
No ratings yet
29 (Number) - Wikipedia
1 page
The Linux Boot Sequence: Administrator's Guide and From Chap. 6 of Linux Unleashed
No ratings yet
The Linux Boot Sequence: Administrator's Guide and From Chap. 6 of Linux Unleashed
7 pages
Linux Fundamentals
No ratings yet
Linux Fundamentals
33 pages
List of NBFC Excel
No ratings yet
List of NBFC Excel
7 pages
Startup and Shut Down
No ratings yet
Startup and Shut Down
18 pages
Linux Unit:1 Red Hat Supported Software's:: Language Support and Internationalization
No ratings yet
Linux Unit:1 Red Hat Supported Software's:: Language Support and Internationalization
12 pages
Linux for Beginners: Introduction to Linux Operating System and Essential Command Lines: Computer Programming
From Everand
Linux for Beginners: Introduction to Linux Operating System and Essential Command Lines: Computer Programming
Isaak Seel
3.5/5 (5)
Linux for Beginners: Linux Command Line, Linux Programming and Linux Operating System
From Everand
Linux for Beginners: Linux Command Line, Linux Programming and Linux Operating System
Steve Will
4.5/5 (3)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.