Network Management
Network Management
Management
Introduction to Networking
Monitoring and Management
These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license
(http://creativecommons.org/licenses/by-nc/3.0/) as part of the ICANN, ISOC and NSRC Registry Operations Curriculum.
Part I: Overview
Core concepts presented:
What is network monitoring
What is network management
Getting started
Why network management
The big three
Attack detection
Documentation
Consolidating the data
The big picture
Available, reachable
Resources
Performance
Round-trip-time, throughput
Detection of issues,
Expectations
A network in operation needs to be monitored
in order to:
- Deliver
- Whats
Uptime Expectations
What does it take to deliver 99.9 % uptime?
30.5 days x 24 hours = 732 hours a month
(732 (732 x .999)) x 60 = 44 minutes
only 44 minutes of downtime a month!
Baselining
What is normal for your network?
If youve never measured or monitored your
network you will need to know things like:
Typical load on links ( Cacti)
Level of jitter between endpoints ( Smokeping)
Typical percent usage of resources
Typical amounts of noise:
Network scans
Dropped data
Reported errors or failures
Trends
Reliability
Smokeping Connection health, rtt, service
response time, latency
Performance
Cacti
Attack Detection
Trends and automation allow you to know
when you are under attack.
The tools in use can help you to mitigate
attacks:
Network diagrams
Network description
- Monitoring
- Data collection
- Accounting
Ticket
Ticket
Ticket
- Improvements
- Upgrades
Ticket
Ticket
- User complaints
- Requests
- Fix problems
- Capacity planning
- Availability (SLAs)
- Trends
- Detect problems
Cricket
IFPFM
flowc
mrtg*
NetFlow*
NfSen*
ntop
perfSONAR
pmacct
rrdtool*
SmokePing*
Ticketing
RT*
Trac*
Redmine
Change Mgmt
Mercurial
Rancid* (routers)
CVS*
Subversion*
git*
Security/NIDS
Nessus
OSSEC
Prelude
Samhain
SNORT
Untangle
Logging
swatch*
syslog/rsyslog*
tenshi*
Net Management
Big Brother
Big Sister
Cacti*
Hyperic
Munin
Nagios*
OpenNMS*
Sysmon
Zabbix
Documentation
IPplan
Netdisco
Netdot*
Rack Table
Protocols/Utilities
SNMP*, Perl, ping
Questions?
Questions?
Diagnostic tools
Monitoring tools
Performance tools
Active and passive tools
SNMP
Ticket systems
Configuration and change management
2.
Passive tools
-
Automated tools
-
-
-
-
DNS/Web/Email
Radius/LDAP/SQL
SSH to routers
-
-
SNMP tools
Net SNMP tool set
- http://net-snmp.sourceforge.net/
Etc
Flowtools, flowc
NFSen
Monitoring!
Customer complaints
Ticketing systems
Why are they important?
-
customer complaints
Ticketing systems
Use ticket system to follow each case,
including internal communication between
technicians
Each case is assigned a case number
Each case goes through a similar life cycle:
- New
- Open
- ...
- Resolved
- Closed
Ticketing systems
Workflow:
Ticket System
Helpdesk
Tech
Eqpt
---------------------------------------------------------------T
T
T
T
query
|
|
|
|
from ---->|
|
|
|
customer
|--- request --->|
|
|
<- ack. -- |
|
|
|
|
|<-- comm -->
|
|
|
|
|- fix issue -> eqpt
|
|<- report fix -|
|
customer <-|<-- respond ----|
|
|
|
|
|
|
trac
-
-
-
redmine
-
A few tools:
-
As simple as:
cp named.conf named.conf.20070827-01
For routers:
- RANCID
Trac
Redmine
And, many other wiki products. Excellent for
documenting your network.
- Monitoring
- Data collection
- Accounting
Ticket
Ticket
Ticket
- Improvements
- Upgrades
Ticket
Ticket
- User complaints
- Requests
- Fix problems
- Capacity planning
- Availability (SLAs)
- Trends
- Detect problems
Questions