...
Name | Collection location | Parameters | Scope of coverage | Unit(s) of measure | Definition | Method of Measurement | Sources of Error | Comments |
---|---|---|---|---|---|---|---|---|
cpu_idle | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | Time the host CPU spends idle. | see CPU Utilization above, and section 6 of TST008 | |||
cpu_nice | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | Time the host CPU spent running user space processes that have been niced. The priority level a user space process can be tweaked by adjusting its niceness. | see CPU Utilization above, and section 6 of TST008 | |||
cpu_interrupt | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | Time the CPU has spent servicing (hardware) interrupts. | see CPU Utilization above, and section 6 of TST008 | |||
cpu_softirq | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | Time spent handling interrupts that are synthesized, and almost as important as Hardware interrupts (above). "In current kernels there are ten softirq vectors defined; two for tasklet processing, two for networking, two for the block layer, two for timers, and one each for the scheduler and read-copy-update processing. The kernel maintains a per-CPU bitmask indicating which softirqs need processing at any given time." [Ref] | see CPU Utilization above, and section 6 of TST008 | |||
cpu_steal | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | CPU steal is a measure of the fraction of time that a machine is in a state of “involuntary wait.” It is time for which the kernel cannot otherwise account in one of the traditional classifications like user, system, or idle. It is time that went missing, from the perspective of the kernel. | see CPU Utilization above, and section 6 of TST008 | |||
cpu_system | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | Time that the CPU spent running the kernel. | see CPU Utilization above, and section 6 of TST008 | |||
cpu_user | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | Time CPU spends running un-niced user space processes.
| see CPU Utilization above, and section 6 of TST008 | |||
cpu_wait | Host | The host CPUs, individually or total usage summed across all CPUs | nanoseconds or percentage | The time the CPU spends idle while waiting for an I/O operation to complete | see CPU Utilization above, and section 6 of TST008 | |||
total_vcpu_utilization | Host | The host CPUs used by a guest, total usage summed across all CPUs | nanoseconds or percentage | The total utilization summed across all execution contexts (except Idle) and all CPUs in Scope. | see CPU Utilization above, and section 6 of TST008 |
Networking
[TODO] Add a note on the vSwitch and add vSwitch specific metrics
At a minimum the following metrics should be collected for a Networking interface:
- Total Packets received and transmitted
- Total Octets (TX and RX)
- Dropped packets (TX and RX)
- Error frames (TX and RX) [TODO: Break this down further – just tried to do that...]
- Frame Check Sequence Errors or CRC Errors
- Runts (frames <64 octets in length)
- Giants (frames >6000 octets in length)
- Broadcast Packets (TX and RX)
- Multicast Packets (TX and RX)
Other Metrics that should be collected for a Networking interface (if possible):
- Average bitrate
- Average latency
Name | Collection location | Parameters | Scope of coverage | Unit(s) of measure | Definition | Method of Measurement | Sources of Error | Comments | ||
---|---|---|---|---|---|---|---|---|---|---|
Total Packets received | see section 7 of TST008 | |||||||||
Total Packets transmitted | see section 7 of TST008 | |||||||||
Total Octets received | see section 7 of TST008 | |||||||||
Total Octets transmitted | see section 7 of TST008 | |||||||||
Total Error frames received | see section 7 of TST008 | |||||||||
Total Errors when attempting to transmit a frame | ||||||||||
Name | Collection location | Parameters | Scope of coverage | Unit(s) of measure | Definition | Method of Measurement | Sources of Error | Comments | ||
see section 7 of TST008 | ||||||||||
Total Broadcast Packets transmitted | see section 7 of TST008 | |||||||||
Multicast Packet | Total Octets received | see section 7 of TST008 | ||||||||
Average bitrate | Total Octets transmitted | |||||||||
Average latency | see section 7 of TST008 | Total Error frames received |
| |||||||
RX Packets dropped | see section 7 of TST008 | Total Errors when attempting to transmit a frame | ||||||||
TX packets dropped | see section 7 of TST008 | Broadcast Packets |
Where possible the metrics, events and information should be supported for the following Networking MIBs
MIB Name | RFC | Description | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Multicast Packet | |||||||||||
Average bitrate | |||||||||||
IF-MIB | RFC2863 | Network interface sub-layers | |||||||||
EtherLike-MIB | RFC3635 | Ethernet like network interfaces | |||||||||
IP-MIB | RFC4293 | IP and ICMP without routing info | |||||||||
IP-FORWARD_MIB | RFC4292 | CIDR multipath IP routes | |||||||||
TCP-MIB | RFC4022 | TCP stack counters and info | |||||||||
UDP-MIB | RFC4133 | UDP counters and info | |||||||||
IPV6 MIBs | RFC2465 RFC2466 RFC2452 RFC2454 | IPv6 equivalents
| |||||||||
SCTP-MIB | RFC3873 | SCTP protocol | |||||||||
UCD-IPFWACC-MIB |
| IP firewall accounting firewall rules | Average latency |
Storage
Disk Utilization
Name | Collection location | Parameters | Scope of coverage | Unit(s) of measure | Definition | Method of Measurement | Sources of Error | Comments |
---|---|---|---|---|---|---|---|---|
...