...
Where collectd is running | Plugin | Type | Type Instance | Description | comment |
---|---|---|---|---|---|
Host/guest | CPU | percent/nanoseconds | idle | Time CPU spends idle. | Can be per cpu/aggregate across all the cpus. For more info, please see: http://man7.org/linux/man-pages/man1/top.1.html
http://blog.scoutapp.com/articles/2015/02/24/understanding-linuxs-cpu-stats |
percent/nanoseconds | nice | Time the CPU spent running user space processes that have been niced. The priority level a user space process can be tweaked by adjusting its niceness. | |||
percent/nanoseconds | interrupt | Time the CPU has spent servicing interrupts. | |||
percent/nanoseconds | softirq | (apparently) Time spent handling interrupts that are almost as important as Hardware interrupts (above). "In current kernels there are ten softirq vectors defined; two for tasklet processing, two for networking, two for the block layer, two for timers, and one each for the scheduler and read-copy-update processing. The kernel maintains a per-CPU bitmask indicating which softirqs need processing at any given time." [Ref] | |||
percent/nanoseconds | steal | CPU steal is a measure of the fraction of time that a machine is in a state of “involuntary wait.” It is time for which the kernel cannot otherwise account in one of the traditional classifications like user, system, or idle. It is time that went missing, from the perspective of the kernel. http://www.stackdriver.com/understanding-cpu-steal-experiment/ | |||
percent/nanoseconds | system | Time that the CPU spent running the kernel. | |||
percent/nanoseconds | user | Time CPU spends running un-niced user space processes. | |||
percent/nanoseconds | wait | The time the CPU spends idle while waiting for an I/O operation to complete | |||
Interface | if_dropped | in | The total number of received dropped packets. | http://www.onlamp.com/pub/a/linux/2000/11/16/LinuxAdmin.html | |
if_errors | in | The total number of received error packets. | |||
if_octets | in | The total number of received bytes. | |||
if_packets | in | The total number of received packets. | |||
if_dropped | out | The total number of transmit packets dropped | |||
if_errors | out | The total number of transmit error packets. | |||
if_octets | out | The total number of bytes transmitted | |||
if_packets | out | The total number of transmitted packets | |||
Memory | memory | buffered | The amount, in kibibytes, of temporary storage for raw disk blocks. | https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Deployment_Guide/s2-proc-meminfo.html | |
memory | cached | The amount of physical RAM, in kibibytes, left unused by the system. | |||
memory | free | The amount of physical RAM, in kibibytes, left unused by the system. | |||
memory | slab_recl | The part of Slab that can be reclaimed, such as caches. | Slab — The total amount of memory, in kibibytes, used by the kernel to cache data structures for its own usehttps://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Deployment_Guide/s2-proc-meminfo.html | ||
memory | slab_unrecl | The part of Slab that cannot be reclaimed even when lacking memory | |||
memory | used | mem_used = mem_total - (mem_free + mem_buffered + mem_cached + mem_slab_total); | https://github.com/collectd/collectd/blob/master/src/memory.c#L349 | ||
disk | disk_io_time | io_time | time spent doing I/Os (ms). You can treat this metric as a device load percentage (Value of 1 sec time spent matches 100% of load). | https://collectd.org/wiki/index.php/Plugin:Disk http://lxr.free-electrons.com/source/include/uapi/linux/if_link.h#L43 | |
disk_io_time | weighted_io_time | measure of both I/O completion time and the backlog that may be accumulating. | |||
disk_merged | read | the number of operations, that could be merged into other, already queued operations, i. e. one physical disk access served two or more logical operations. Of course, the higher that number, the better. | |||
disk_merged | write | the number of operations, that could be merged into other, already queued operations, i. e. one physical disk access served two or more logical operations. Of course, the higher that number, the better. | |||
disk_octects | read | the number of octets read from a disk or partition | |||
disk_octects | write | the number of octets written to a disk or partition | |||
disk_ops | read | the number of read operations issued to the disk | |||
disk_ops | write | the number of write operations issued to the disk | |||
disk_time | read | the average time an I/O-operation took to complete. Note from collectd Since this is a little messy to calculate take the actual values with a grain of salt. | |||
disk_time | write | the average time an I/O-operation took to complete. Note from collectd Since this is a little messy to calculate take the actual values with a grain of salt. | |||
pending_operations | shows queue size of pending I/O operations. | ||||
Ping | ping | Network latency is measured as a round-trip time in milliseconds. An ICMP “echo request” is sent to a host and the time needed for its echo-reply to arrive is measured. | Latency | ||
ping_droprate | droprate = ((double) (pkg_sent - pkg_recv)) / ((double) pkg_sent); | https://github.com/collectd/collectd/blob/master/src/ping.c#L703 | |||
ping_stddev | if pkg_recv > 1 latency_stddev = sqrt (((((double) pkg_recv) * latency_squared) - (latency_total * latency_total)) / ((double) (pkg_recv * (pkg_recv - 1)))); | https://github.com/collectd/collectd/blob/master/src/ping.c#L698 pkg_recv = # of echo-reply messages received latency_squared = latency * latency (for a received echo-reply message) latency_total = the total latency for received echo-reply messages | |||
load | load | shortterm | load average figures giving the number of jobs in the run queue (state R) or waiting for disk I/O (state D) averaged over 1 Minute measured CPU and IO utilization for 1 min using /proc/loadavg | http://man7.org/linux/man-pages/man5/proc.5.html https://github.com/collectd/collectd/blob/master/src/load.c | |
load | midterm | load average figures giving the number of jobs in the run queue (state R) or waiting for disk I/O (state D) averaged over 5 Minutes measured CPU and IO utilization for 5 mins using /proc/loadavg | |||
load | longterm | load average figures giving the number of jobs in the run queue (state R) or waiting for disk I/O (state D) averaged over 15 Minutes measured CPU and IO utilization for 15 mins using /proc/loadavg | |||
OVS events | gauge | link_status | |||
OVS Stats | collisions | Number of collisions. | per interface | ||
rx_bytes | Number of received bytes. | http://openvswitch.org/ovs-vswitchd.conf.db.5.pdf | |||
rx_crc_err | Number of CRC errors. | ||||
rx_dropped | Number of packets dropped by RX. | ||||
rx_errors | Total number of receive errors, greater than or equal to the sum of the RX errors above. | ||||
rx_frame_err | Number of frame alignment errors. | ||||
rx_over_err | Number of packets with RX overrun. | ||||
rx_packets | Number of received packets | ||||
tx_bytes | Number of transmitted bytes | ||||
tx_dropped | Number of packets dropped by TX | ||||
tx_errors | Total number of transmit errors, greater than or equal to the sum of the TX errors above. | ||||
tx_packets | Number of transmitted packets | ||||
Hugepages | bytes | used | total/pernode/both | ||
bytes | free | ||||
vmpage_number | used | ||||
vmpage_number | free | ||||
percent | used | ||||
percent | free | ||||
processes | fork_rate | ||||
ps_state | blocked | ||||
ps_state | paging | ||||
ps_state | running | ||||
ps_state | sleeping | ||||
ps_state | stopped | ||||
ps_state | zombies | ||||
Host only | Libvirt | disk_octets | read | ||
disk_octets | write | ||||
disk_ops | read | ||||
disk_ops | write | ||||
if_dropped | in | ||||
if_dropped | out | ||||
if_errors | in | ||||
if_errors | out | ||||
if_octets | in | ||||
if_octets | out | ||||
if_packets | in | ||||
if_packets | out | ||||
memory | actual | ||||
memory | balloon | ||||
memory | rss | ||||
memory | swap_in | ||||
memory | total | ||||
virt_cpu_total | This is in jiffies! | ||||
virt_vcpu | This is in jiffies! | ||||
RDT | ipc | per core group | |||
memory_bandwidth | local | ||||
memory_bandwidth | remote | ||||
bytes | llc | ||||
dpdkstats compatible with DPDK 16.04 (based on ixgbe, vhost support will be enabled in DPDK 16.11, patch support being upgraded to DPDK 16.07 in progress) | derive | rx_l3_l4_xsum_error | |||
errors | flow_director_filter_add_errors | ||||
flow_director_filter_remove_errors | |||||
mac_local_errors | |||||
mac_remote_errors | |||||
if_rx_dropped | rx_fcoe_dropped | ||||
rx_mac_short_packet_dropped | |||||
rx_management_dropped | |||||
rx_priorityX_dropped | where X is 0 to 7 | ||||
if_rx_errors | rx_crc_errors | ||||
rx_errors | |||||
rx_fcoe_crc_errors | |||||
rx_fcoe_mbuf_allocation_errors | |||||
rx_fcoe_no_direct_data_placement | |||||
rx_fcoe_no_direct_data_placement_ext_buff | |||||
rx_fragment_errors | |||||
rx_illegal_byte_errors | |||||
rx_jabber_errors | |||||
rx_length_errors | |||||
rx_mbuf_allocation_errors | |||||
rx_oversize_errors | |||||
rx_priorityX_mbuf_allocation_errors | where X is 0 to 7 | ||||
rx_q0_errors | if more queues are allocated then you get the errors per Queue | ||||
rx_undersize_errors | |||||
if_rx_octets | rx_error_bytes | bug - will move this to errors | |||
rx_fcoe_bytes | |||||
rx_fcoe_bytes | |||||
rx_good_bytes | |||||
rx_q0_bytes | per queue | ||||
rx_total_bytes | |||||
if_rx_packets | rx_broadcast_packets | ||||
rx_fcoe_packets | |||||
rx_flow_control_xoff_packets | |||||
rx_flow_control_xon_packets | |||||
rx_good_packets | |||||
rx_management_packets | |||||
rx_multicast_packets | |||||
rx_priorityX_xoff_packets | where X is 0 to 7 | ||||
rx_priorityX_xon_packets | where X is 0 to 7 | ||||
rx_q0_packets | per queue | ||||
rx_size_1024_to_max_packets | |||||
rx_size_128_to_255_packets | |||||
rx_size_256_to_511_packets | |||||
rx_size_512_to_1023_packets | |||||
rx_size_64_packets | |||||
rx_size_65_to_127_packets | |||||
rx_total_missed_packets | |||||
rx_total_packets | |||||
rx_xoff_packets | |||||
rx_xon_packets | |||||
if_tx_errors | tx_errors | ||||
if_tx_octets | tx_fcoe_bytes | ||||
tx_good_bytes | |||||
tx_q0_bytes | per queue | ||||
if_tx_packets | tx_broadcast_packets | ||||
tx_fcoe_packets | |||||
tx_flow_control_xoff_packets | |||||
tx_flow_control_xon_packets | |||||
tx_good_packets | |||||
tx_management_packets | |||||
tx_multicast_packets | |||||
tx_priorityX_xoff_packets | where X is 0 to 7 | ||||
tx_priorityX_xon_packets | where X is 0 to 7 | ||||
tx_q0_packets | per queue | ||||
tx_size_1024_to_max_packets | |||||
tx_size_128_to_255_packets | |||||
tx_size_256_to_511_packets | |||||
tx_size_512_to_1023_packets | |||||
tx_size_64_packets | |||||
tx_size_65_to_127_packets | |||||
tx_total_packets | |||||
tx_xoff_packets | |||||
tx_xon_packets | |||||
operations | flow_director_added_filters | ||||
flow_director_matched_filters | |||||
flow_director_missed_filters | |||||
flow_director_removed_filters |