Page History

...

Where collectd is running	Plugin	Type	Type Instance	Description	comment
Host/guest	CPU	percent/nanoseconds	idle	Time CPU spends idle.	Can be per cpu/aggregate across all the cpus. For more info, please see: http://man7.org/linux/man-pages/man1/top.1.html http://blog.scoutapp.com/articles/2015/02/24/understanding-linuxs-cpu-stats Note that jiffies operate on a variable time base, HZ. The default value of HZ should be used (1000), yielding a jiffy value of 0.001 seconds) [time(7)]. Also, the actual number of jiffies in each second is subject to system factors, such as use of virtualization. Thus, the percent calculation based on jiffies will nominally sum to 100% plus or minus error.
		percent/nanoseconds	nice	Time the CPU spent running user space processes that have been niced. The priority level a user space process can be tweaked by adjusting its niceness.
		percent/nanoseconds	interrupt	Time the CPU has spent servicing interrupts.
		percent/nanoseconds	softirq	(apparently) Time spent handling interrupts that are almost as important as Hardware interrupts (above). "In current kernels there are ten softirq vectors defined; two for tasklet processing, two for networking, two for the block layer, two for timers, and one each for the scheduler and read-copy-update processing. The kernel maintains a per-CPU bitmask indicating which softirqs need processing at any given time." [Ref]
		percent/nanoseconds	steal	CPU steal is a measure of the fraction of time that a machine is in a state of “involuntary wait.” It is time for which the kernel cannot otherwise account in one of the traditional classifications like user, system, or idle. It is time that went missing, from the perspective of the kernel. http://www.stackdriver.com/understanding-cpu-steal-experiment/
		percent/nanoseconds	system	Time that the CPU spent running the kernel.
		percent/nanoseconds	user	Time CPU spends running un-niced user space processes.
		percent/nanoseconds	wait	The time the CPU spends idle while waiting for an I/O operation to complete
	Interface	if_dropped	in	The number of received dropped packets.
		if_errors	in	The number of received error packets.
		if_octets	in	The number of received bytes.
		if_packets	in	The number of received packets.
		if_dropped	out	The number of transmit packets dropped
		if_errors	out	The number of transmit error packets.
		if_octets	out	The number of bytes transmitted
		if_packets	out	The number of transmitted packets
	Memory	memory	buffered	The amount, in kibibytes, of temporary storage for raw disk blocks.	https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Deployment_Guide/s2-proc-meminfo.html
		memory	cached	The amount of physical RAM, in kibibytes, left unused by the system.
		memory	free	The amount of physical RAM, in kibibytes, left unused by the system.
		memory	slab_recl	The part of Slab that can be reclaimed, such as caches.	`Slab` — The total amount of memory, in kibibytes, used by the kernel to cache data structures for its own use https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Deployment_Guide/s2-proc-meminfo.html
		memory	slab_unrecl	The part of Slab that cannot be reclaimed even when lacking memory
		memory	used	mem_used = mem_total - (mem_free + mem_buffered + mem_cached + mem_slab_total);	https://github.com/collectd/collectd/blob/master/src/memory.c#L349
	disk	disk_io_time	io_time	time spent doing I/Os (ms). You can treat this metric as a device load percentage (Value of 1 sec time spent matches 100% of load).	https://collectd.org/wiki/index.php/Plugin:Disk
		disk_io_time	weighted_io_time	measure of both I/O completion time and the backlog that may be accumulating.
		disk_merged	read	the number of operations, that could be merged into other, already queued operations, i. e. one physical disk access served two or more logical operations. Of course, the higher that number, the better.
		disk_merged	write	the number of operations, that could be merged into other, already queued operations, i. e. one physical disk access served two or more logical operations. Of course, the higher that number, the better.
		disk_octects	read	the number of octets read from a disk or partition
		disk_octects	write	the number of octets written to a disk or partition
		disk_ops	read	the number of read operations issued to the disk
		disk_ops	write	the number of write operations issued to the disk
		disk_time	read	the average time an I/O-operation took to complete. Note from collectd Since this is a little messy to calculate take the actual values with a grain of salt.
		disk_time	write	the average time an I/O-operation took to complete. Note from collectd Since this is a little messy to calculate take the actual values with a grain of salt.
		pending_operations		shows queue size of pending I/O operations.
	Ping	ping		Network latency is measured as a round-trip time in milliseconds. An ICMP “echo request” is sent to a host and the time needed for its echo-reply to arrive is measured.	Latency
		ping_droprate		droprate = ((double) (pkg_sent - pkg_recv)) / ((double) pkg_sent);	https://github.com/collectd/collectd/blob/master/src/ping.c#L703
		ping_stddev		if pkg_recv > 1 latency_stddev = sqrt (((((double) pkg_recv) * latency_squared) - (latency_total * latency_total)) / ((double) (pkg_recv * (pkg_recv - 1))));	https://github.com/collectd/collectd/blob/master/src/ping.c#L698 pkg_recv = # of echo-reply messages received latency_squared = latency * latency (for a received echo-reply message) latency_total = the total latency for received echo-reply messages
	load	load	shortterm
		load	midterm
		load average figures giving the number of jobs in the run queue (state R) or waiting for disk I/O (state D) averaged over 1 Minute measured CPU and IO utilization for 1 min using /proc/loadavg	http://man7.org/linux/man-pages/man5/proc.5.html https://github.com/collectd/collectd/blob/master/src/load.c
	load	midterm		load average figures giving the number of jobs in the run queue (state R) or waiting for disk I/O (state D) averaged over 5 Minutes measured CPU and IO utilization for 5 mins using /proc/loadavg
	load	longterm		load average figures giving the number of jobs in the run queue (state R) or waiting for disk I/O (state D) averaged over 15 Minutes measured CPU and IO utilization for 15 mins using /proc/loadavg	load	longterm
	OVS events	gauge	link_status
	OVS Stats	collisions			per interface
		rx_bytes
		rx_crc_err
		rx_dropped
		rx_errors
		rx_frame_err
		rx_over_err
		rx_packets
		tx_bytes
		tx_dropped
		tx_errors
		tx_packets
	Hugepages	bytes	used		total/pernode/both
		bytes	free
		vmpage_number	used
		vmpage_number	free
		percent	used
		percent	free
	processes	fork_rate
		ps_state	blocked
		ps_state	paging
		ps_state	running
ps_state		sleeping
ps_state		stopped
ps_state		zombies
Host only	Libvirt	disk_octets	read
		disk_octets	write
		disk_ops	read
		disk_ops	write
		if_dropped	in
		if_dropped	out
		if_errors	in
		if_errors	out
		if_octets	in
		if_octets	out
		if_packets	in
		if_packets	out
		memory	actual
		memory	balloon
		memory	rss
		memory	swap_in
		memory	total
		virt_cpu_total			This is in jiffies!
		virt_vcpu			This is in jiffies!
	RDT	ipc			per core group
		memory_bandwidth	local
		memory_bandwidth	remote
		bytes	llc
	dpdkstats compatible with DPDK 16.04 (based on ixgbe, vhost support will be enabled in DPDK 16.11, patch support being upgraded to DPDK 16.07 in progress)	derive	rx_l3_l4_xsum_error
		errors	flow_director_filter_add_errors
			flow_director_filter_remove_errors
			mac_local_errors
			mac_remote_errors
		if_rx_dropped	rx_fcoe_dropped
			rx_mac_short_packet_dropped
			rx_management_dropped
			rx_priorityX_dropped		where X is 0 to 7
		if_rx_errors	rx_crc_errors
			rx_errors
			rx_fcoe_crc_errors
			rx_fcoe_mbuf_allocation_errors
			rx_fcoe_no_direct_data_placement
			rx_fcoe_no_direct_data_placement_ext_buff
			rx_fragment_errors
			rx_illegal_byte_errors
			rx_jabber_errors
			rx_length_errors
			rx_mbuf_allocation_errors
			rx_oversize_errors
			rx_priorityX_mbuf_allocation_errors		where X is 0 to 7
			rx_q0_errors		if more queues are allocated then you get the errors per Queue
			rx_undersize_errors
		if_rx_octets	rx_error_bytes		bug - will move this to errors
			rx_fcoe_bytes
			rx_fcoe_bytes
			rx_good_bytes
			rx_q0_bytes		per queue
			rx_total_bytes
		if_rx_packets	rx_broadcast_packets
			rx_fcoe_packets
			rx_flow_control_xoff_packets
			rx_flow_control_xon_packets
			rx_good_packets
			rx_management_packets
			rx_multicast_packets
			rx_priorityX_xoff_packets		where X is 0 to 7
			rx_priorityX_xon_packets		where X is 0 to 7
			rx_q0_packets		per queue
			rx_size_1024_to_max_packets
rx_size_128_to_255_packets
rx_size_256_to_511_packets
rx_size_512_to_1023_packets
rx_size_64_packets
rx_size_65_to_127_packets
rx_total_missed_packets
rx_total_packets
rx_xoff_packets
rx_xon_packets
if_tx_errors		tx_errors
if_tx_octets		tx_fcoe_bytes
		tx_good_bytes
		tx_q0_bytes		per queue
if_tx_packets		tx_broadcast_packets
		tx_fcoe_packets
		tx_flow_control_xoff_packets
		tx_flow_control_xon_packets
		tx_good_packets
		tx_management_packets
		tx_multicast_packets
		tx_priorityX_xoff_packets		where X is 0 to 7
		tx_priorityX_xon_packets		where X is 0 to 7
		tx_q0_packets		per queue
	tx_size_1024_to_max_packets
	tx_size_128_to_255_packets
	tx_size_256_to_511_packets
	tx_size_512_to_1023_packets
	tx_size_64_packets
	tx_size_65_to_127_packets
	tx_total_packets
	tx_xoff_packets
	tx_xon_packets
operations	flow_director_added_filters
	flow_director_matched_filters
	flow_director_missed_filters
	flow_director_removed_filters

Space shortcuts

Page tree

Versions Compared

Old Version 16

New Version 17

Key