Anuket Project

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

 

The tables and lists of questions have been created by Sridhar Rao <Sridhar.Rao@spirent.com>

 

Terminology Definition

TermWhat we mean by that?
MetricA Measurement of a particular characteristic.
Ex: %ge of CPU used, Amount of Bandwidth used, etc. Complete definition can be found here
EventA record of something that has happened - A simple immutable fact.
Example: Link has gone down. A packet from a flow is dropped, etc. Complete definition can be found here 
AgentSoftware that runs on a node/system that needs to be monitored.
Client NodeA node that is monitored (Node on which agent runs)
Server NodeA node that collects metrics and events from the client node.
Sampling IntervalHow frequently the metrics are sent.
Push ModeFetching of events by subscribing
Poll ModeFetching of events via polling.
Writing of Metrics/eventssending/outputting of metrics or events.
Reading of Metrics/eventsreceiving/reading of measrements
Logging of Metrics/eventsLogging of monitored/received metric or event
Metric Types (data source types)

Guage: Value stored as-is
Derive: Derivative - Change of the value (rate)
Counter: Similar to Derive - but it is NEVER negative (due to wrap-around)
Absolute: 

 

 

Parameter Table

 

Parameters\Tools

CollectdCeilometerMonascastatsdnode-exportersensumunintelegrafnagiosdiamondcentreonicingaOpenNMSNSClient++Elastic BeatsReimannNote:
1. For some parameters the answer could be just YES/NO,
2. Whereas, for some we may have to provide a description/details
3. For some we may have to choose from the list [], whereas for some we may append a value to the list.
4. For some parameters, please provide the number of 'actual metrics' provided under that category. For example, collectd would provide 12 metrics for Processes-category

Use NA - If Not applicable.
Use NK - If it is Not Known
Lowest Sampling Interval1 sec               
CPU metrics                
Disk IO metrics                
Memory metrics                
Process metrics                
Network Interface MetricsInterface plugin: Standard 4 fields of rx/tx (octets, packets, errors, dropped).
Netlink plugin: uses netlink sockets and covers others
               
Libvirt MetricsYES -                
Container resource usage MonitoringYES               
Writing to Databases Support: [Influxdb, MongoDb,  MySql, PostgreSql, Carbon(graphite),  Prometheus, RRDCache,Redis, TSDB]YES for all               
Encryption SupportYES               
Extensibility - multilanguage support [Python, Java, Golang, C/C++, Lua]YES for all               
Interoperability [with other monitoring solutions]Sensu, statsd, telegraf?               
Write to Message Queues (AMQP, Kafka )YES for ALL               
Metrics Pub/sub Mode SupportYES               
Metrics Res/Resp Mode Support                 
Support for Events (polling, Pushing)                
Notification SupportYES               
Logging Support YES               
Hypervisor metrics                
Log-File AnalysisYES               
Other Writing Support:
[CSV, HTTP, RRD, UnixSocket]
                
Transport Protocol                
 Data-Format
[XML, JSON, etc]
JSON, Custom, XML               
Data-modelCustom               
Hardware:
IPMI, Battery, Sensors, 
YES for all               
Metric Types: Guage, Derive, Counter, absoluteYES for all               
Language (written)C               
Last-Updated                
Commercial Versions?                
Resource consumption by the agent                
License                
Webserver monitoring
[Nginix, Apache]
YES for all               
Platforms - OS?                
Configuration Tool support [Puppet, Chef, Ansible, Salt]                
Server-mode support?YES               
                   

 

Inference Questions

The Questions The Answer
Lowest Interval: Which agent supports the lowest sampling interval, and what is the value? 
Interoperability: Which agent is 'most interoperable'?  (Work with maximum of 'servers' (collection node) 
Large-scale deployment: Which agent is ideal for large-scale monitoring (Provide description in a separate page, if needed) 
Low-footprint: Which agent has the lowest footprint (memory and CPU)? 
Metrics: Which agent supports maximum number of metrics? 
Gaps: Are there any metrics that are not supported by any of the agent and that are relavant to NFV? 
Which agent is ideal for realtime analytics?- [Support for maximum scalable datastores, visualization tools and Analytics engines?] 
Is any of the agents been used in large-scale real-world deployments? If so, please provide the details on the performance. 
Which agent has the least/maximum dependency - Libraries, OS/Kernel versions, etc.? 
Which agent provides maximum 'freedom' w.r.t. Licenses (core agent + plugins)? 
Which agent is best for the following datastores: Influxdb, Graphite, ElasticSearch? 
Which agent support dynamic configuration? 
  
  
  
  
  
  
  
  • No labels