Skip to content

Latest commit

 

History

History
75 lines (57 loc) · 2.48 KB

README.md

File metadata and controls

75 lines (57 loc) · 2.48 KB

Graphite-Anomaly-Detector

Command line application that determines spikes in Graphite metrics for a given host and generates a CSV report

Usage

usage: generate_report [-h] -s SERVERS [SERVERS ...] -o OUTPUT [-l LIMIT]
                          [-p PATTERN] [-t TIMEOUT]

Generates CSV report with spikes found on Graphite hosts

optional arguments:
  -h, --help            show this help message and exit
  -s SERVERS [SERVERS ...], --servers SERVERS [SERVERS ...]
                        Servers queried by program (default: None)
  -o OUTPUT, --output OUTPUT
                        Output CSV file (default: [None])
  -l LIMIT, --limit LIMIT
                        Limit request count per host (default: [500])
  -p PATTERN, --pattern PATTERN
                        Glob pattern for metrics (default: ['*'])
  -t TIMEOUT, --timeout TIMEOUT
                        Timeout bethween requests for host (default: [1])

For example

./generate_report.py -s dumdum:81 -o report.csv -l 2 -p *.cpu.* -t 42

Will gather the first two metrics that match *.cpu.* with 42 seconds timeout bethween the requests. The command will ouptut the results in report.csv

Install

python setup.py install

Result

The program creates a csv file ginven as argument for the --output option having 5 columns:

HOST METRIC TIME RELEVANCE GRAPH_URL
host:81 cpu.0.wait 4 Jul 2013 .42 http://host:81/render/?target=cpu.0.wait

Developing Detectors

All detectors should extend the Detector class and implement the detect_anomalies method.

To change the detector in the program, just change the DETECTOR=SpikeDetector() line with an instance of your detector.

Testing Detectors

For testing you can use test_local.py.

It requires matplotlib for plotting the results.

test_local.py receives a Graphite json response for a given target and plots the following:

  • Original timeseries
  • Timeseries after running the detector
  • Detector results
  • A treshold line (you can remove it while testing)

You just have to replace SpikeDetector with your detector.

For testing large data, you can use the util.logger create_logfile function. It creates a json file that can be viewed using view_log.py script that plots a graph simmilar to the one yielded by test_local.py.