Graphite安装笔记

实验环境: CentOS7.5 x64


部署python的程序推荐使用virtualenv这列的虚拟环境来进行,防止污染系统自带的python包。

 

安装一些必要的rpm包

yum install cairo  -- 另外的2个包 cairo-devel pycairo 可以不用安装


安装virtualenv

yum install python-pip

pip install virtualenv


激活虚拟环境

virtualenv /opt/graphite

source /opt/graphite/bin/activate


安装graphite全套组件

export PYTHONPATH="/opt/graphite/lib/:/opt/graphite/webapp/"


pip install --no-binary=:all: https://github.com/graphite-project/whisper/tarball/master


pip install --no-binary=:all: https://github.com/graphite-project/carbon/tarball/master

 

pip install --no-binary=:all: https://github.com/graphite-project/graphite-web/tarball/master

如果这步提示某个依赖包版本不对,我们可以先人工执行下pip安装依赖包。下面提供了一个正常运行的graphite上看到的pip包清单:

attrs==19.1.0

Automat==0.7.0

cachetools==3.1.1

cairocffi==0.9.0

cffi==1.12.3

constantly==15.1.0

Django==1.11.24

django-tagging==0.4.3

gunicorn==19.9.0

hyperlink==19.0.0

idna==2.8

incremental==17.5.0

pycparser==2.19

PyHamcrest==1.9.0

pyparsing==2.4.2

pytz==2019.2

scandir==1.10.0

six==1.12.0

Twisted==19.7.0

txAMQP==0.8.2

urllib3==1.25.3

whisper==1.2.0

zope.interface==4.6.0


初始化

cd /opt/graphite/webapp/graphite

cp local_settings.py.example local_settings.py

 

vim local_settings.py 修改如下3行代码:

CONF_DIR = '/opt/graphite/conf'

STORAGE_DIR = '/opt/graphite/storage'

STATIC_ROOT = '/opt/graphite/webapp/content'

LOG_DIR = '/opt/graphite/storage/log/webapp'

 

初始化下数据,不然后面启动graphite后会提示no such user 这类报错

cd  /opt/graphite/webapp

PYTHONPATH=/opt/graphite/webapp django-admin.py migrate --settings=graphite.settings --run-syncdb   ## 如果上面第一步时候没有yum install cairo的话,这步会报错

 

拷贝graphite的配置文件

cd /opt/graphite/conf/

 

cp carbon.conf.example carbon.conf

cp storage-schemas.conf.example storage-schemas.conf   一般会改动这个文件里面的配置,达到自定义key存储周期

cp relay-rules.conf.example relay-rules.conf

cp dashboard.conf.example dashboard.conf

cp graphTemplates.conf.example graphTemplates.conf

cp blacklist.conf.example  blacklist.conf

cp aggregation-rules.conf.example aggregation-rules.conf

cp storage-aggregation.conf.example storage-aggregation.conf





启动graphite套件

/opt/graphite/bin/carbon-relay.py start

/opt/graphite/bin/carbon-cache.py start

 

ss -lntp | egrep carbon

LISTEN     0      50           *:7002                     *:*                   users:(("carbon-cache.py",pid=48433,fd=11))

LISTEN     0      50           *:2013                     *:*                   users:(("carbon-relay.py",pid=37683,fd=13))

LISTEN     0      50           *:2014                     *:*                   users:(("carbon-relay.py",pid=37683,fd=15))

LISTEN     0      50           *:2003                     *:*                   users:(("carbon-cache.py",pid=48433,fd=13))

LISTEN     0      50           *:2004                     *:*                   users:(("carbon-cache.py",pid=48433,fd=14))

 

/opt/graphite/bin/run-graphite-devel-server.py --port=8085 --libs=/opt/graphite/webapp /opt/graphite  前台启动



启动打点测试进程

python /opt/graphite/examples/example-client.py


安装nginx

yum install nginx

cd /etc/nginx/conf.d

cat graphite.conf  内容如下:

upstream graphite {

    server 127.0.0.1:8085 fail_timeout=0;

}

 

server {

    listen 80 default_server;

 

    server_name _;

 

    root /opt/graphite/webapp;

 

    access_log /var/log/nginx/graphite.access.log;

    error_log  /var/log/nginx/graphite.error.log;

 

    location = /favicon.ico {

        return 204;

    }

 

    # serve static content from the "content" directory

    location /static {

        alias /opt/graphite/webapp/content;

        expires max;

    }

 

    location / {

        try_files $uri @graphite;

    }

 

    location @graphite {

        proxy_pass_header Server;

        proxy_set_header Host $http_host;

        proxy_redirect off;

        proxy_set_header X-Real-IP $remote_addr;

        proxy_set_header X-Scheme $scheme;

        proxy_connect_timeout 10;

        proxy_read_timeout 10;

        proxy_pass http://graphite;

    }

}

 

创建nginx相关文件

touch /var/log/nginx/graphite.access.log

touch /var/log/nginx/graphite.error.log

chmod 640 /var/log/nginx/graphite.*

chown nginx.nginx /var/log/nginx/graphite.*

 

启动nginx

nginx -t

systemctl start nginx



访问 192.168.2.4/ 如下图

image.png



可以看到已经有数据采集到了。

 

 

我们可以到grafana里面配下graphite的数据源,然后展示数据,类似如下:

image.png




启动brubeck(前台)  【一款类似statsd的工具,只是不需要nodejs环境】

yum install jansson jansson-devel

yum install libmicrohttpd libmicrohttpd-devel

 

git clone https://github.com/github/brubeck.git

 

cd brubeck/

./script/bootstrap

 

 

mkdir /usr/local/brubeck/

 

cp brubeck /usr/local/brubeck/



vim /usr/local/brubeck/brubeck.json  增加配置文件如下:

 

{

  "sharding" : false,

  "server_name" : "prod-devops-graphite-01",

  "dumpfile" : "/var/lib/brubeck.dump",

  "capacity" : 15,

  "expire" : 20,

  "http" : ":8000",

 

  "backends" : [

    {

      "type" : "carbon",

      "address" : "localhost",

      "port" : 2003,

      "frequency" : 10

    }

  ],

 

  "samplers" : [

    {

      "type" : "statsd",

      "address" : "0.0.0.0",

      "port" : 8135,

      "workers" : 8,

      "multisock" : true,

      "multimsg" : 8

    }

  ]

}

 

【brubeck监听在8135端口上,用于接收业务方的打点。2003 端口指的是carbon-cache的端口】

 

./brubeck --config brubeck.json   前台启动进程



测试打点到brubeck

echo "db.dba.dble2.front_conn:11111|g" | nc -u -w1 127.0.0.1 8135


修改graphite存储数据的周期

cd /opt/graphite/conf

vim storage-schemas.conf

# Schema definitions for Whisper files. Entries are scanned in order,

# and first match wins. This file is scanned for changes every 60 seconds.

#

#  [name]

#  pattern = regex

#  retentions = timePerPoint:timeToStore, timePerPoint:timeToStore, ...

 

# Carbon's internal metrics. This entry should match what is specified in

# CARBON_METRIC_PREFIX and CARBON_METRIC_INTERVAL settings

[carbon]

pattern = ^carbon\.

retentions = 60:90d

 

[service-sla]

pattern = ^service.sla-pct.*

retentions = 10s:2d,1min:8d,10min:365d

 

[service1-rule]

pattern = ^service1\.timers\.Rule.*

retentions = 10s:5min,1min:1h,10min:7d

 

[service1-ic]

pattern = ^service1\.counters\.IC.*

retentions = 10s:24h,1min:7d

 

[logstash]

pattern = service1\.counters\.logstash.*

retentions = 10s:24h,1min:7d

 

[service1-timers]

pattern = ^service1\.timers\.*

retentions = 10s:5min,1min:1h,10min:1d,1h:365d

 

[service]

pattern = ^service1.*

retentions = 10s:24h,1min:6d,10min:365d


[counter]

pattern = ^.*\.counters\..*

retentions = 10s:24h,1min:7d,10min:30d

 

[timers]

pattern = ^.*\.timers\..*

retentions = 10s:5min,1min:1h,10min:1d,1h:30d

 

[db]

pattern = ^db\.*

retentions = 1min:30d

 

[default_10s_for_1hour]

pattern = .*

retentions = 10s:1h

 

vim storage-aggregation.conf  内容如下:

[min]

pattern = \.min$

xFilesFactor = 0.01

aggregationMethod = min

 

[max]

pattern = \.max$

xFilesFactor = 0.01

aggregationMethod = max

 

[all_sum]

pattern = \.sum$

xFilesFactor = 0.01

aggregationMethod = sum

 

[all_count]

pattern = \.count$

xFilesFactor = 0.01

aggregationMethod = sum

 

[service1_gauges]

pattern = ^service1.gauges

xFilesFactor = 0.01

aggregationMethod = last

 

[service1_counters]

pattern = ^service1\.counters

xFilesFactor = 0.01

aggregationMethod = sum

 

[service1]

pattern = ^service1.*

xFilesFactor = 0.01

 

[default_average]

pattern = .*

xFilesFactor = 0.01

aggregationMethod = average

 


注意,  修改为上面的2个文件后, 我们需要重启 carbon-cache进程才能生效。



建议:

1、graphite-web进程,建议通过supervisor来控制启停。

2、个人认为brubeck比statsd更好使,例如有些机器上可能不便于部署nodejs环境