PostgreSQL REPMGR “靠谱”的高可用方式

原创

wx5c241fe5127d0 2022-06-22 10:13:27 ©著作权

文章标签 postgresql bash 配置文件 文章分类 虚拟化云计算

©著作权归作者所有：来自51CTO博客作者wx5c241fe5127d0的原创作品，请联系作者获取转载授权，否则将追究法律责任

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql

REPMGR 是一种方便简单的适合企业使用的高可用方式，为什么选择REPMGR作为单体PG的高可用方式

1 REPMGR 是这三种里面最简单的高可用的方式,这里的意思是结构节点,搭建简单,处理简单

2 在网络有波动的情况下,比较好控制,如果遇到网络上的短暂的问题,REPMGR通过一系列的方式可以避免某些切换. 调整的参数明显

3 资料多,并且有2象限(现在是EDB)这样的公司作为后盾, 并且国内的瀚高也是用这个作为他们商业的高可用方式的二次开发对象.

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_02

目前我们采用的一个主+两个从的方式一个注册一个不注册 (一个从可能是延迟库,也可能为BIG DATA 提供抽取数据使用)

我们以POSTGRESQL 12.2 REPMGR 5.2.1 版本为例

1 从库DOWN

2 两个从库DOWN

3 主库DOWN

4 一主一从DOWN

5 全部DOWN

1 安装依赖包

安装依赖包

yum -y install readline-devel

yum -y install gcc* --skip-broken

yum -y install zlib*

yum -y install openssl openssl-devel

yum -y install pam pam-devel

yum -y install libxml2 libxml2-devel

yum -y install libxslt libxslt-devel

yum -y install gcc, bison, gcc-c++, readline, readline-devel, zlib, zlib-devel

yum -y install libssl*

yum -y install systemd*

yum -y install e2fsprogs-devel uuid-devel libuuid-devel

2 设置用户名密码

useradd postgres

passwd postgres

visudo

3 操作ssh 互信

ssh-keygen -t rsa

ssh-copy-id -i /home/postgres/.ssh/id_rsa.pub postgres@10.50.132.145

ssh-copy-id -i /home/postgres/.ssh/id_rsa.pub postgres@10.50.132.146

ssh-copy-id -i /home/postgres/.ssh/id_rsa.pub postgres@10.50.132.147

4 解压文件编译文件

tar -zxvf postgresql-12.2.tar.gz

./configure --prefix=/usr/local/postgres --bindir=/usr/local/postgres/bin --sysconfdir=/etc --libdir=/usr/local/postgres/libs --includedir=/usr/local/postgres/includes --datadir=/pgdata --datarootdir=/pgdata/root --with-pgport=5432 --with-openssl --with-pam --with-systemd --with-libxml --with-segsize=4 --with-ossp-uuid

gmake & gmake install

5 初始化数据库

sudo chown -R postgres:postgres /pgdata/

/usr/local/postgres/bin/initdb -D /pgdata/data --wal-segsize=64

添加.bash_profile配置文件

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_03

6 调整主库的postgresql配置文件 postgresql.conf 此处略过

新建用户 ,密码略过

repmgr

repl

调整主库的pg_hba.conf 配置文件

PostgreSQL REPMGR “靠谱”的高可用方式_bash_04

7 将.bash-profile 拷贝到其他的两个数据库上

scp -r /home/postgres/.bash_profile postgres@10.50.132.146

scp -r /home/postgres/.bash_profile postgres@10.50.132.147

8 安装repmgr

yum -y install flex

tar -zxvf repmgr-5.2.1.tar.gz

./configure

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_05

make

make install

PostgreSQL REPMGR “靠谱”的高可用方式_bash_06

并且需要在postgresql.conf 添加 repmgr 在 shared_perload_libraries

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_07

9 配置REPMGR 数据库

启动主库并且在主库中运行如下命令

create database repmgr;

create user repmgr with password 'repmgr';

alter user repmgr superuser login;

alter database repmgr owner to repmgr;

create user repl with superuser password 'repl';

\c repmgr

create extension repmgr;

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_08

10 需要配置 .pgpass 免密功能

10.50.132.145:5432:repmgr:repmgr:repmgr

10.50.132.145:postgres:repl:repl

10.50.132.146:5432:repmgr:repmgr:repmgr

10.50.132.146:5432:postgres:repl:repl

10.50.132.147:5432:repmgr:repmgr:repmgr

10.50.132.147:5432:postgres:repl:repl

然后改变文件属性

chmod 600 .pgpass

将文件传送到其他两台机器

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_09

11 配置repmgr 文件模板

配置文件模板

https://github.com/EnterpriseDB/repmgr/blob/master/repmgr.conf.sample

node_id=1

node_name='10.50.132.145'

conninfo='host=10.50.132.145 dbname=repmgr user=repmgr connect_timeout=10'

data_directory='/pgdata/data'

replication_user='repl'

replication_type='physical'

log_level='INFO'

log_facility='STDERR'

log_file='/pgdata/errorlog/repmgr.log'

log_status_interval=300

repmgr_bindir='/usr/local/postgres/bin'

passfile='/home/postgres/.pgpass'

failover='automatic'

promote_command='repmgr standby promote -f /etc/repmgr.conf'

follow_command='repmgr standby follow -f /etc/repmgr.conf -W --upstream-node-id=%n'

repmgrd_pid_file=/pgdata/repmgr.pid

PostgreSQL REPMGR “靠谱”的高可用方式_bash_10

12 注册主库

repmgr -f /etc/repmgr.conf primary register

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_11 开始克隆从库

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_12

克隆后在注册从库

repmgr -f /etc/repmgr.conf standby register

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_13

注:此时我们仅仅注册一台从库.另一台不进行注册,也不进行切换.

配置KEEPALIVE , 备库与主库类似不在写了

! Configuration File for keepalived

vrrp_script chk_pg {

script "/etc/keepalived/pg_check.sh"

interval 2

fall 2

rise 1

}

vrrp_instance VI_1 {

state BACKUP

interface ens192

virtual_router_id 51

mcast_src_ip 10.50.132.145

priority 100

advert_int 1

nopreempt

authentication {

auth_type PASS

auth_pass 1111

}

track_script {

chk_pg

}

virtual_ipaddress {

10.50.132.173/24 label ens192:1

}

/pg_check.sh

#!/bin/bash

source /etc/profile

N=`ps -C postgres --no-header | wc -l`

if [ $N -eq 0 ];then

echo '0'

exit 1

else

echo '1'

exit 0

14 启动repmgrd -f /etc/repmgr.conf

(在注册的REPMGR的两台主机上启动)

安装和启动就完成了

1 从库DOWN 分为注册的和没注册的

1.1 注册的从库将10.50.132.146 关闭, 通过查询并且连接VIP ,系统可以继续工作,不会影响到整体的业务

PostgreSQL REPMGR “靠谱”的高可用方式_bash_14

待 10.50.132.146 恢复后,我们启动 repmgrd -f /etc/repmgr.conf

并查看系统状态.

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_15

1.2 关闭10.50.132.147

应用系统不会受到影响, 并且在短时间PG_WAL 日志可以追溯的情况下,从库启动后会立即开始追数据. 从主库查看从库 147 已经连接

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_16

2 2个主库DOWN

10.50.132.146 10.50.132.147, 状态与上一样.

PostgreSQL REPMGR “靠谱”的高可用方式_bash_17

恢复就是启动数据库服务,并且在10.50.132.146上启动 repmgrd -f /etc/repmgr.conf

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_18

PostgreSQL REPMGR “靠谱”的高可用方式_bash_19

两个从库DOWN ,结论不会影响业务

3 主库DOWN

在第一时间从库开启判断机制,进行主从切换的准备

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_20

在预设1分钟后,还无响应,则自动开始切换

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_21

IP 漂移到从库

PostgreSQL REPMGR “靠谱”的高可用方式_bash_22

业务访问从库是可以进行操作的

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_23

下面进行失败的主库,从新连接会集群并作为从库

1 主库服务器启动

2 确认关闭keepalived

3 确认原主库关闭的状态下,执行 node rejoin命令

PostgreSQL REPMGR “靠谱”的高可用方式_配置文件_24

repmgr node rejoin -f /etc/repmgr.conf -d 'host=10.50.132.146 dbname=repmgr user=repmgr' --force-rewind --config-files=postgresql.conf,postgresql.conf --verbose

PostgreSQL REPMGR “靠谱”的高可用方式_postgresql_25