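Fellow DBAs will be familiar with the idea of incremental recovery. As with incremental recovery in MySQL, a full backup taken at time t1 plus the WAL logs from t1 to t2 is enough to restore postgres to its state at time t2. PostgreSQL calls this point-in-time recovery (PITR).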

Preparation:
Configure postgresql.conf:
wal_level = archive    # or hot_standby, or a higher level
archive_mode = on
archive_command = 'DATE=$(date +%Y%m%d); DIR="/paic/pg6666/pg_archlog/$DATE"; (test -d $DIR || mkdir -p $DIR) && cp %p $DIR/%f'
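
The one-line archive_command above can also be moved into a small helper script, which is easier to test and can refuse to overwrite a segment that is already archived. The following is only a minimal sketch: the function name archive_wal and the idea of passing the archive root as a third argument are assumptions made for illustration, not part of the original setup.

```shell
#!/bin/bash
# archive_wal SRC FNAME ARCHIVE_ROOT   (hypothetical helper)
# Copies one WAL segment to ARCHIVE_ROOT/YYYYMMDD/FNAME.
# Returns non-zero if the target already exists or the copy fails,
# so PostgreSQL keeps the segment and retries later.
archive_wal() {
  local src="$1" fname="$2" root="$3"
  local dir
  dir="$root/$(date +%Y%m%d)"
  mkdir -p "$dir" || return 1
  # Never overwrite a segment that has already been archived.
  [ ! -e "$dir/$fname" ] || return 1
  # Copy under a temporary name first so that a partial file never
  # appears under the final name.
  cp "$src" "$dir/$fname.tmp" && mv "$dir/$fname.tmp" "$dir/$fname"
}
```

If the function is saved in a script that ends with archive_wal "$@", the setting would look something like archive_command = '/path/to/archive_wal.sh %p %f /paic/pg6666/pg_archlog', where /path/to/archive_wal.sh is a placeholder.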

Backup script backup.sh:

#!/bin/bash 

export LANG=en_US.utf8 
export PGHOME=/paic/postgres/base/9.4.0 
export LD_LIBRARY_PATH=$PGHOME/lib:/lib64:/usr/lib64:/usr/local/lib64:/lib:/usr/lib:/usr/local/lib:$LD_LIBRARY_PATH 
export DATE=`date +"%Y%m%d"` 
export PATH=$PGHOME/bin:$PATH:. 
export PGDATA=/paic/pg6666/data

BASEDIR="/paic/postgres/home/postgres/pg_bak" 

date +%F-%T 

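if [ ! -d "$BASEDIR/$DATE" ]; then
  mkdir -p "$BASEDIR/$DATE"
  if [ $? -eq 0 ]; then
    # Enter backup mode; now()::text is used as the backup label.
    psql -h 127.0.0.1 -p 6666 -U postgres postgres -c "select pg_start_backup(now()::text)"
    if [ $? -eq 0 ]; then
      # -L follows symlinks so their targets are copied as real files.
      cp -r -L "$PGDATA" "$BASEDIR/$DATE"
      CP_RC=$?
    else
      echo "select pg_start_backup(now()::text) error"
      exit 1
    fi
    # Leave backup mode even if the copy failed; pg_stop_backup() also forces
    # a switch to a new WAL segment so the WAL written during the backup can be archived.
    psql -h 127.0.0.1 -p 6666 -U postgres postgres -c "select pg_stop_backup()"
    if [ $? -ne 0 ] || [ $CP_RC -ne 0 ]; then
      echo "backup failed (copy or pg_stop_backup() error)"
      exit 1
    fi
    date +%F-%T
    echo "backup succeeded"
    exit 0
  else
    echo "mkdir -p $BASEDIR/$DATE error"
    exit 1
  fi
else
  echo "a backup for $DATE already exists, not backing up again"
  exit 1
fi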
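
Before relying on a copy made by backup.sh, it can help to confirm that it was really taken in backup mode. pg_start_backup() writes a backup_label file into the data directory, and recovery reads it to find out where WAL replay must begin. Below is a rough sketch of such a check. The function name verify_base_backup is made up for this example, and it only looks for two marker files, so it does not prove the backup is consistent.

```shell
#!/bin/bash
# verify_base_backup DIR   (hypothetical helper)
# Succeeds only if DIR contains the marker files expected in a data
# directory that was copied between pg_start_backup() and pg_stop_backup().
verify_base_backup() {
  local dir="$1" f
  for f in PG_VERSION backup_label; do
    if [ ! -f "$dir/$f" ]; then
      echo "missing $f in $dir" >&2
      return 1
    fi
  done
  echo "marker files found in $dir"
}
```

With the paths used in this post it would be called as verify_base_backup "$BASEDIR/$DATE/data".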

Recovery script recovery.sh:

#!/bin/bash 
export LANG=en_US.utf8 
export PGHOME=/paic/postgres/base/9.4.0 
export LD_LIBRARY_PATH=$PGHOME/lib:/lib64:/usr/lib64:/usr/local/lib64:/lib:/usr/lib:/usr/local/lib:$LD_LIBRARY_PATH 
export PATH=$PGHOME/bin:$PATH:. 
export PGDATA=/paic/pg6666/data
export DATE=`date +"%Y%m%d"`

if [ -z "$1" ]; then
  echo "1st argument is empty!"
else
  if [ -z "$2" ]; then
    echo "2nd argument is empty!"
  else
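    if [ -f "$PGDATA/postmaster.pid" ]; then
      echo "shutdown database first!"
    elif ! cd "$PGDATA"; then
      # Never run "rm -rf" unless we are really inside the data directory.
      echo "cannot cd to $PGDATA"
    else
      # Wipe the data directory and restore today's base backup.
      rm -rf ./*
      cp -r /paic/postgres/home/postgres/pg_bak/$DATE/data/* "$PGDATA"/
      # Discard the WAL copied with the backup; it is replayed from the archive instead.
      cd "$PGDATA/pg_xlog" && rm -rf ./*
      cd "$PGDATA"
      cp $PGHOME/share/recovery.conf.sample ./recovery.conf
      echo restore_command = \'cp /paic/pg6666/pg_archlog/$DATE/%f %p\' >> ./recovery.conf
      echo recovery_target_time = \'$1 $2\' >> ./recovery.conf
      # The restored copy still contains the postmaster.pid captured during the
      # backup, so pg_ctl warns that another server might be running.
      pg_ctl start
    fi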
  fi
fi

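Adjust the directories in backup.sh and recovery.sh to match your own environment. Note that recovery.sh derives both the backup directory and the archive directory from the current date, so as written it can only restore a backup taken earlier the same day.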
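
Because the archive is split into one directory per day while the restore_command in recovery.sh looks only in today's directory, segments archived on an earlier day would not be found. One possible workaround is a restore helper that searches every dated sub-directory. This is a sketch under assumptions: the function name restore_wal and its argument order are invented here. It returns non-zero when the segment is missing, which a restore_command must do so that PostgreSQL knows it has reached the end of the archive.

```shell
#!/bin/bash
# restore_wal FNAME DEST ARCHIVE_ROOT   (hypothetical helper)
# Looks for FNAME in every sub-directory of ARCHIVE_ROOT and copies the
# first match to DEST. Returns non-zero when FNAME is not in the archive.
restore_wal() {
  local fname="$1" dest="$2" root="$3" candidate
  for candidate in "$root"/*/"$fname"; do
    # If the glob matches nothing it stays a literal pattern and -f fails.
    if [ -f "$candidate" ]; then
      cp "$candidate" "$dest"
      return $?
    fi
  done
  return 1
}
```

Saved in a script that ends with restore_wal "$@", it could be referenced as restore_command = '/path/to/restore_wal.sh %f %p /paic/pg6666/pg_archlog', again with a placeholder path.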

Simulating failure recovery:
1. Preparation
-bash-4.1$ psql
psql (9.4.0)
Type "help" for help.
postgres=# \c mydb alex
You are now connected to database "mydb" as user "alex".
Check the initial state of the database
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | test | table | alex
(1 row)
At this point the mydb database contains only the table test

Create table aaa
mydb=# create table aaa(id int);
CREATE TABLE
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | aaa | table | alex
public | test | table | alex
(2 rows)
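mydb=# checkpoint; -- make sure the changes are flushed to the data files (optional)
CHECKPOINT
mydb=# select pg_switch_xlog(); -- force a WAL segment switch so the changes reach the archive (optional)
pg_switch_xlog
----------------
0/E6000120
(1 row)
The steps above were completed before 2017-12-22 13:52:00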

Next, at 2017-12-22 13:53:00, create table bbb
mydb=# create table bbb(id int);
CREATE TABLE
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | aaa | table | alex
public | bbb | table | alex
public | test | table | alex
(3 rows)
mydb=# checkpoint;
CHECKPOINT
mydb=# select pg_switch_xlog();
pg_switch_xlog
----------------
0/E7013FC0
(1 row)
mydb=# \q
The steps above were completed before 2017-12-22 13:54:00

2. Now try rolling the database back to a specified point in time
-bash-4.1$ pg_ctl stop -m fast
waiting for server to shut down..... done
server stopped
-bash-4.1$ cd # my recovery.sh is in the home directory, so switch to it first
-bash-4.1$ . recovery.sh 2017-12-22 13:52:00 # run the script with $1 = 2017-12-22 and $2 = 13:52:00
pg_ctl: another server might be running; trying to start server anyway
server starting
-bash-4.1$ 2017-12-22 13:54:26 HKT:undefined:[13323]: LOG: redirecting log output to logging collector process
2017-12-22 13:54:26 HKT:undefined:[13323]: HINT: Future log output will appear in directory "/paic/pg6666/data/pg_log".
-bash-4.1$
-bash-4.1$
-bash-4.1$ psql
psql (9.4.0)
Type "help" for help.
postgres=# \c mydb alex
You are now connected to database "mydb" as user "alex".
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | aaa | table | alex
public | test | table | alex
(2 rows)
mydb=# \q
This shows that the database has been rolled back to 2017-12-22 13:52:00, the state just after table aaa was created
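
recovery.sh only checks that its two arguments are non-empty, and it does so after nothing else, so a typo in the date is noticed only once the data directory has already been wiped. A sketch of validating the arguments first and producing the two recovery.conf lines is shown below. The function name recovery_conf_lines is an assumption for this example, and the check covers only the shape of the values (digits and separators), not whether they form a real calendar date.

```shell
#!/bin/bash
# recovery_conf_lines DATE TIME ARCHIVE_DIR   (hypothetical helper)
# Prints the two settings that recovery.sh appends to recovery.conf.
# Fails if DATE is not shaped like YYYY-MM-DD or TIME like HH:MM:SS.
recovery_conf_lines() {
  local day="$1" time="$2" archive_dir="$3"
  case "$day" in
    [0-9][0-9][0-9][0-9]-[0-9][0-9]-[0-9][0-9]) ;;
    *) echo "bad date: $day" >&2; return 1 ;;
  esac
  case "$time" in
    [0-9][0-9]:[0-9][0-9]:[0-9][0-9]) ;;
    *) echo "bad time: $time" >&2; return 1 ;;
  esac
  # %%f and %%p become the literal %f and %p that PostgreSQL substitutes.
  printf "restore_command = 'cp %s/%%f %%p'\n" "$archive_dir"
  printf "recovery_target_time = '%s %s'\n" "$day" "$time"
}
```

For the run above, recovery_conf_lines 2017-12-22 13:52:00 /paic/pg6666/pg_archlog/20171222 would print the same two lines that recovery.sh writes.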

Continue testing
-bash-4.1$ pg_ctl stop
waiting for server to shut down.... done
server stopped
-bash-4.1$ cd
-bash-4.1$ . recovery.sh 2017-12-22 13:53:10
pg_ctl: another server might be running; trying to start server anyway
server starting
-bash-4.1$ 2017-12-22 13:56:39 HKT:undefined:[13390]: LOG: redirecting log output to logging collector process
2017-12-22 13:56:39 HKT:undefined:[13390]: HINT: Future log output will appear in directory "/paic/pg6666/data/pg_log".
-bash-4.1$
-bash-4.1$ psql
psql (9.4.0)
Type "help" for help.
postgres=# \c mydb alex
You are now connected to database "mydb" as user "alex".
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | aaa | table | alex
public | bbb | table | alex
public | test | table | alex
(3 rows)
This shows that the database has been rolled back to 2017-12-22 13:53:10, the state just after table bbb was created
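
In releases that use recovery.conf, such as the 9.4 server here, the file is renamed to recovery.done once recovery has completed. Looking at which of the two files exists is therefore a quick way to tell whether a restored cluster has left recovery. A minimal sketch follows; the function name recovery_state and the three words it prints are made up for this example.

```shell
#!/bin/bash
# recovery_state PGDATA   (hypothetical helper)
# Prints "finished" if recovery.done exists, "pending" if recovery.conf
# is still present (recovery running or paused), and "none" otherwise.
recovery_state() {
  local pgdata="$1"
  if [ -f "$pgdata/recovery.done" ]; then
    echo finished
  elif [ -f "$pgdata/recovery.conf" ]; then
    echo pending
  else
    echo none
  fi
}
```

It would be called as recovery_state /paic/pg6666/data after pg_ctl start.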

3. Now simulate rolling back an accidental table drop
At 2017-12-22 14:01:00, drop table test
mydb=# drop table test;
DROP TABLE
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | aaa | table | alex
public | bbb | table | alex
(2 rows)
mydb=# checkpoint;
CHECKPOINT
mydb=# select pg_switch_xlog();
pg_switch_xlog
----------------
0/E9005140
(1 row)
mydb=# \q

Run recovery.sh to roll the database back
-bash-4.1$ pg_ctl stop
waiting for server to shut down.... done
server stopped
-bash-4.1$ cd
-bash-4.1$ . recovery.sh 2017-12-22 14:00:00
pg_ctl: another server might be running; trying to start server anyway
server starting
-bash-4.1$ 2017-12-22 14:02:09 HKT:undefined:[13583]: LOG: redirecting log output to logging collector process
2017-12-22 14:02:09 HKT:undefined:[13583]: HINT: Future log output will appear in directory "/paic/pg6666/data/pg_log".
-bash-4.1$
-bash-4.1$ psql
psql (9.4.0)
Type "help" for help.
postgres=# \c mydb alex
You are now connected to database "mydb" as user "alex".
mydb=# \d
List of relations
Schema | Name | Type | Owner
--------+------+-------+-------
public | aaa | table | alex
public | bbb | table | alex
public | test | table | alex
(3 rows)
This shows that the database has been rolled back to 2017-12-22 14:00:00, the state before table test was dropped
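
All three targets used above are later than the base backup. A target earlier than the backup cannot be reached, because WAL replay only moves forward from the backup. The sketch below compares a requested target with the start time recorded in the backup's backup_label file. It assumes that file contains a line of the form "START TIME: YYYY-MM-DD HH:MM:SS TZ" and that the target is given in the same time zone; the function names are invented for this example. Strictly speaking the target must be later than the end of the backup, so passing this check is necessary but not sufficient.

```shell
#!/bin/bash
# backup_start_time LABEL_FILE   (hypothetical helper)
# Prints the "YYYY-MM-DD HH:MM:SS" part of the START TIME line.
backup_start_time() {
  sed -n 's/^START TIME: \([0-9-]* [0-9:]*\).*/\1/p' "$1"
}

# target_after_backup LABEL_FILE "YYYY-MM-DD HH:MM:SS"   (hypothetical helper)
# Succeeds when the target is strictly later than the backup start time.
# Timestamps in this fixed-width format sort chronologically as strings.
target_after_backup() {
  local start target="$2"
  start=$(backup_start_time "$1")
  [ -n "$start" ] || return 2
  [ "$start" != "$target" ] &&
    printf '%s\n%s\n' "$start" "$target" | LC_ALL=C sort -C
}
```

For example, target_after_backup "$PGDATA/backup_label" "2017-12-22 14:00:00" could be run against the restored copy before starting the server.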

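This experiment is meant only as a simulation in a test environment, to help in understanding the backup and recovery mechanism of postgres. Do not use it in production!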