这次做了一套核心系统的ogg同步,一下是遇到的report报错,做个记录
问题一: OGG-01705 Input checkpoint position.. for input trail file..greater than the size of the file
2019-09-10 09:28:08 ERROR OGG-01705 Input checkpoint position 314748185 for input trail file '/oggfs/dirdat/af000068' is greater than the size of the file (31469181
8). Please consult Oracle Knowledge Management Doc ID 1138409.1. for instructions.
报错已经给出了mos文档号,直接照这mos操作就可以了
原因很简单,因为seq号超过了队列文件大小,所以ogg找不到这个seq号
我这里的队列文件大小是314691818<314748185我输入的seq号
[oracle@jcyqdb02:10.176.34.111:/oggfs]$ ls -lrt /oggfs/dirdat/af0*
-rw-r----- 1 oracle oinstall 1023999656 Sep 6 16:21 /oggfs/dirdat/af000056
-rw-r----- 1 oracle oinstall 158146254 Sep 7 01:13 /oggfs/dirdat/af000057
-rw-r----- 1 oracle oinstall 1023999701 Sep 8 05:02 /oggfs/dirdat/af000058
-rw-r----- 1 oracle oinstall 1023999995 Sep 9 02:11 /oggfs/dirdat/af000059
-rw-r----- 1 oracle oinstall 1023999878 Sep 9 03:44 /oggfs/dirdat/af000060
-rw-r----- 1 oracle oinstall 1023999622 Sep 9 05:30 /oggfs/dirdat/af000061
-rw-r----- 1 oracle oinstall 1023999902 Sep 9 06:33 /oggfs/dirdat/af000062
-rw-r----- 1 oracle oinstall 1023999726 Sep 9 07:19 /oggfs/dirdat/af000063
-rw-r----- 1 oracle oinstall 1023999847 Sep 9 08:37 /oggfs/dirdat/af000064
-rw-r----- 1 oracle oinstall 1023999784 Sep 9 10:43 /oggfs/dirdat/af000065
-rw-r----- 1 oracle oinstall 1023999817 Sep 9 15:21 /oggfs/dirdat/af000066
-rw-r----- 1 oracle oinstall 1023999639 Sep 10 05:17 /oggfs/dirdat/af000067
-rw-r----- 1 oracle oinstall 314691818 Sep 10 09:13 /oggfs/dirdat/af000068
-rw-r----- 1 oracle oinstall 135205227 Sep 10 09:27 /oggfs/dirdat/af000069
-rw-r----- 1 oracle oinstall 281407673 Sep 10 10:28 /oggfs/dirdat/af000070
-rw-r----- 1 oracle oinstall 230887576 Sep 10 10:54 /oggfs/dirdat/af000071
-rw-r----- 1 oracle oinstall 516095204 Sep 10 10:59 /oggfs/dirdat/af000072
解决办法:
start * filterduptransactions
或者是seq输入有问题
问题二:OGG-01028: Incompatible record (102)
OGG-01028 Incompatible record (102) in /oggfs/dirdat/ac000061, rba 117653457 (getting header).
记录损坏,也可能是整个队列文件损坏
使用logdump去定位问题
Logdump 217 >open /oggfs/dirdat/ac000061
Current LogTrail is /oggfs/dirdat/ac000061
Logdump 224 >pos 117653457
Reading forward from RBA 117653457
Logdump 219 >count
** Count begins at RBA 117344506
Bad record found at (RBA 117653457, format 5.50 Unknown TokenID ffff)
4701 012a 4800 0036 4504 0041 00c8 05ff 02f2 b85a | G..*H..6E..A.......Z
964e 4b79 0000 0000 0bf7 e9dc 0000 44fb 0152 0000 | .NKy..........D..R..
0001 4550 4d5f 53ff ffff ffff ffff ffff ffff ffff | ..EPM_S.............
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff | .................. --说明记录损坏
LogTrail /oggfs/dirdat/ac000061 has 1036 records
Total Data Bytes 207204
Avg Bytes/Record 200
Insert 1036
After Images 1036
Average of 6 Transactions
Bytes/Trans ..... 42822
Records/Trans ... 172
Files/Trans ..... 1
Logdump 238 >n
Bad record found at (RBA 117653457, format 5.50 Unknown TokenID ffff)
4701 012a 4800 0036 4504 0041 00c8 05ff 02f2 b85a | G..*H..6E..A.......Z
964e 4b79 0000 0000 0bf7 e9dc 0000 44fb 0152 0000 | .NKy..........D..R..
0001 4550 4d5f 53ff ffff ffff ffff ffff ffff ffff | ..EPM_S.............
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff | ..................
Logdump 239 >n
Bad record found at (RBA 117653457, format 5.50 Unknown TokenID ffff)
4701 012a 4800 0036 4504 0041 00c8 05ff 02f2 b85a | G..*H..6E..A.......Z
964e 4b79 0000 0000 0bf7 e9dc 0000 44fb 0152 0000 | .NKy..........D..R..
0001 4550 4d5f 53ff ffff ffff ffff ffff ffff ffff | ..EPM_S.............
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff | ..................
--多次next不会调到下一条记录,说明整个队列文件损坏
alter replicat1 extseqno 62 extrba nextnumber1--跳过这个记录
alter replicat1 seqno 63 --跳过整个队列文件
根据队列文件生成时间,去源端重传队列文件,并在rba事务发生时间之前去启动replicat并加上handcollision参数
(在有主键或者有唯一索引的情况下handcollision也可以保证数据一致性,如果没有数据无法保证一致)
(其实这个问题相对要复杂一点点,队列文件损坏需要确认投递过程中损坏还是抽取损坏,如果是投递损坏可以重新投递,如果是抽取损坏需要重新抽取并投递)
问题三:
OGG-01032:There is a problem in network communication
2019-09-10 11:03:03 ERROR OGG-01032 There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using
ENCRYPT) or an unknown error. Length is 3 - 000000: 46 45 00 |FE. |.
2019-09-10 11:03:03 ERROR OGG-01668 PROCESS ABENDING.
初看以为是网络问题,直接去找网络工程师,结果非常尴尬
实际上目标端mgr都挂了,端口没开
问题四:
GGSCI (cdyxdb2) 64> alter DPEYKA etrollover
2019-09-10 11:21:45 INFO OGG-01520 Rollover performed. For each affected output trail of Version 10 or higher format, after starting the source extract, issue ALTER EXTSEQNO for that trail's reader (either pump EXTRACT or REPLICAT) to move the reader's scan to the new trail file; it will not happen automatically.
EXTRACT altered.
GGSCI (cdyxdb2) 94> !
info DPEYKA
EXTRACT DPEYKA Last Started 2019-09-10 11:22 Status RUNNING
Checkpoint Lag 04:21:44 (updated 00:00:20 ago)
Log Read Checkpoint File ./dirdat/ya010454
2019-09-10 07:00:41.000000 RBA 92947305
GGSCI (cdyxdb2) 95> !
info DPEYKA
EXTRACT DPEYKA Last Started 2019-09-10 11:22 Status RUNNING
Checkpoint Lag 04:16:43 (updated 00:00:00 ago)
Log Read Checkpoint File ./dirdat/ya010455
2019-09-10 07:06:25.000000 RBA 481294966