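SCN: System Change Number
The SCN is a logical, internal timestamp in an Oracle database that identifies the committed version of the database at a precise point in time. When a transaction commits, it is assigned a unique SCN that identifies that transaction, and Oracle relies on SCNs to keep the database consistent.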
SQL> select dbms_flashback.get_system_change_number, SCN_TO_TIMESTAMP(dbms_flashback.get_system_change_number) from dual;

GET_SYSTEM_CHANGE_NUMBER SCN_TO_TIMESTAMP(DBMS_FLASHBACK.GET_SYSTEM_CHANGE_NUMBER)
------------------------ -------------------------------------------------------
                 1819076 06-JUL-13 11.40.12.000000000 PM

SQL> select current_scn from v$database;

CURRENT_SCN
-----------
    1819065
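SCNs appear everywhere in the database: the control file, the datafile headers, and the redo log files all record SCNs.
In the control file
System checkpoint SCN (System Checkpoint SCN)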
SQL> select checkpoint_change# from v$database;

CHECKPOINT_CHANGE#
------------------
           1809219
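Datafile checkpoint SCN (Datafile Checkpoint SCN, shown as CHECKPOINT_CHANGE# in v$datafile)
Datafile stop SCN (Stop SCN, shown as LAST_CHANGE# in v$datafile)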
SQL> select name,checkpoint_change#,last_change# from v$datafile;

NAME                                          CHECKPOINT_CHANGE# LAST_CHANGE#
--------------------------------------------- ------------------ ------------
+DATA/orcl/datafile/system.256.817343229                 1809219
+DATA/orcl/datafile/sysaux.257.817343231                 1809219
+DATA/orcl/datafile/undotbs1.258.817343231               1809219
+DATA/orcl/datafile/users.259.817343231                  1809219
+DATA/orcl/datafile/example.265.817343543                1809219
In the datafile header
Start SCN (Start SCN)
SQL> select checkpoint_change# from v$datafile_header;

CHECKPOINT_CHANGE#
------------------
           1809219
           1809219
           1809219
           1809219
           1809219
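In the redo log files
FIRST SCN: the SCN of the first redo record in the redo log file
NEXT SCN: the SCN of the last redo record in the redo log file (that is, the SCN of the first redo record in the next redo log file)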
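Normally a log switch happens only when the current redo log group fills up. The ARCHIVE_LAG_TARGET parameter can be set to force a switch after a given time interval, and a switch can also be forced manually when necessary (for example with ALTER SYSTEM SWITCH LOGFILE).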
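When a redo log group fills up, Oracle switches automatically to the next group. The high SCN of one group is the low SCN of the next, and the high SCN of the CURRENT log file is reported as infinity (0xFFFFFFFFFFFF, which is 281474976710655 in decimal).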
SQL> select group#,sequence#,status,first_change#,next_change# from v$log;

    GROUP#  SEQUENCE# STATUS           FIRST_CHANGE#       NEXT_CHANGE#
---------- ---------- ---------------- ------------- ------------------
         1         34 INACTIVE               1746572            1770739
         2         35 INACTIVE               1770739            1808596
         3         36 CURRENT                1808596    281474976710655
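The chaining of FIRST SCN and NEXT SCN across log groups can be checked with a few lines of Python. This is only a sketch: the rows are copied by hand from the v$log output above, and the helper names `check_scn_chain` and `current_group` are made up for illustration.

```python
# Rows copied from the v$log output above:
# (group#, sequence#, status, first_change#, next_change#)
LOG_ROWS = [
    (1, 34, "INACTIVE", 1746572, 1770739),
    (2, 35, "INACTIVE", 1770739, 1808596),
    (3, 36, "CURRENT", 1808596, 281474976710655),
]

# The "infinite" NEXT SCN shown for the CURRENT group is the largest 48-bit value.
MAX_SCN_48BIT = 2**48 - 1


def check_scn_chain(rows):
    """Return True if each log's NEXT SCN equals the following log's FIRST SCN."""
    ordered = sorted(rows, key=lambda r: r[1])  # order by sequence#
    for prev, nxt in zip(ordered, ordered[1:]):
        if prev[4] != nxt[3]:
            return False
    return True


def current_group(rows):
    """Return the group numbers whose NEXT SCN is still the 'infinite' sentinel."""
    return [r[0] for r in rows if r[4] == MAX_SCN_48BIT]


print(hex(MAX_SCN_48BIT))         # 0xffffffffffff
print(check_scn_chain(LOG_ROWS))  # True
print(current_group(LOG_ROWS))    # [3]
```

Sorting by sequence# rather than group# matters because groups are reused in a circle, so the group number alone does not give the order in which the logs were written.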
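Instance crash recovery:
When the database is opened, Oracle uses the control file to perform the following checks:
It checks whether the Start SCN recorded in each datafile header matches the System Checkpoint SCN recorded in the control file; if they differ, media recovery is required.
It checks whether the Start SCN recorded in each datafile header also matches the Stop SCN recorded in the control file; if they differ, instance recovery is required.
Only when both checks pass, which means every modified data block has already been written to the datafiles, can the database be opened normally.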
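While the database is open and running normally, the system SCN, the datafile SCNs, and the Start SCNs in the datafile headers are all consistent, and they are greater than or equal to the smallest FIRST SCN among the ACTIVE/CURRENT log files; the datafile Stop SCN, however, is NULL (infinity).
When the database is shut down cleanly, Oracle performs a full checkpoint that writes all dirty buffers in the buffer cache to disk. At the same time it updates the system SCN, datafile SCNs, and Stop SCNs in the control file and the Start SCNs in the datafile headers to the shutdown point. All of these SCNs then match and the LRBA pointer equals the on-disk RBA; otherwise a roll forward would be needed.
When the instance is started after an abnormal shutdown (crash or power loss), Oracle finds that the system SCN, the datafile SCNs, and the Start SCNs in the datafile headers all match, but the Stop SCN is NULL. It then uses the LRBA (roll-forward start point) and the on-disk RBA (roll-forward end point) recorded in the control file to locate the relevant redo records in the redo log files that take part in crash recovery, applies them, and only then can the database be opened.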
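The two checks described above can be written down as a small decision function. This is a simplified model of the rule as these notes state it, not Oracle's actual implementation; `classify_startup` is a hypothetical name, and the system checkpoint SCN used for the crash case (1822576) is assumed to equal the datafile SCNs, since it is not queried separately in these notes.

```python
from typing import List, Optional, Tuple


def classify_startup(system_checkpoint_scn: int,
                     datafiles: List[Tuple[int, Optional[int]]]) -> str:
    """Classify the state found at open time.

    datafiles holds one (start_scn, stop_scn) pair per datafile: start_scn
    comes from the datafile header, stop_scn from the control file
    (None models a NULL LAST_CHANGE#).
    """
    # Check 1: header Start SCN vs. System Checkpoint SCN.
    if any(start != system_checkpoint_scn for start, _ in datafiles):
        return "media recovery"
    # Check 2: header Start SCN vs. Stop SCN (a NULL Stop SCN never matches).
    if any(stop is None or stop != start for start, stop in datafiles):
        return "instance recovery"
    return "clean"


# Clean shutdown: every SCN equals 1822573.
clean = classify_startup(1822573, [(1822573, 1822573)] * 5)
# Crash: SCNs match, but the Stop SCN is NULL.
crashed = classify_startup(1822576, [(1822576, None)] * 5)
# Hypothetical case: one datafile header is older than the control file.
stale_file = classify_startup(1822576, [(1822576, None)] * 4 + [(1809219, None)])

print(clean, crashed, stale_file, sep=" | ")
```

The order of the checks matters in this model: a datafile whose header lags behind the control file is reported as needing media recovery even when its Stop SCN is also NULL.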
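Instance recovery in detail:
Roll-forward phase (rolling forward relies on redo; it is also called cache recovery, and it restores changes that were already in memory but had not yet been written to the datafiles)
Oracle rolls forward by replaying the records in the redo log files whether or not they were committed, so after the roll forward the datafiles may contain uncommitted data (which is why the later rollback phase is needed).
In addition, because generating undo is itself recorded in the redo log, replaying the redo also regenerates the undo information that the later rollback needs.
Database open phase
Once the roll forward is complete and all modified data blocks have been written to the datafiles, the database can be opened normally.
Rollback phase (rolling back relies on undo; it is also called transaction recovery, and it rolls back the transactions that had not committed when the instance crashed)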
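The two phases can be illustrated with a toy model. It is a deliberately simplified sketch, not how Oracle stores redo or undo: blocks are dictionary keys, each redo record carries both the old and the new value, and each key is assumed to be touched by only one transaction (which row locks would guarantee in a real database). The function name `crash_recover` is hypothetical.

```python
def crash_recover(datafile, redo_log, committed):
    """Toy crash recovery.

    datafile  : dict of block -> value as found on disk after the crash
    redo_log  : list of (txn, block, old_value, new_value) in log order
    committed : set of transaction ids that committed before the crash
    Returns (state after roll forward, state after rollback).
    """
    blocks = dict(datafile)
    undo = []

    # Roll forward (cache recovery): replay every redo record,
    # committed or not, and rebuild the undo information on the way.
    for txn, block, old_value, new_value in redo_log:
        blocks[block] = new_value
        undo.append((txn, block, old_value))
    after_roll_forward = dict(blocks)

    # Rollback (transaction recovery): apply undo in reverse order
    # for transactions that never committed.
    for txn, block, old_value in reversed(undo):
        if txn not in committed:
            blocks[block] = old_value
    return after_roll_forward, blocks


on_disk = {"A": 1, "B": 10}
redo = [("T1", "A", 1, 2), ("T2", "B", 10, 20), ("T1", "A", 2, 3)]
rolled_forward, recovered = crash_recover(on_disk, redo, committed={"T1"})

print(rolled_forward)  # {'A': 3, 'B': 20}  (T2's uncommitted change is present)
print(recovered)       # {'A': 3, 'B': 10}  (T2 has been rolled back)
```

The intermediate state shows why the rollback phase is needed: right after the roll forward, block B still holds the value written by the uncommitted transaction T2.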
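After a clean shutdown:
The system SCN, the datafile SCNs, the Stop SCNs, and the Start SCNs in the datafile headers are all equal, and they are greater than or equal to the smallest FIRST SCN among the ACTIVE/CURRENT log files.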
SQL> shutdown immediate
Database closed.
Database dismounted.
ORACLE instance shut down.
SQL> startup mount
ORACLE instance started.

Total System Global Area  459304960 bytes
Fixed Size                  2214336 bytes
Variable Size             289408576 bytes
Database Buffers          159383552 bytes
Redo Buffers                8298496 bytes
Database mounted.
SQL> select checkpoint_change# from v$database;

CHECKPOINT_CHANGE#
------------------
           1822573

SQL> select name,checkpoint_change#,last_change# from v$datafile;

NAME                                          CHECKPOINT_CHANGE# LAST_CHANGE#
--------------------------------------------- ------------------ ------------
+DATA/orcl/datafile/system.256.817343229                 1822573      1822573
+DATA/orcl/datafile/sysaux.257.817343231                 1822573      1822573
+DATA/orcl/datafile/undotbs1.258.817343231               1822573      1822573
+DATA/orcl/datafile/users.259.817343231                  1822573      1822573
+DATA/orcl/datafile/example.265.817343543                1822573      1822573

SQL> select name,checkpoint_change# from v$datafile_header;

NAME                                          CHECKPOINT_CHANGE#
--------------------------------------------- ------------------
+DATA/orcl/datafile/system.256.817343229                 1822573
+DATA/orcl/datafile/sysaux.257.817343231                 1822573
+DATA/orcl/datafile/undotbs1.258.817343231               1822573
+DATA/orcl/datafile/users.259.817343231                  1822573
+DATA/orcl/datafile/example.265.817343543                1822573

SQL> select group#,sequence#,status,first_change#,next_change# from v$log;

    GROUP#  SEQUENCE# STATUS           FIRST_CHANGE#       NEXT_CHANGE#
---------- ---------- ---------------- ------------- ------------------
         1         37 CURRENT                1822207    281474976710655
         3         36 INACTIVE               1808596            1822207
         2         35 INACTIVE               1770739            1808596
After the database is opened normally:
The datafile Stop SCN is NULL (infinity).
SQL> alter database open;

Database altered.

SQL> select name,checkpoint_change#,last_change# from v$datafile;

NAME                                          CHECKPOINT_CHANGE# LAST_CHANGE#
--------------------------------------------- ------------------ ------------
+DATA/orcl/datafile/system.256.817343229                 1822576
+DATA/orcl/datafile/sysaux.257.817343231                 1822576
+DATA/orcl/datafile/undotbs1.258.817343231               1822576
+DATA/orcl/datafile/users.259.817343231                  1822576
+DATA/orcl/datafile/example.265.817343543                1822576
After an abnormal shutdown (instance crash):
The datafile Stop SCN is still NULL (infinity).
SQL> shutdown abort
ORACLE instance shut down.
SQL> startup mount
ORACLE instance started.

Total System Global Area  459304960 bytes
Fixed Size                  2214336 bytes
Variable Size             289408576 bytes
Database Buffers          159383552 bytes
Redo Buffers                8298496 bytes
Database mounted.
SQL> select name,checkpoint_change#,last_change# from v$datafile;

NAME                                          CHECKPOINT_CHANGE# LAST_CHANGE#
--------------------------------------------- ------------------ ------------
+DATA/orcl/datafile/system.256.817343229                 1822576
+DATA/orcl/datafile/sysaux.257.817343231                 1822576
+DATA/orcl/datafile/undotbs1.258.817343231               1822576
+DATA/orcl/datafile/users.259.817343231                  1822576
+DATA/orcl/datafile/example.265.817343543                1822576
Opening the database now triggers instance recovery:
SQL> alter database open;

Database altered.

$ tailf /u01/app/oracle/diag/rdbms/orcl/orcl/trace/alert_orcl.log
Sun Jul 07 00:10:07 2013
alter database open
Beginning crash recovery of 1 threads
 parallel recovery started with 3 processes
Started redo scan
Completed redo scan
 read 192 KB redo, 87 data blocks need recovery
Started redo application at
 Thread 1: logseq 37, block 533
Recovery of Online Redo Log: Thread 1 Group 1 Seq 37 Reading mem 0
  Mem# 0: +DATA/orcl/onlinelog/group_1.261.817343457
  Mem# 1: +FRA/orcl/onlinelog/group_1.257.817343463
Completed redo application of 0.15MB
Completed crash recovery at
 Thread 1: logseq 37, block 918, scn 1843004
 87 data blocks read, 87 data blocks written, 192 redo k-bytes read
Sun Jul 07 00:10:13 2013
Thread 1 advanced to log sequence 38 (thread open)
Thread 1 opened at log sequence 38
  Current log# 2 seq# 38 mem# 0: +DATA/orcl/onlinelog/group_2.262.817343467
  Current log# 2 seq# 38 mem# 1: +FRA/orcl/onlinelog/group_2.258.817343473
Successful open of redo thread 1
Sun Jul 07 00:10:14 2013
SMON: enabling cache recovery
Successfully onlined Undo Tablespace 2.
Verifying file header compatibility for 11g tablespace encryption..
Verifying 11g file header compatibility for tablespace encryption completed
SMON: enabling tx recovery
Database Characterset is AL32UTF8
No Resource Manager plan active
Sun Jul 07 00:10:17 2013
replication_dependency_tracking turned off (no async multimaster replication found)
Starting background process QMNC
Sun Jul 07 00:10:21 2013
QMNC started with pid=28, OS id=7140
Sun Jul 07 00:10:31 2013
Completed: alter database open
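The redo positions in the alert log can be cross-checked against the reported amount of redo read. The sketch below parses the two "logseq N, block M" positions (each entry joined onto one line) and multiplies the block range by a redo block size. The 512-byte block size is an assumption that is common on Linux but not stated in these notes, and the helper names are hypothetical.

```python
import re

# Two entries from the alert log above, each joined onto a single line.
ALERT_LINES = [
    "Started redo application at Thread 1: logseq 37, block 533",
    "Completed crash recovery at Thread 1: logseq 37, block 918, scn 1843004",
]

REDO_BLOCK_BYTES = 512  # assumption: redo block size is not shown in the log


def redo_position(line):
    """Extract (log sequence, block number) from an alert log entry."""
    match = re.search(r"logseq (\d+), block (\d+)", line)
    if match is None:
        raise ValueError("no redo position found in: " + line)
    return int(match.group(1)), int(match.group(2))


def redo_kb_applied(start_line, end_line, block_bytes=REDO_BLOCK_BYTES):
    """Size in KB of the redo between two positions in the same log sequence."""
    start_seq, start_block = redo_position(start_line)
    end_seq, end_block = redo_position(end_line)
    if start_seq != end_seq:
        raise ValueError("this sketch only handles a single log sequence")
    return (end_block - start_block) * block_bytes / 1024


print(redo_position(ALERT_LINES[0]))  # (37, 533)
print(redo_kb_applied(*ALERT_LINES))  # 192.5
```

Under that assumption, the 385 blocks between block 533 and block 918 come to 192.5 KB, which appears consistent with the "192 redo k-bytes read" line in the log.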