标签云
asm恢复 bbed bootstrap$ dul In Memory kcbzib_kcrsds_1 kccpb_sanity_check_2 MySQL恢复 ORA-00312 ORA-00607 ORA-00704 ORA-00742 ORA-01110 ORA-01555 ORA-01578 ORA-01595 ORA-08103 ORA-600 2131 ORA-600 2662 ORA-600 3020 ORA-600 4000 ORA-600 4137 ORA-600 4193 ORA-600 4194 ORA-600 16703 ORA-600 kcbzib_kcrsds_1 ORA-600 KCLCHKBLK_4 ORA-15042 ORA-15196 ORACLE 12C oracle dul ORACLE PATCH Oracle Recovery Tools oracle加密恢复 oracle勒索 oracle勒索恢复 oracle异常恢复 Oracle 恢复 ORACLE恢复 ORACLE数据库恢复 oracle 比特币 OSD-04016 YOUR FILES ARE ENCRYPTED 勒索恢复 比特币加密文章分类
- Others (2)
- 中间件 (2)
- WebLogic (2)
- 操作系统 (103)
- 数据库 (1,770)
- DB2 (22)
- MySQL (77)
- Oracle (1,611)
- Data Guard (52)
- EXADATA (8)
- GoldenGate (24)
- ORA-xxxxx (166)
- ORACLE 12C (72)
- ORACLE 18C (6)
- ORACLE 19C (15)
- ORACLE 21C (3)
- Oracle 23ai (8)
- Oracle ASM (69)
- Oracle Bug (8)
- Oracle RAC (54)
- Oracle 安全 (6)
- Oracle 开发 (28)
- Oracle 监听 (29)
- Oracle备份恢复 (592)
- Oracle安装升级 (98)
- Oracle性能优化 (62)
- 专题索引 (5)
- 勒索恢复 (86)
- PostgreSQL (30)
- pdu工具 (6)
- PostgreSQL恢复 (9)
- SQL Server (32)
- SQL Server恢复 (13)
- TimesTen (7)
- 达梦数据库 (3)
- 达梦恢复 (1)
- 生活娱乐 (2)
- 至理名言 (11)
- 虚拟化 (2)
- VMware (2)
- 软件开发 (39)
- Asp.Net (9)
- JavaScript (12)
- PHP (2)
- 小工具 (22)
-
最近发表
- Oracle 19c 202507补丁(RUs+OJVM)-19.28
- 2025年的Oracle 8.0.5数据库恢复
- ORA-600 kokiasg1故障分析(obj$中核心字典序列全部被恶意删除)
- ORA-00756 ORA-10567故障数据0丢失恢复
- 数据库文件变成32k故障恢复
- tcp连接过多导致监听TNS-12532 TNS-12560 TNS-00502错误
- 文件系统格式化MySQL数据库恢复
- .sstop勒索加密数据库恢复
- 解决一次硬件恢复之后数据文件0kb的故障恢复case
- Error in invoking target ‘libasmclntsh19.ohso libasmperl19.ohso client_sharedlib’问题处理
- ORA-01171: datafile N going offline due to error advancing checkpoint
- linux环境oracle数据库被文件系统勒索加密为.babyk扩展名溯源
- ORA-600 ksvworkmsgalloc: bad reaper
- ORA-600 krccfl_chunk故障处理
- Oracle Recovery Tools恢复案例总结—202505
- ORA-600 kddummy_blkchk 数据库循环重启
- 记录一次asm disk加入到vg通过恢复直接open库的案例
- CHECKDB 发现了 N 个分配错误和 M 个一致性错误
- 达梦数据库dm.ctl文件异常恢复
- Oracle Recovery Tools修复ORA-00742、ORA-600 ktbair2: illegal inheritance故障
分类目录归档:Oracle
记录一次rman备份ORA-19502/ORA-27063错误原因分析
rman备份出现ORA-19502/ORA-27063错误
RMAN> 2> 3> 4> 5> 6> 7> 8> 9> 10> 11> 12> 13> 14> 15> 16> 17> 18> 19> 20> allocated channel: t11 channel t11: sid=824 instance=ncdb1 devtype=DISK allocated channel: t12 channel t12: sid=838 instance=ncdb1 devtype=DISK allocated channel: t13 channel t13: sid=809 instance=ncdb1 devtype=DISK allocated channel: t14 channel t14: sid=886 instance=ncdb1 devtype=DISK allocated channel: t15 channel t15: sid=620 instance=ncdb1 devtype=DISK allocated channel: t16 channel t16: sid=599 instance=ncdb1 devtype=DISK allocated channel: t17 channel t17: sid=482 instance=ncdb1 devtype=DISK allocated channel: t18 channel t18: sid=506 instance=ncdb1 devtype=DISK 一共开通8个通道 channel t12: starting full datafile backupset channel t12: specifying datafile(s) in backupset input datafile fno=00008 name=/dev/rnc32g_39 input datafile fno=00016 name=/dev/rnc32g_47 input datafile fno=00024 name=/dev/rnc32g_57 input datafile fno=00032 name=/dev/rnc32g_25 input datafile fno=00040 name=/dev/rnc32g_33 input datafile fno=00048 name=/dev/rnc32g_3 input datafile fno=00056 name=/dev/rnc32g_11 input datafile fno=00064 name=/dev/rnc32g_19 input datafile fno=00072 name=/dev/rnc32g_67 input datafile fno=00080 name=/dev/rnc32g_106 input datafile fno=00088 name=/dev/rnc32g_114 input datafile fno=00096 name=/dev/rnc32g_87 input datafile fno=00104 name=/dev/rnc32g_95 input datafile fno=00112 name=/dev/rnc32g_103 input datafile fno=00120 name=/dev/rnc32g_75 input datafile fno=00003 name=/dev/rnc50_sysaux input datafile fno=00130 name=/dev/rnc32g_119 channel t12: starting piece 1 at 14-MAY-12 --通道12备份数据文件 channel t17: starting full datafile backupset channel t17: specifying datafile(s) in backupset input datafile fno=00002 name=/dev/rnc32g_22 input datafile fno=00013 name=/dev/rnc32g_44 input datafile fno=00021 name=/dev/rnc32g_54 input datafile fno=00029 name=/dev/rnc32g_62 input datafile fno=00037 name=/dev/rnc32g_30 input datafile fno=00045 name=/dev/rnc32g_38 input datafile fno=00053 name=/dev/rnc32g_8 input datafile fno=00061 name=/dev/rnc32g_16 input datafile fno=00069 name=/dev/rnc32g_64 input datafile fno=00077 name=/dev/rncundo_33g_4 input datafile fno=00085 name=/dev/rnc32g_111 input datafile fno=00093 name=/dev/rnc32g_84 input datafile fno=00101 name=/dev/rnc32g_92 input datafile fno=00109 name=/dev/rnc32g_100 input datafile fno=00117 name=/dev/rnc32g_72 input datafile fno=00006 name=/dev/rnc50_4g_1 channel t17: starting piece 1 at 14-MAY-12 --通道17备份数据文件 channel t15: finished piece 1 at 15-MAY-12 piece handle=/rman/db_mpnb04jl_1_1 tag=TAG20120514T204954 comment=NONE channel t15: backup set complete, elapsed time: 06:07:59 channel t11: finished piece 1 at 15-MAY-12 piece handle=/rman/db_mlnb04jk_1_1 tag=TAG20120514T204954 comment=NONE channel t11: backup set complete, elapsed time: 06:17:25 channel t16: finished piece 1 at 15-MAY-12 piece handle=/rman/db_mqnb04jm_1_1 tag=TAG20120514T204954 comment=NONE channel t16: backup set complete, elapsed time: 06:34:49 channel t14: finished piece 1 at 15-MAY-12 piece handle=/rman/db_monb04jl_1_1 tag=TAG20120514T204954 comment=NONE channel t14: backup set complete, elapsed time: 06:40:05 channel t18: finished piece 1 at 15-MAY-12 piece handle=/rman/db_msnb04jn_1_1 tag=TAG20120514T204954 comment=NONE channel t18: backup set complete, elapsed time: 06:43:38 channel t13: finished piece 1 at 15-MAY-12 piece handle=/rman/db_mnnb04jl_1_1 tag=TAG20120514T204954 comment=NONE channel t13: backup set complete, elapsed time: 07:40:56 --这里可以看出rman的备份完成了通道11/13/14/15/16/18,也就是说目前为止通道12/17未完成. RMAN-03009: failure of backup command on t12 channel at 05/15/2012 04:39:58 ORA-19502: write error on file "/rman/db_mmnb04jl_1_1", blockno 30481025 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device Additional information: -1 Additional information: 1048576 ORA-19502: write error on file "/rman/db_mmnb04jl_1_1", blockno 30480897 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device channel t12 disabled, job failed on it will be run on another channel --通道12报错(硬盘空间不足) channel t11: starting full datafile backupset channel t11: specifying datafile(s) in backupset input datafile fno=00008 name=/dev/rnc32g_39 input datafile fno=00016 name=/dev/rnc32g_47 input datafile fno=00024 name=/dev/rnc32g_57 input datafile fno=00032 name=/dev/rnc32g_25 input datafile fno=00040 name=/dev/rnc32g_33 input datafile fno=00048 name=/dev/rnc32g_3 input datafile fno=00056 name=/dev/rnc32g_11 input datafile fno=00064 name=/dev/rnc32g_19 input datafile fno=00072 name=/dev/rnc32g_67 input datafile fno=00080 name=/dev/rnc32g_106 input datafile fno=00088 name=/dev/rnc32g_114 input datafile fno=00096 name=/dev/rnc32g_87 input datafile fno=00104 name=/dev/rnc32g_95 input datafile fno=00112 name=/dev/rnc32g_103 input datafile fno=00120 name=/dev/rnc32g_75 input datafile fno=00003 name=/dev/rnc50_sysaux input datafile fno=00130 name=/dev/rnc32g_119 channel t11: starting piece 1 at 15-MAY-12 --在通道12报错后,通道11已经完成了上次备份,所以启动备份通道12出错的数据文件 RMAN-03009: failure of backup command on t17 channel at 05/15/2012 04:39:58 ORA-19502: write error on file "/rman/db_mrnb04jm_1_1", blockno 30753793 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device Additional information: -1 Additional information: 1048576 ORA-19502: write error on file "/rman/db_mrnb04jm_1_1", blockno 30753665 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device channel t17 disabled, job failed on it will be run on another channel --通道17也因为磁盘空间报错 channel t13: starting full datafile backupset channel t13: specifying datafile(s) in backupset input datafile fno=00002 name=/dev/rnc32g_22 input datafile fno=00013 name=/dev/rnc32g_44 input datafile fno=00021 name=/dev/rnc32g_54 input datafile fno=00029 name=/dev/rnc32g_62 input datafile fno=00037 name=/dev/rnc32g_30 input datafile fno=00045 name=/dev/rnc32g_38 input datafile fno=00053 name=/dev/rnc32g_8 input datafile fno=00061 name=/dev/rnc32g_16 input datafile fno=00069 name=/dev/rnc32g_64 input datafile fno=00077 name=/dev/rncundo_33g_4 input datafile fno=00085 name=/dev/rnc32g_111 input datafile fno=00093 name=/dev/rnc32g_84 input datafile fno=00101 name=/dev/rnc32g_92 input datafile fno=00109 name=/dev/rnc32g_100 input datafile fno=00117 name=/dev/rnc32g_72 input datafile fno=00006 name=/dev/rnc50_4g_1 channel t13: starting piece 1 at 15-MAY-12 --通道13也尝试备份通道17失败的数据文件 RMAN-03009: failure of backup command on t11 channel at 05/15/2012 04:39:59 ORA-19504: failed to create file "/rman/db_mtnb104u_1_1" ORA-27044: unable to write the header block of file IBM AIX RISC System/6000 Error: 28: No space left on device Additional information: 3 Addition --因为当前没有空闲空间,通道11终止, --这个时候rman异常终止,导致后续的通道13终止记录未打印到日志
阅读完rman日志,很好理解因为存放rman备份的磁盘空间不足导致了一系列错误
检查磁盘剩余空间
Filesystem GB blocks Free %Used Iused %Iused Mounted on /dev/hd4 10.00 9.75 3% 6548 1% / /dev/hd2 10.00 4.55 55% 84383 8% /usr /dev/hd9var 5.00 4.04 20% 6290 1% /var /dev/hd3 5.00 3.87 23% 1551 1% /tmp /dev/hd1 10.00 9.91 1% 382 1% /home /proc - - - - - /proc /dev/hd10opt 5.00 4.89 3% 3502 1% /opt /dev/archalv 99.00 82.98 17% 96 1% /archa /dev/fslv01 40.00 19.49 52% 72324 2% /ora10 /dev/fslv00 1800.00 467.25 75% 10 1% /rman
这下让人迷糊了,磁盘空间还剩余467.25G,怎么会报错呢?
分析原因
RMAN-03009: failure of backup command on t12 channel at 05/15/2012 04:39:58 ORA-19502: write error on file "/rman/db_mmnb04jl_1_1", blockno 30481025 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device Additional information: -1 Additional information: 1048576 ORA-19502: write error on file "/rman/db_mmnb04jl_1_1", blockno 30480897 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device channel t12 disabled, job failed on it will be run on another channel RMAN-03009: failure of backup command on t17 channel at 05/15/2012 04:39:58 ORA-19502: write error on file "/rman/db_mrnb04jm_1_1", blockno 30753793 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device Additional information: -1 Additional information: 1048576 ORA-19502: write error on file "/rman/db_mrnb04jm_1_1", blockno 30753665 (blocksize=8192) ORA-27063: number of bytes read/written is incorrect IBM AIX RISC System/6000 Error: 28: No space left on device channel t17 disabled, job failed on it will be run on another channel
这两个通道在写入rman备份到磁盘中的时候,在05/15/2012 04:39:58发现磁盘空间不足,两个通道分别准备写入30480897/30753665号块的时候出错,那么当时这两个通道分别写入的数据块数为30480896/30753664,写入文件大小为(30480896+30753664)*8192/1024/1024/1024=467.1826171875G.这里可以看出磁盘剩余空间467.25G,其实当时已经写入了467.1826171875G,继续写入的时候出错.然后rman为了保证备份的正确性,自动删除了当时已经备份的467.1826171875G错误的备份文件.从而在备份结束后看到磁盘空间还有大量剩余而rman包空间不足的现象.
library cache latch等待事件
产生library cache latch原因
The library cache latches protect the cached SQL statements and objects' definitions held in the library cache within the shared pool. The library cache latch must be acquired in order to add a new statement to the library cache. During a parse, Oracle searches the library cache for a matching statement. If one is not found, then Oracle will parse the SQL statement, obtain the library cache latch and insert the new SQL.
每一个sql被执行之前,先要到library cache中根据hash_value查找parent cursor,这就需要先获得library cache latch;找到parent cursor后,就会去查找对应的child cursor,当发现无法找到时,就会释放library cache latch,获得share pool latch分配空间给硬解析后的产生的执行计划;然后再次获得library cache latch进行把执行计划放入share pool,转入library cache pin+lock(null模式)开始执行sql.library cache latch 的个数有限(与CPU_COUNT参数相关),当数据库中出现大量硬解析的时候,某一个sql无法得到library cache latch就会开始spin,达到spin count后还没得到,就会开始sleep,达到sleep时间后,醒来还再次试图过的library cache latch得不到就在spin再得不到又sleep…依此类推.
综上可知:在sql执行的过程中可以看出在出现High Versions Count和Hard Parse的情况下都有可能出现library cache latch等待.
关于Hard Parse见:shared pool latch 等待事件
关于High Versions Count见:关于High Versions Count总结
1._KGL_LATCH_COUNT控制library cache latches数量
The hidden parameter _KGL_LATCH_COUNT controls the number of library cache latches. The default value should be adequate, but if contention for the library cache latch cannot be resolved, it one may consider increasing this value. The default value for _KGL_LATCH_COUNT is the next prime number after CPU_COUNT. This value cannot exceed 67.
2.Library cache: mutex X
在10g及其以后版本中,很多latch使用mutex代替,我们常见的Library cache: mutex X is similar to library cache wait in earlier version.(_kks_use_mutex_pin=false可以禁止mutex)
ORA-07445 [ACCESS_VIOLATION] [UNABLE_TO_READ] []
alert中发现ORA-07445错误
ORA-07445: exception encountered: core dump [PC:0x7FFF65D0] [ACCESS_VIOLATION] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0] [UNABLE_TO_READ] []错误,导致数据库down掉
Mon May 14 14:34:34 2012 Exception [type: ACCESS_VIOLATION, UNABLE_TO_READ] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0, {empty}] Errors in file d:\app\administrator\diag\rdbms\orcl\orcl\trace\orcl_p001_1280.trc: ORA-07445: exception encountered: core dump [PC:0x7FFF65D0] [ACCESS_VIOLATION] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0] [UNABLE_TO_READ] [] Mon May 14 14:34:35 2012 Trace dumping is performing id=[cdmp_20120514143435] Mon May 14 14:35:10 2012 Exception [type: ACCESS_VIOLATION, UNABLE_TO_READ] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0, {empty}] Errors in file d:\app\administrator\diag\rdbms\orcl\orcl\trace\orcl_smon_1072.trc (incident=164712): ORA-07445: exception encountered: core dump [PC:0x7FFF65D0] [ACCESS_VIOLATION] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0] [UNABLE_TO_READ] [] ORA-12080: Buffer cache miss for IOQ batching Incident details in: d:\app\administrator\diag\rdbms\orcl\orcl\incident\incdir_164712\orcl_smon_1072_i164712.trc
分析trace文件
Oracle Database 11g Enterprise Edition Release 11.1.0.6.0 - Production With the Partitioning, OLAP, Data Mining and Real Application Testing options Windows Server 2003 Version V5.2 Service Pack 2 CPU : 8 - type 586, 4 Physical Cores Process Affinity : 0x00000000 Memory (Avail/Total): Ph:4892M/8189M, Ph+PgF:5638M/9795M, VA:925M/4095M Instance name: orcl Redo thread mounted by this instance: 1 Oracle process number: 12 Windows thread id: 1072, image: ORACLE.EXE (SMON) --以上信息得出操作系统和数据库版本2003 sp2+oracle11g(11.1.0.6 32位) Dump continued from file: d:\app\administrator\diag\rdbms\orcl\orcl\trace\orcl_smon_1072.trc ORA-07445: exception encountered: core dump [PC:0x7FFF65D0] [ACCESS_VIOLATION] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0] [UNABLE_TO_READ] [] ORA-12080: Buffer cache miss for IOQ batching ========= Dump for incident 164712 (ORA 7445 [PC:0x7FFF65D0]) ======== ----- Beginning of Customized Incident Dump(s) ----- Exception [type: ACCESS_VIOLATION, UNABLE_TO_READ] [ADDR:0xFFFFFFFF] [PC:0x7FFF65D0, {empty}] --这里的ORA-07445 [ACCESS_VIOLATION][UNABLE_TO_READ]根据经验结合这里的32位的环境, --怀疑是sga使用的内存太多,ORACLE数据库不能读SGA相关内存导致 --在trace中找出相关参数配置. [0004]: processes=300 [0004]: sessions=335 [0004]: __shared_pool_size=1124073472 [0004]: __large_pool_size=8388608 [0004]: __java_pool_size=16777216 [0004]: __streams_pool_size=251658240 [0004]: streams_pool_size=251658240 [0004]: sga_target=0 [0004]: __sga_target=1887436800 [0004]: memory_target=3145728000 [0004]: memory_max_target=4722786304 [0004]: db_block_size=8192 [0004]: __db_cache_size=478150656 [0004]: __shared_io_pool_size=0 [0004]: compatible=11.1.0.0.0 [0004]: log_buffer=8851456 [0004]: __pga_aggregate_target=780140544 --这里可以看到sga_target分配了内存为1887436800=1.7578125G --pga_aggregate_target分配了780140544=0.7265625G --两者内存之和大于2G,超过了32位ORACLE默认限制
查询MOS发现[1341681.1]
该错误原因
This is a resource issue (memory in particular). 32-bit windows systems, are limited to 2GB of addressable memory so if you are on this platform it's likely you are simply exceeding the capabilities of the 32bit operating system.
解决建议
First recommendation : If you have not already done so, add the /3GB switch to your boot.ini file and reboot the server. The boot.ini will be located in the root directory on the drive where windows is installed. The switch, /3GB, is placed at the end of the line that executes the WinNT loading process. This will allow applications such as oracle access to 3Gb or memory instead of 2Gb. Example: [operating systems] multi(0)disk(0)rdisk(0)partition(2)\WINNT="Windows NT Server Version 4.00" /3GB Second recommendation : You do not want to increase memory target. If anything, this should be decreased. You are limited to under 2GB of addressable memory on 32bit windows (the limit is actually about 1.85GB). This is for both SGA and PGA memory for all instances; you have to reduce the SGA size for the instance. The recommendation is to reduce sga_target, memory_target, and memory_max_target.