标签云
asm恢复 bbed bootstrap$ dul In Memory kcbzib_kcrsds_1 kccpb_sanity_check_2 MySQL恢复 ORA-00312 ORA-00607 ORA-00704 ORA-00742 ORA-01110 ORA-01555 ORA-01578 ORA-01595 ORA-08103 ORA-600 2131 ORA-600 2662 ORA-600 3020 ORA-600 4000 ORA-600 4137 ORA-600 4193 ORA-600 4194 ORA-600 16703 ORA-600 kcbzib_kcrsds_1 ORA-600 KCLCHKBLK_4 ORA-15042 ORA-15196 ORACLE 12C oracle dul ORACLE PATCH Oracle Recovery Tools oracle加密恢复 oracle勒索 oracle勒索恢复 oracle异常恢复 Oracle 恢复 ORACLE恢复 ORACLE数据库恢复 oracle 比特币 OSD-04016 YOUR FILES ARE ENCRYPTED 勒索恢复 比特币加密文章分类
- Others (2)
- 中间件 (2)
- WebLogic (2)
- 操作系统 (103)
- 数据库 (1,772)
- DB2 (22)
- MySQL (77)
- Oracle (1,612)
- Data Guard (52)
- EXADATA (8)
- GoldenGate (24)
- ORA-xxxxx (166)
- ORACLE 12C (72)
- ORACLE 18C (6)
- ORACLE 19C (15)
- ORACLE 21C (3)
- Oracle 23ai (8)
- Oracle ASM (69)
- Oracle Bug (8)
- Oracle RAC (54)
- Oracle 安全 (6)
- Oracle 开发 (28)
- Oracle 监听 (29)
- Oracle备份恢复 (593)
- Oracle安装升级 (98)
- Oracle性能优化 (62)
- 专题索引 (5)
- 勒索恢复 (86)
- PostgreSQL (31)
- pdu工具 (6)
- PostgreSQL恢复 (10)
- SQL Server (32)
- SQL Server恢复 (13)
- TimesTen (7)
- 达梦数据库 (3)
- 达梦恢复 (1)
- 生活娱乐 (2)
- 至理名言 (11)
- 虚拟化 (2)
- VMware (2)
- 软件开发 (39)
- Asp.Net (9)
- JavaScript (12)
- PHP (2)
- 小工具 (22)
-
最近发表
- 由于空间满导致PostgreSQL数据库异常处理
- 一次非常幸运的ORA-600 16703(tab$被清空)故障恢复
- Oracle 19c 202507补丁(RUs+OJVM)-19.28
- 2025年的Oracle 8.0.5数据库恢复
- ORA-600 kokiasg1故障分析(obj$中核心字典序列全部被恶意删除)
- ORA-00756 ORA-10567故障数据0丢失恢复
- 数据库文件变成32k故障恢复
- tcp连接过多导致监听TNS-12532 TNS-12560 TNS-00502错误
- 文件系统格式化MySQL数据库恢复
- .sstop勒索加密数据库恢复
- 解决一次硬件恢复之后数据文件0kb的故障恢复case
- Error in invoking target ‘libasmclntsh19.ohso libasmperl19.ohso client_sharedlib’问题处理
- ORA-01171: datafile N going offline due to error advancing checkpoint
- linux环境oracle数据库被文件系统勒索加密为.babyk扩展名溯源
- ORA-600 ksvworkmsgalloc: bad reaper
- ORA-600 krccfl_chunk故障处理
- Oracle Recovery Tools恢复案例总结—202505
- ORA-600 kddummy_blkchk 数据库循环重启
- 记录一次asm disk加入到vg通过恢复直接open库的案例
- CHECKDB 发现了 N 个分配错误和 M 个一致性错误
分类目录归档:PostgreSQL恢复
由于空间满导致PostgreSQL数据库异常处理
朋友和我反馈pg库异常,通过查看日志确认最初故障是由于磁盘空间满,导出出现类似:无法扩展文件 “pg_tblspc/16394/PG_13_202007201/16395/5055912.143″: No space left on device错误
2025-07-28 09:06:02.703 HKT [15352] 错误: 无法扩展文件 "pg_tblspc/16394/PG_13_202007201/16395/5055912.143": No space left on device 2025-07-28 09:06:02.703 HKT [15352] 提示: 检查空闲磁盘控件. 2025-07-28 09:06:02.703 HKT [15352] 语句: insert into 语句 2025-07-28 09:06:02.703 HKT [576] 错误: 无法扩展文件 "pg_tblspc/16394/PG_13_202007201/16395/4477723.73": No space left on device 2025-07-28 09:06:02.703 HKT [576] 提示: 检查空闲磁盘控件. 2025-07-28 09:06:02.703 HKT [576] 语句: update 语句 2025-07-28 09:06:02.706 HKT [11940] 错误: 无法扩展文件 "pg_tblspc/16394/PG_13_202007201/16395/5055912.143": No space left on device 2025-07-28 09:06:02.706 HKT [11940] 提示: 检查空闲磁盘控件. 2025-07-28 09:06:02.706 HKT [11940] 语句: insert into 语句
后续对d盘空间进行了清理,pg报无法打开文件”pg_tblspc/16394/PG_13_202007201/16395/4477706.16″(目标数据块5544906): Permission denied
2025-07-28 10:53:10.435 HKT [11920] 语句: insert into语句 2025-07-28 10:53:10.435 HKT [18300] 错误: 无法打开文件"pg_tblspc/16394/PG_13_202007201/16395/4477706.16"(目标数据块5544906): Permission denied
和报:警告: 无法写入pg_tblspc/16394/PG_13_202007201/16395/4477706的块5545387错误
2025-07-28 11:05:11.008 HKT [5380] 警告: 无法写入pg_tblspc/16394/PG_13_202007201/16395/4477706的块5545387 2025-07-28 11:05:11.008 HKT [5380] 详细信息: 多次失败 --- 写错误可能是永久性的 2025-07-28 11:05:11.015 HKT [12072] 错误: 无法打开文件"pg_tblspc/16394/PG_13_202007201/16395/4477706.17"(目标块5545398):上一段只有952个块 2025-07-28 11:05:11.015 HKT [12072] 上下文: 写入关系pg_tblspc/16394/PG_13_202007201/16395/4477706的块5545398 2025-07-28 11:05:11.015 HKT [12072] 语句: select 语句 2025-07-28 11:05:11.016 HKT [12072] 警告: 无法写入pg_tblspc/16394/PG_13_202007201/16395/4477706的块5545398 2025-07-28 11:05:11.016 HKT [12072] 详细信息: 多次失败 --- 写错误可能是永久性的
查看4477706相关文件情况,初步看部分文件大小不对,结合上述报错,可以断定该表异常
通过查询pg字典确认该表具体名称
xifenfei=# select oid,relname,relnamespace from pg_class where relfilenode=4477706; oid | relname | relnamespace -------+------------------+-------------- 16880 | xifenfei01_log | 2200 (1 行记录)
尝试pg_dump导入数据报错:多次失败 — 写错误可能是永久性的
这次运气比较好,损坏的对象是一个日志表,可以直接清理掉,对该日志表进行清理,然后正常导出数据,如果遇到的损坏表是业务需要的表,可以使用pdu(PostgreSQL表文件损坏恢复—pdu恢复损坏的表文件)进行恢复
pg_control丢失/损坏处理
-l :设置下一个wal日志名
可以通过 $PGDATA/pg_wal 目录中查找数值最新的WAL段的文件名+1
[postgres@localhost pg_wal]$ ls 00000001000000000000008E archive_status -l 00000001000000000000008F
-m:设置下一个和最旧的事务ID
mxid1:下一个多事务ID的安全值可以通过在 $PGDATA/pg_multixact/offsets目录中查找数值最大的文件名+1,然后乘以65536(0×10000)来确定。
mxid2:最旧的事务ID的安全值可以通过$PGDATA/pg_multixact/offsets目录中数字最小的文件名+1,乘以65536(0×10000)来确定。文件名是十六进制
[postgres@localhost pg_wal]$ cd ../pg_multixact [postgres@localhost pg_multixact]$ ls members offsets [postgres@localhost pg_multixact]$ cd offsets/ [postgres@localhost offsets]$ ls 0000 -m 0x10000,0x10000
-O:设置下一个多事务处理偏移量
安全值可以通过在 $PGDATA/pg_multixact/members 目录中查找数值最大的文件名,+1,然后乘以52352(0xCC80)来确定。文件名为十六进制
[postgres@localhost members]$ ls 0000 -O 0xCC80
-x:设置下一个事务ID
安全值可以通过在$PGDATA/pg_xact目录中查找数值最大的文件名,+1,然后乘以1048576(0×100000)来确定。请注意,文件名是十六进制
[postgres@localhost pg_xact]$ ls -ltr total 504 -rw------- 1 postgres postgres 262144 Mar 8 01:53 0000 -rw------- 1 postgres postgres 253952 Mar 9 10:54 0001 -x 0x200000
删除postmaster.pid文件
如果正常关闭库,该文件会被自动删除,异常关闭的才需要处理
[postgres@localhost pg_wal]$ pg_resetwal -l 00000001000000000000008F -m 0x10000,0x10000 -O 0xCC80 -x 0x200000 -f $PGDATA pg_resetwal: error: lock file "postmaster.pid" exists pg_resetwal: hint: Is a server running? If not, delete the lock file and try again. [postgres@localhost pg_wal]$ cd ../ [postgres@localhost data]$ ls -l postmaster.pid -rw------- 1 postgres postgres 75 Mar 9 11:02 postmaster.pid [postgres@localhost data]$ rm -rf postmaster.pid
touch pg_control文件
[postgres@localhost data]$ pg_resetwal -l 00000001000000000000008F -m 0x10000,0x10000 -O 0xCC80 -x 0x200000 -f $PGDATA pg_resetwal: error: could not open file "global/pg_control" for reading: No such file or directory pg_resetwal: hint: If you are sure the data directory path is correct, execute touch global/pg_control and try again. [postgres@localhost data]$ touch global/pg_control
重建pg_control
[postgres@localhost data]$ pg_resetwal -l 00000001000000000000008F -m 0x10000,0x10000 -O 0xCC80 -x 0x200000 -f $PGDATA pg_resetwal: warning: pg_control exists but is broken or wrong version; ignoring it Write-ahead log reset [postgres@localhost data]$ pg_ctl start waiting for server to start.... done server started
pg启动报invalid checkpoint record处理
pg库启动报PANIC: could not locate a valid checkpoint record错误
2025-03-09 10:59:10.365 EDT [73013] LOG: starting PostgreSQL 16.8 on x86_64-pc-linux-gnu, compiled by gcc (GCC) 8.3.1 20191121 (Red Hat 8.3.1-5), 64-bit 2025-03-09 10:59:10.365 EDT [73013] LOG: listening on IPv4 address "0.0.0.0", port 5432 2025-03-09 10:59:10.365 EDT [73013] LOG: listening on IPv6 address "::", port 5432 2025-03-09 10:59:10.367 EDT [73013] LOG: listening on Unix socket "/tmp/.s.PGSQL.5432" 2025-03-09 10:59:10.401 EDT [73018] LOG: database system was interrupted; last known up at 2025-03-09 10:54:43 EDT 2025-03-09 10:59:11.506 EDT [73018] LOG: invalid checkpoint record 2025-03-09 10:59:11.508 EDT [73018] PANIC: could not locate a valid checkpoint record 2025-03-09 10:59:12.004 EDT [73013] LOG: startup process (PID 73018) was terminated by signal 6: Aborted 2025-03-09 10:59:12.004 EDT [73013] LOG: aborting startup due to startup process failure 2025-03-09 10:59:12.006 EDT [73013] LOG: database system is shut down
从报错信息中看,是有无法读取到有效的checkpoint记录导致,初步怀疑是wal异常,检查pg_wal目录,发现wal日志为空
[root@localhost pg_wal]# ls -ltr total 16388 drwx------ 2 postgres postgres 6 Mar 6 08:10 archive_status [root@localhost pg_wal]#
进一步检查系统操作命令rm删除wal日志命令
cd pg_wal rm -rf 0000000*
初步确认是由于wal日志被意外删除导致pg库无法启动,尝试重置wal,由于库不是干净关闭,无法直接重置成功
[postgres@localhost log]$ pg_resetwal $PGDATA The database server was not shut down cleanly. Resetting the write-ahead log might cause data to be lost. If you want to proceed anyway, use -f to force reset.
使用pg_resetwal -f强制重置
[postgres@localhost log]$ pg_resetwal -f $PGDATA Write-ahead log reset
尝试启动pg库成功
[postgres@localhost log]$ pg_ctl start waiting for server to start....2025-03-09 11:02:02.609 EDT [73088] LOG: redirecting log output to logging collector process 2025-03-09 11:02:02.609 EDT [73088] HINT: Future log output will appear in directory "log". done server started
2025-03-09 11:02:02.609 EDT [73088] LOG: starting PostgreSQL 16.8 on x86_64-pc-linux-gnu, compiled by gcc (GCC) 8.3.1 20191121 (Red Hat 8.3.1-5), 64-bit 2025-03-09 11:02:02.609 EDT [73088] LOG: listening on IPv4 address "0.0.0.0", port 5432 2025-03-09 11:02:02.609 EDT [73088] LOG: listening on IPv6 address "::", port 5432 2025-03-09 11:02:02.610 EDT [73088] LOG: listening on Unix socket "/tmp/.s.PGSQL.5432" 2025-03-09 11:02:02.645 EDT [73092] LOG: database system was shut down at 2025-03-09 11:01:53 EDT 2025-03-09 11:02:02.650 EDT [73088] LOG: database system is ready to accept connections