oracle 11g rac 又一节点无法启动的生产case怎么办
这篇文章主要介绍了oracle 11g rac 又一节点无法启动的生产case怎么办,具有一定借鉴价值,感兴趣的朋友可以参考下,希望大家阅读完这篇文章之后大有收获,下面让小编带着大家一起了解一下。
一、环境描述
11g rac 双节点,AIX小型机
二、现象
节点2无法启动
crsctl start crs 执行报错。
三、问题分析处理
1.查看数据库日志
ArchivedLogentry399348addedforthread2sequence205493ID0xffffffff8452e669dest1:SatDec0911:13:472017Thread2advancedtologsequence205495(LGWRswitch)Currentlog#3seq#205495mem#0:+DATA/orcl2/onlinelog/group_3.257.890091875SatDec0911:13:512017ArchivedLogentry399349addedforthread2sequence205494ID0xffffffff8452e669dest1:SatDec0911:24:072017NOTE:ASMBterminatingErrorsinfile/u01/app/oracle/diag/rdbms/orcl2/PTS22/trace/PTS22_asmb_8847608.trc:ORA-15064:?ASM??????ORA-03113:?????????Errorsinfile/u01/app/oracle/diag/rdbms/orcl2/PTS22/trace/PTS22_asmb_8847608.trc:ORA-15064:?ASM??????ORA-03113:?????????ASMB(ospid:8847608):terminatingtheinstanceduetoerror15064SatDec0911:24:072017
--判断可能是通信问题orcldb2:/u01/app/oracle/diag/rdbms/orcl2/orcl22/trace$oerrora1506415064,00000,"communicationfailurewithASMinstance"//*Cause:TherewasafailuretocommunicatewiththeASMinstance,most//likelybecausetheconnectionwentdown.//*Action:Checktheaccompanyingerrormessagesformoreinformationonthe//reasonforthefailure.Notethatdatabaseinstanceswillalways//returnthiserrorwhentheASMinstanceisterminatedabnormally.
2.查看集群日志
2017-12-0911:23:51.026[cssd(7667900)]CRS-1612:Networkcommunicationwithnodeorcldb1(1)missingfor50%oftimeoutinterval.Removalofthisnodefromclusterin14.523seconds2017-12-0911:23:59.039[cssd(7667900)]CRS-1611:Networkcommunicationwithnodeorcldb1(1)missingfor75%oftimeoutinterval.Removalofthisnodefromclusterin6.509seconds2017-12-0911:24:03.052[cssd(7667900)]CRS-1610:Networkcommunicationwithnodeorcldb1(1)missingfor90%oftimeoutinterval.Removalofthisnodefromclusterin2.497seconds2017-12-0911:24:05.552[cssd(7667900)]CRS-1609:Thisnodeisunabletocommunicatewithothernodesintheclusterandisgoingdowntopreserveclusterintegrity;detailsat(:CSSNM00008:)in/u01/app/11.2.0/grid/log/orcldb2/cssd/ocssd.log.2017-12-0911:24:05.552[cssd(7667900)]CRS-1656:TheCSSdaemonisterminatingduetoafatalerror;Detailsat(:CSSSC00012:)in/u01/app/11.2.0/grid/log/orcldb2/cssd/ocssd.log2017-12-0911:24:05.614[cssd(7667900)]CRS-1652:StartingcleanupofCRSDresources.
3.查看系统日志
IDENTIFIERTIMESTAMPTCRESOURCE_NAMEDESCRIPTIONFE2DEE001209123617PSSYSXAIXIFDUPLICATEIPADDRESSDETECTEDINTHENETFE2DEE001209122517PSSYSXAIXIFDUPLICATEIPADDRESSDETECTEDINTHENETFE2DEE001209114417PSSYSXAIXIFDUPLICATEIPADDRESSDETECTEDINTHENETFE2DEE001209114317PSSYSXAIXIFDUPLICATEIPADDRESSDETECTEDINTHENETA924A5FC1209112417PSSYSPROCSOFTWAREPROGRAMABNORMALLYTERMINATED
综上所以的日志都指向数据库通信可能有问题。
检查心跳网络,在节点一上ping 节点二是通的,ping自己当然也是通的。
这里感觉好奇怪,貌似心跳也没问题啊。各种问好??????整理下思路,在节点二上ping 节点一,好嘛,真心ping不通。找到这个问题之后和客户沟通,发现网络刚刚做了调整导致的。经过网络工程师的处理。心跳网络恢复。轮到我上了,把集群给拉起来。
--root用户执行crsctlstopcrs--报错crsctlstopcrs-f强制关闭crsctlstartcrscrsctlstatres-t
感谢你能够认真阅读完这篇文章,希望小编分享的“oracle 11g rac 又一节点无法启动的生产case怎么办”这篇文章对大家有帮助,同时也希望大家多多支持亿速云,关注亿速云行业资讯频道,更多相关知识等着你来学习!
声明:本站所有文章资源内容,如无特殊说明或标注,均为采集网络资源。如若本站内容侵犯了原著者的合法权益,可联系本站删除。