Re: ASM disk corruption
- From: Andrew Kerber <andrew.kerber@xxxxxxxxx>
- To: pnedeljkovich@xxxxxxxxxxxxxxx
- Date: Mon, 22 Nov 2010 13:39:21 -0600
We had something almost identical happen, but in our case it was on our OCR
disk group (we are on 11gR2). The error message was the same. MOS could
not help at all. We still havent figured out how it happened. MOS
recommendation was to use normal or high redundancy instead of external.
Like we could waste all that space.
On Mon, Nov 22, 2010 at 1:32 PM, Peter Nedeljkovich <
pnedeljkovich@xxxxxxxxxxxxxxx> wrote:
> For all intents and purposes, yeah they’re down. As part of our
> troubleshooting, we had tried restarting the nodes. That’s when we noticed
> the corruption.
>
>
>
> *From:* Amaral, Rui [mailto:Rui.Amaral@xxxxxxxxxxxxxxxx]
> *Sent:* Monday, November 22, 2010 2:28 PM
>
> *To:* Peter Nedeljkovich; 'oracle-l@xxxxxxxxxxxxx'
> *Subject:* RE: ASM disk corruption
>
>
>
> Are all asm instance nodes down?
>
>
>
> Reason I ask is this: ASM caches the disk metadata in memory when it starts
> (I don’t remember all of the acronyms but I can look it up). If you have one
> of the asm instances still up then you can drop the disk from the running
> instance and once it has successfully dropped the disk then you might be
> able to bring things back up. Dropping the disk rebalances all objects
> across the remaining disks within the disk group and because the metadata
> would still be in memory on that running instance ASM should be able to move
> things around successfully.
>
>
> ------------------------------
>
> *From:* oracle-l-bounce@xxxxxxxxxxxxx [mailto:
> oracle-l-bounce@xxxxxxxxxxxxx] *On Behalf Of *Peter Nedeljkovich
> *Sent:* Monday, November 22, 2010 1:47 PM
> *To:* Amaral, Rui; 'oracle-l@xxxxxxxxxxxxx'
> *Subject:* RE: ASM disk corruption
>
>
>
> Sorry, should have mentioned this before. 11gR1 RAC on Linux 4.6
>
>
>
> *From:* Amaral, Rui [mailto:Rui.Amaral@xxxxxxxxxxxxxxxx]
> *Sent:* Monday, November 22, 2010 1:12 PM
> *To:* Peter Nedeljkovich; 'oracle-l@xxxxxxxxxxxxx'
> *Subject:* RE: ASM disk corruption
>
>
>
> What version of asm are you using Peter? Is this standalone as well or RAC?
>
>
> ------------------------------
>
> *From:* oracle-l-bounce@xxxxxxxxxxxxx [mailto:
> oracle-l-bounce@xxxxxxxxxxxxx] *On Behalf Of *Peter Nedeljkovich
> *Sent:* Monday, November 22, 2010 12:45 PM
> *To:* 'oracle-l@xxxxxxxxxxxxx'
> *Subject:* ASM disk corruption
>
>
>
> I’m quite new at this so I’m hoping that someone can help us to get back on
> line. FWIW, MOS has already suggested that we rebuild the group.
>
>
>
> We tried to start things up and got the following:
>
> WARNING: cache read a corrupted block gn=1 fn=7008 indblk=0 from disk 9
> NOTE: a corrupted block was dumped to
> /u01/app/asm/diag/asm/+asm/+ASM1/trace/+ASM1_rbal_9624.trc
> ERROR: cache failed to read gn=1 fn=7008 indblk=0 from disk(s): 9
> ORA-15196: invalid ASM block header [kfc.c:9133] [endian_kfbh] [7008]
> [2147483648] [0 != 1]
> System State dumped to trace file
> /u01/app/asm/diag/asm/+asm/+ASM1/trace/+ASM1_rbal_9624.trc
> NOTE: cache initiating offline of disk 9 group 1
> WARNING: initiating offline of disk 9.3915980183 (DISK18) with mask 0x7e
> NOTE: initiating PST update: grp = 1, dsk = 9, mode = 0x15
> kfdp_updateDsk(): 17
> kfdp_updateDskBg(): 17
> ERROR: too many offline disks in PST (grp 1)
>
>
>
> I’ve tried KFED, only as far as comparing it’s output between 2 disks. I’m
> now at a loss as to where to go next.
>
>
>
>
>
>
>
> Peter Nedeljkovich
>
> DBA
>
> Georgian College
>
> 705-728-1968 Ext. 1217
>
>
>
>
> NOTICE: Confidential message which may be privileged. Unauthorized
> use/disclosure prohibited. If received in error, please go to
> www.td.com/legal for instructions.
> AVIS : Message confidentiel dont le contenu peut ?tre privil?gi?.
> Utilisation/divulgation interdites sans permission. Si re?u par erreur,
> pri?re d'aller au www.td.com/francais/avis_juridique pour des
> instructions.
>
>
> --
> This message was scanned by the Georgian College ESVA and is believed to be
> clean.
>
>
> --
> This message was scanned by the Georgian College ESVA and is believed to be
> clean.
>
--
Andrew W. Kerber
'If at first you dont succeed, dont take up skydiving.'
Other related posts: