[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [cobalt-users] RAID Error



Sorry about the length of this reply. I did what you suggested and got
the following:

Partition check:
 hda:hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
ide0: reset: success
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
ide0: reset: success
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
end_request: I/O error, dev 03:00 (hda), sector 0
 unable to read partition table
 hdc: hdc1 hdc2 < hdc5 hdc6 > hdc3 hdc4
autodetecting RAID arrays
(read) hdc1's sb offset: 768000 [events: 00000042]
(read) hdc3's sb offset: 205056 [events: 00000042]
(read) hdc4's sb offset: 28181056 [events: 00000042]
(read) hdc6's sb offset: 131456 [events: 00000042]
autorun ...
considering hdc6 ...
  adding hdc6 ...
created md6
bind<hdc6,1>
running: <hdc6>
now!
hdc6's event counter: 00000042
md6: former device hda6 is unavailable, removing from array!
md6: max total readahead window set to 128k
md6: 1 data-disks, max readahead per data-disk: 128k
raid1: device hdc6 operational as mirror 1
raid1: md6, not all disks are operational -- trying to recover array
raid1: raid set md6 active with 1 out of 2 mirrors
md: updating md6 RAID superblock on device
hdc6 [events: 00000043](write) hdc6's sb offset: 131456
md: recovery thread got woken up ...
md: recovery thread finished ...
.
considering hdc4 ...
  adding hdc4 ...
created md4
bind<hdc4,1>
running: <hdc4>
now!
hdc4's event counter: 00000042
md4: former device hda4 is unavailable, removing from array!
md4: max total readahead window set to 128k
md4: 1 data-disks, max readahead per data-disk: 128k
raid1: device hdc4 operational as mirror 1
raid1: md4, not all disks are operational -- trying to recover array
raid1: raid set md4 active with 1 out of 2 mirrors
md: updating md4 RAID superblock on device
hdc4 [events: 00000043](write) hdc4's sb offset: 28181056
md: recovery thread got woken up ...
md: recovery thread finished ...
.
considering hdc3 ...
  adding hdc3 ...
created md3
bind<hdc3,1>
running: <hdc3>
now!
hdc3's event counter: 00000042
md3: former device hda3 is unavailable, removing from array!
md3: max total readahead window set to 128k
md3: 1 data-disks, max readahead per data-disk: 128k
raid1: device hdc3 operational as mirror 1
raid1: md3, not all disks are operational -- trying to recover array
raid1: raid set md3 active with 1 out of 2 mirrors
md: updating md3 RAID superblock on device
hdc3 [events: 00000043](write) hdc3's sb offset: 205056
md: recovery thread got woken up ...
md: recovery thread finished ...
.
considering hdc1 ...
  adding hdc1 ...
created md1
bind<hdc1,1>
running: <hdc1>
now!
hdc1's event counter: 00000042
md1: former device hda1 is unavailable, removing from array!
md1: max total readahead window set to 128k
md1: 1 data-disks, max readahead per data-disk: 128k
raid1: device hdc1 operational as mirror 1
raid1: md1, not all disks are operational -- trying to recover array
raid1: raid set md1 active with 1 out of 2 mirrors
md: updating md1 RAID superblock on device
hdc1 [events: 00000043](write) hdc1's sb offset: 768000
md: recovery thread got woken up ...
md: recovery thread finished ...
.
... autorun DONE.
VFS: Mounted root (ext2 filesystem) readonly.
Freeing unused kernel memory: 64k freed
Adding Swap: 131448k swap-space (priority -1)
trying to remove hda1 from md1 ...
trying to remove hda3 from md3 ...
trying to remove hda4 from md4 ...
trying to remove hda6 from md6 ...
 hda:hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
ide0: reset: success
hda: timeout waiting for DMA
ide_dmaproc: chipset supported ide_dma_timeout func only: 14
hda: irq timeout: status=0x58 { DriveReady SeekComplete DataRequest }
hda: timeout waiting for DMA
ide_dmaproc: chipset supported ide_dma_timeout func only: 14
hda: irq timeout: status=0x58 { DriveReady SeekComplete DataRequest }
hda: timeout waiting for DMA
ide_dmaproc: chipset supported ide_dma_timeout func only: 14
hda: irq timeout: status=0x58 { DriveReady SeekComplete DataRequest }
hda: timeout waiting for DMA
ide_dmaproc: chipset supported ide_dma_timeout func only: 14
hda: irq timeout: status=0x58 { DriveReady SeekComplete DataRequest }
hda: DMA disabled
ide0: reset: success
 hda1 hda2 < hda5 hda6 > hda3 hda4
 hda: hda1 hda2 < hda5 hda6 > hda3 hda4
trying to hot-add hda1 to md1 ...
bind<hda1,2>
md: updating md1 RAID superblock on device
hda1 [events: 00000044](write) hda1's sb offset: 768000
hdc1 [events: 00000044](write) hdc1's sb offset: 768000
.
md: recovery thread got woken up ...
md1: resyncing spare disk hda1 to replace failed disk
md: syncing RAID array md1
md: minimum _guaranteed_ reconstruction speed: 100 KB/sec.
md: using maximum available idle IO bandwith for reconstruction.
md: using 128k window.
trying to hot-add hda3 to md3 ...
bind<hda3,2>
md: updating md3 RAID superblock on device
hda3 [events: 00000044](write) hda3's sb offset: 205056
hdc3 [events: 00000044](write) hdc3's sb offset: 205056
.
trying to hot-add hda4 to md4 ...
bind<hda4,2>
md: updating md4 RAID superblock on device
hda4 [events: 00000044](write) hda4's sb offset: 28181056
hdc4 [events: 00000044](write) hdc4's sb offset: 28181056
.
trying to hot-add hda6 to md6 ...
bind<hda6,2>
md: updating md6 RAID superblock on device
hda6 [events: 00000044](write) hda6's sb offset: 131456
hdc6 [events: 00000044](write) hdc6's sb offset: 131456


Does this tell us anything useful?



-----Original Message-----
From: cobalt-users-admin@xxxxxxxxxxxxxxx
[mailto:cobalt-users-admin@xxxxxxxxxxxxxxx] On Behalf Of flash22@xxxxxxx
Sent: 17 January 2002 20:17
To: Cobalt List
Subject: RE: [cobalt-users] RAID Error


On Thu, 17 Jan 2002, Richard Sidlin wrote:

> Anyone help on this please?
>
> Can anyone tell me which hard drive is at fault here please. If it is
> the secondary, do I simply take it out and replace it with an
> identical drive? Does it need to be unpartitioned?

There is no way to tell from those messages, you need to look at the
boot logs, type 'dmesg' or look at var/log/dmesg, you should see the
partition tables for both drives registered

eg:
Partition check:
 hda: hda1 hda2 hda3 hda4
VFS: Mounted root (ext2 filesystem) readonly.

etc, (The VFS is different for raid that that) Make sure BOTH drives
show up.

Then do:
/sbin/fdisk -l /dev/hda
/sbin/fdisk -l /dev/hdc

To print out the raw partition tables on the physical drives, they are
supposed to match...

The raid messages only tell me sonething nasty happened and the kernel
is having trouble recovering it (It's trying to recover the first
partition, so that one is probably dead too, eg, all 4 are )

> Warning: RAID device <md3> has a dead partition!

There's more, but start there ;P

gsh



_______________________________________________
cobalt-users mailing list
cobalt-users@xxxxxxxxxxxxxxx
To Subscribe or Unsubscribe, please go to:
http://list.cobalt.com/mailman/listinfo/cobalt-users