Good Day List,
Xserve G4
OS X 10.4.9
Xserve RAID, 2 LUN's of 6X250GB HDD's striped via Apple RAID 1.
Direct connected via copper SFP-SFP cables.
During periods of high IO (I think) RAID volume unmounts.
system.log:
Nov 14 23:53:45 eng-tvstfgfx1 kernel[0]: AppleRAID::completeRAIDRequest -
error 0xe00002ca detected for set "gfx_raid01"
(E1AD0375-67AE-11D8-8EAA-000A958B4568), member
64391080-FF5B-4546-9CAF-D50500000000, set byte offset = 641573285888.
Nov 14 23:53:45 eng-tvstfgfx1 kernel[0]: disk5: I/O error.
Nov 14 23:53:45 eng-tvstfgfx1 kernel[0]: AppleRAID::recover() member
64391080-FF5B-4546-9CAF-D50500000000 from set "gfx_raid01"
(E1AD0375-67AE-11D8-8EAA-000A958B4568) has been marked offline.
Nov 14 23:53:45 eng-tvstfgfx1 kernel[0]: AppleRAID::restartSet - restarting
set "gfx_raid01" (E1AD0375-67AE-11D8-8EAA-000A958B4568).
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: AppleRAID::completeRAIDRequest -
underrun detected, expected = 0x8000, actual = 0x0, set = "gfx_raid01"
(E1AD0375-67AE-11D8-8EAA-000A958B4568)
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: disk5: data underrun.
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: disk5: media is not present.
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: disk5: media is not present.
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: jnl: do_jnl_io: strategy err 0x6
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: jnl: end_transaction: only wrote 0
of 24576 bytes to the journal!
Nov 14 23:54:00 eng-tvstfgfx1 kernel[0]: jnl: close: journal 0x2c89d7c, is
invalid. aborting outstanding transactions
One of the LUN's is no longer available to the OS. Restart, and the volume
mounts as expected.
$ diskutil verifyVolume
indicates volume does not need repair.
Once volume is restored:
$ diskutil checkRAID
Name: gfx_raid01
Unique ID: E1AD0375-67AE-11D8-8EAA-000A958B4568
Type: Stripe
Status: Online
Device Node: disk5
Apple RAID Version: 1
----------------------------------------------------------------------
# Device Node UUID Status
----------------------------------------------------------------------
1 disk3s3 411AA7C0-8186-40D8-82D1-E50000000000 Online
0 disk4s3 90191C39-E1DD-44FC-9371-DC7600000000 Online
----------------------------------------------------------------------
This happens consistently when an rsync cron job is enabled. Therefore it is
currently disabled. Also, I believe a Retrospect job triggered the behavior
and log entries above. I have only witnessed this behavior under these 2
scenarios.
There are no errors with the underlying LUN's in RAID Admin.
Using /bin/dd to write to the volume results in 127.89 MB/s.
The HBA has been replaced and the behavior remains.
Any thoughts?
Thank you,
-dgf
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Macos-x-server mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden