This sounds like an issue I was having ... which appears to have been
a bad controller.
I noticed that in this situation the affinity with the affected LUN would
not respond ... while the remainder of the SAN was fine. If I rebooted a
workstation, nothing came up; ditto on the MDCs. However, if I looked at
Apple System Profiler, I would NOT see all of my LUNs on the MDC ... it was
seeing 6 of 13.
Looking at my switches (SB5200s) ... they thought everything was OK.
Based on error messages in the cv_log, where a specific disk was
reported missing (disk6), I tried power cycling that array ... as that
was the last disk the system was enumerating (6 of 13).
... Upon power cycle, everything was fine.
I had this happen to me in January, March and then April (2 weeks in a
row) ... each time I tried something different ... resulting in me
finally replacing the controller on the array ... and everything seems
fine now.
The other things I did were:
- Reboot
- cable swap
- reset the cache on the controller
- firmware update
None of these really fixed the problem ... a reboot was just a
simple stop-gap solution.
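For what it's worth, the "6 of 13" symptom is something you could check for automatically. Below is only a rough Python sketch, not a tested tool: it assumes `cvlabel -l` prints one record per LUN shaped like the RAID3_Left line quoted further down in this thread, and the label "RAID3_Right" is a made-up example, not a name from anyone's actual SAN.

```python
import re

# One `cvlabel -l` record: device path, bracketed inquiry string, a type
# word, then the quoted label. This shape is assumed from the single
# record quoted in this thread; other Xsan versions may print it differently.
LUN_RE = re.compile(r'^(/dev/rdisk\d+)\s+\[.*?\]\s+\S+\s+"([^"]+)"')


def parse_luns(cvlabel_output):
    """Return {label: device} for every LUN record found in the text."""
    luns = {}
    for line in cvlabel_output.splitlines():
        match = LUN_RE.match(line.strip())
        if match:
            luns[match.group(2)] = match.group(1)
    return luns


def missing_luns(cvlabel_output, expected_labels):
    """Return the expected labels that do not appear in the output, sorted."""
    seen = parse_luns(cvlabel_output)
    return sorted(set(expected_labels) - set(seen))


# Sample record taken from the quoted message below (joined onto one line).
sample = (
    '/dev/rdisk3 [APPLE Xserve RAID 1.51] acfs "RAID3_Left" '
    'Sectors: 8790767583. SectorSize: 512. Maximum sectors: 8790767583.'
)

# "RAID3_Right" is hypothetical, included only to show a missing LUN.
print(parse_luns(sample))
print(missing_luns(sample, ["RAID3_Left", "RAID3_Right"]))
```

In practice you would feed it the real output captured on the MDC and your own list of 13 labels, and raise an alert whenever the missing list is non-empty.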
-Dave
On Thu, May 29, 2008 at 7:56 AM, Damien Weiss <email@hidden> wrote:
>
> From your previous email, it looks like an fsm has died on the MDCs. Judging
> from your fix of rebooting everything, I'd ask that you look at your RSCN
> settings. Perhaps you have added a new workstation, but have not changed
> the RSCN settings for it?
>
>
> Thanks,
> Damien
>
> On May 29, 2008, at 6:46 AM, Vilius Šumskas wrote:
>
>> More on this.
>>
>> I just grep'd the server logs, and during the freeze I see "May 29 09:13:22
>> xserve1 kernel[0]: disk3: I/O error." in the logs. Disk3 contains one of
>> the LUNs, I suppose, because cvlabel -l revealed:
>>
>> /dev/rdisk3 [APPLE Xserve RAID 1.51] acfs "RAID3_Left" Sectors:
>> 8790767583. SectorSize: 512. Maximum sectors: 8790767583.
>>
>> Are disk3 and rdisk3 the same?
>>
>> I have also noticed that during the Xserve RAID power-off procedure the
>> upper controller on the third array (exactly the same RAID3_Left) switched
>> off veeeryyyy slowly.
>>
>> Could these be symptoms of a malfunctioning RAID controller? Or should I
>> look at the disks in that LUN more closely (like run a parity check or
>> something)?
>>
>> --
>> Best Regards,
>>
>> Vilius Šumskas
>> LNK TV IT manager
>> mob.: +370 614 75713
>> http://www.lnk.lt
>>
>> _______________________________________________
>> Do not post admin requests to the list. They will be ignored.
>> Xsan-Users mailing list (email@hidden)
>> Help/Unsubscribe/Update your Subscription:
>> http://lists.apple.com/mailman/options/xsan-users/email@hidden
>>
>> This email sent to email@hidden