Re: SOLUTION: Kernel Panic when changing tapes during verify with BRU and Exabyte 221L FC



See the end of this message for the solution.

On Mar 11, 2006, at 7:27 PM, Mark Nassy wrote:


I have just added some clarifications. To the best of my knowledge, all devices have the latest publicly available firmware updates.


I am using the Apple Fibre Channel integration guides [1], [2] to help set up a SAN. I did a full backup that spanned 3 tapes, and now I am trying to verify the backup. When BRU tries to change a tape in the library during the verification process, I get an IOError. If I do not close the BRU monitor screen, the Xserve eventually reboots itself and logs a kernel panic. If I close the monitor window, it does not reboot and no kernel panic is logged.

Tolis Group's paid support has not returned my phone calls, and their email support said either that this should not happen or that something is wrong with my fabric.

I moved all connections to one QLogic switch (I have two switches; they are not interconnected).
Port 1 on the Xserve is set to 1Gbps and AL.
Port 0 on the Xserve is set to automatic.
IO Stream Guard is enabled for initiator ports on the switch.


I connected the tape library robotics directly to port 1 on the HBA on the Xserve.
I also tried connecting the tape library robotics to the switch (the tape drive is connected to the switch).




I swapped the optical cable connecting the tape drive with the one connecting the robotics (I only have two optical cables).


I made sure all the cables were seated properly in their sockets.

None of these helped. BRU is the only backup software installed. I hope to try NetVault on an external hard drive soon to see whether I get the same or better results. Does anyone have other suggestions for how I might resolve this?






Software and Hardware:
BRU Server 1.20d with 25 clients
Mac OS X Server 10.4.5
Xserve G5
Xserve RAID
ExaByte 221L FC
QLogic SANBox 5200

$ sudo strings /usr/local/bin/bru-server/history
...
Archive id: 440a3ce00647
Archive I/O: 732037600 blocks (1464075200Kb) written
Archive I/O: 0 blocks (0Kb) read
Files written: 180201 files (169137 regular, 11064 other)
Files read: 0 files (0 regular, 0 other)
Files skipped: 0 files
Volumes used: 1
Write errors: 0 soft, 0 hard
Read errors: 0 soft, 0 hard
Checksum errors: 0
brufilter: [I181] wrote 243890752 blocks (487781504 KBytes) on
volume [3], 5:53:35, 22992 Kb/sec
>>
**** brufilter: data stream summary ****
>>
Archive id: 440a3cf80649
Archive I/O: 732037760 blocks (1464075520Kb) written
Calculating catalog MD5 checksum
Saving catalog to tape
Finished backup of xserve1.intranet.deltaenterprise.com,
192.168.10.21 (Mar 5 10:50:10 2006)
>>
Starting verify pass
Unloading tape from drive 1 of changer0
Looking for tape A00001L3 in slot 1 of EXABYTE
Looking for tape in 'ntape0'
Tape is online
Reading header
Found tape A00001L3 (440a3cf80649:0)
Seeking to offset 0
Loading tape 'A00002L3'
Traceback (most recent call last):
  File "../server/server.py", line 2213, in ?
  File "../server/server.py", line 1733, in __init__
  File "../server/server.py", line 1946, in serve
  File "../server/server.py", line 1547, in read
  File "../server/server.py", line 1684, in line_input
  File "../server/server.py", line 287, in __init__
  File "./backup.py", line 2230, in backup
  File "./verify.py", line 396, in verify_pass
  File "./verify.py", line 104, in __init__
  File "./verify.py", line 329, in verify
  File "./verify.py", line 144, in position_next_volume
  File "./restore.py", line 295, in load_volume
  File "./restore.py", line 227, in next_tape_loader
  File "./restore.py", line 33, in unload_changer
  File "./archival.py", line 809, in empty_changer_drive
  File "./tape_hw.py", line 123, in eject
  File "./tape_hw.py", line 22, in scsi_command
IOError: (0x0B:0x4E00) Overlapped commands attempted
...
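
A note for anyone decoding the last line of that traceback: the pair in `(0x0B:0x4E00)` looks like a SCSI sense key followed by the additional sense code and qualifier (ASC/ASCQ). Read that way, sense key 0x0B is ABORTED COMMAND and ASC/ASCQ 4Eh/00h is OVERLAPPED COMMANDS ATTEMPTED, which agrees with the message text. Below is a minimal sketch under that assumption. The helper name `decode_sense` and the partial lookup tables are illustrative only and are not part of BRU.

```python
import re

# Partial tables based on the standard SCSI sense-key and ASC/ASCQ
# assignments. Only a few entries are listed; this is not a complete decoder.
SENSE_KEYS = {
    0x02: "NOT READY",
    0x03: "MEDIUM ERROR",
    0x04: "HARDWARE ERROR",
    0x05: "ILLEGAL REQUEST",
    0x06: "UNIT ATTENTION",
    0x0B: "ABORTED COMMAND",
}
ASC_ASCQ = {
    (0x4E, 0x00): "OVERLAPPED COMMANDS ATTEMPTED",
}

# Assumed format: "(0xKK:0xAAQQ)" = sense key, then ASC and ASCQ packed together.
SENSE_RE = re.compile(
    r"\(0x([0-9A-Fa-f]{2}):0x([0-9A-Fa-f]{2})([0-9A-Fa-f]{2})\)"
)

def decode_sense(text):
    """Hypothetical helper: decode a '(0xKK:0xAAQQ)' pair found in text."""
    match = SENSE_RE.search(text)
    if match is None:
        return None
    key, asc, ascq = (int(group, 16) for group in match.groups())
    return {
        "sense_key": SENSE_KEYS.get(key, "unknown"),
        "asc_ascq": ASC_ASCQ.get((asc, ascq), "unknown"),
    }

if __name__ == "__main__":
    line = "IOError: (0x0B:0x4E00) Overlapped commands attempted"
    print(decode_sense(line))
```

An aborted command with overlapped commands attempted generally means the target received a new command from an initiator while an earlier one from the same initiator was still outstanding, so it says more about command sequencing at the device than about the tape media.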



$ cat /Library/Logs/panic.log
...
Sat Mar 11 18:00:59 2006
panic(cpu 1 caller 0x009A50A0): DART entry exception: HyperTransport write logical page 0x008C0


Latest stack backtrace for cpu 1:
Backtrace:
0x00095718 0x00095C30 0x0002683C 0x009A50A0 0x00861D70 0x00861DA8 0x00831048 0x002D0040
0x002CEF08 0x000A9894
Kernel loadable modules in backtrace (with dependencies):
com.apple.driver.AppleGPIO(1.1.9d0)@0x85f000
dependency: com.apple.driver.IOPlatformFunction (1.8.0d12)@0x470000
com.apple.driver.MacIOGPIO(1.1.9d0)@0x82f000
com.apple.driver.AppleMacRISC4PE(1.8.5d0)@0x99e000
dependency: com.apple.iokit.IOPCIFamily(1.7)@0x458000
dependency: com.apple.driver.IOPlatformFunction (1.8.0d12)@0x470000
Proceeding back via exception chain:
Exception state (sv=0x00D99C80)
PC=0x00000000; MSR=0x0000D030; DAR=0x00000000; DSISR=0x00000000; LR=0x00000000; R1=0x00000000; XCP=0x00000000 (Unknown)


Kernel version:
Darwin Kernel Version 8.5.0: Sun Jan 22 10:38:46 PST 2006; root:xnu-792.6.61.obj~1/RELEASE_PPC
*********



Suggested tests from Tolis Group's troubleshooting page [3]:

admin$ libctl load 1 0


admin$ tapectl display
Available Tape Devices:
ntape0: IBM ULTRIUM-TD3 54K1

admin$ libctl display
Available Tape Changers:
changer0: EXABYTE Exabyte 221L 11.6


admin$ tapectl -v -f ntape0 status
Vendor = IBM , Model = ULTRIUM-TD3
Revision Level = 54K1
Medium Type: 0x38 (loaded)
Density Code: 0x44 - LTO Gen 3
BlockSize: 0
At block 0
Currently at BOT

admin$ tapectl -f ntape0 rewind
admin$ echo $?
0


admin$ libctl -v -f changer0 status
Vendor = EXABYTE , Model = Exabyte 221L
Revision Level = 11.6
Unit has barcode reader
Robots: 1 (86), Drives: 1 (82), Tape Slots: 21 (1 - 21) Number of Import/Export Slots: 1 (81).
Drive 0: Full : Tape From Slot 1
IE Slot 1: Empty
Slot 1: Empty
Slot 2: Full : Ready : A00002L3
Slot 3: Full : Ready : A00003L3
Slot 4: Full : Ready : A00004L3
Slot 5: Full : Ready : A00005L3
Slot 6: Full : Ready : A00006L3
Slot 7: Full : Ready : A00007L3
Slot 8: Full : Ready : A00008L3
Slot 9: Full : Ready : A00009L3
Slot 10: Full : Ready : A00010L3
Slot 11: Full : Ready : A00011L3
Slot 12: Full : Ready : A00012L3
Slot 13: Full : Ready : A00013L3
Slot 14: Full : Ready : A00014L3
Slot 15: Full : Ready : A00015L3
Slot 16: Full : Ready : A00016L3
Slot 17: Full : Ready : A00017L3
Slot 18: Full : Ready : A00018L3
Slot 19: Full : Ready : A00019L3
Slot 20: Full : Ready : A00020L3
Slot 21: Full : Ready : CLNA01L3




admin$ tapectl -f ntape0 rewoff ; libctl -f changer0 unload


admin$ libctl -f changer0 status
Vendor = EXABYTE , Model = Exabyte 221L
Revision Level = 11.6
Unit has barcode reader
Robots: 1 (86), Drives: 1 (82), Tape Slots: 21 (1 - 21) Number of Import/Export Slots: 1 (81).
Drive 0: Empty
IE Slot 1: Empty
Slot 1: Full : Ready : A00001L3
Slot 2: Full : Ready : A00002L3
Slot 3: Full : Ready : A00003L3
Slot 4: Full : Ready : A00004L3
Slot 5: Full : Ready : A00005L3
Slot 6: Full : Ready : A00006L3
Slot 7: Full : Ready : A00007L3
Slot 8: Full : Ready : A00008L3
Slot 9: Full : Ready : A00009L3
Slot 10: Full : Ready : A00010L3
Slot 11: Full : Ready : A00011L3
Slot 12: Full : Ready : A00012L3
Slot 13: Full : Ready : A00013L3
Slot 14: Full : Ready : A00014L3
Slot 15: Full : Ready : A00015L3
Slot 16: Full : Ready : A00016L3
Slot 17: Full : Ready : A00017L3
Slot 18: Full : Ready : A00018L3
Slot 19: Full : Ready : A00019L3
Slot 20: Full : Ready : A00020L3
Slot 21: Full : Ready : CLNA01L3
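
The two `libctl ... status` listings above differ only in Drive 0 and Slot 1, which is what a successful manual unload should look like. For longer listings the comparison can be automated. The sketch below assumes the line layout shown in this message (`Drive N: ...`, `IE Slot N: ...`, `Slot N: ...`); the function names `parse_status` and `changed_elements` are made up for illustration and are not part of the BRU tools.

```python
import re

# Matches element lines such as:
#   "Drive 0: Full : Tape From Slot 1"
#   "IE Slot 1: Empty"
#   "Slot 2: Full : Ready : A00002L3"
ELEMENT_RE = re.compile(r"^(Drive|IE Slot|Slot)\s+(\d+):\s*(Full|Empty)(.*)$")

def parse_status(output):
    """Map (element kind, number) to (state, list of extra fields)."""
    elements = {}
    for line in output.splitlines():
        match = ELEMENT_RE.match(line.strip())
        if match is None:
            continue  # skip vendor, revision, and summary lines
        kind, number, state, rest = match.groups()
        extra = [field.strip() for field in rest.split(":") if field.strip()]
        elements[(kind, int(number))] = (state, extra)
    return elements

def changed_elements(before, after):
    """Return the sorted element keys whose state or details differ."""
    old, new = parse_status(before), parse_status(after)
    return sorted(key for key in old.keys() | new.keys()
                  if old.get(key) != new.get(key))

if __name__ == "__main__":
    before = """Drive 0: Full : Tape From Slot 1
IE Slot 1: Empty
Slot 1: Empty
Slot 2: Full : Ready : A00002L3"""
    after = """Drive 0: Empty
IE Slot 1: Empty
Slot 1: Full : Ready : A00001L3
Slot 2: Full : Ready : A00002L3"""
    for key in changed_elements(before, after):
        print(key)
```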


[1] http://www.exabyte.com/products/datasheets/appleintegration.pdf
[2] http://www.tolisgroup.com/docs/Apple-FC-guide.pdf
[3] http://www.tolisgroup.com/OSX_Troubleshooting.html

I replaced the tape library and tape drive and updated the BRU software. Now I can successfully verify my full backup that spans 3 tapes without the server crashing. I was also able to speak with Tolis Group paid support, and the tech I spoke with was very knowledgeable and helpful.


mark andrew nassy
email@hidden

_______________________________________________
Do not post admin requests to the list. They will be ignored.
Xsan-Users mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/xsan-users/email@hidden

This email sent to email@hidden
