SRDB ID | Synopsis | Date | ||
48121 | Sun Fire[TM] 12K/15K: PCI SERR panic after missing ce0 device on the X2222A adapter during boot | 29 Oct 2002 |
Status | Issued |
Description |
- Problem Statement: Missing ce0 device on the X2222A Sun[TM] Dual Fast Ethernet + Dual SCSI PCI Adapter during OBP probe can lead to a PCI SERR panic during OS boot - Symptoms: . The panic occurs while booting the OS. After the initial failure occurs, there are successive boot failures. . Exposure to this problem comes after a setkeyswitch on/HPOST execution of the domain. The problem can appear at install or after a period of stable operation and during normal reboot operations. . If OBP is unable to probe the ce0 interface of the X2222A adapter, this is the symptom of a condition that results in a PCI SERR panic while booting the OS. The signature is: WARNING: pcisch-2: PCI fault log start: PCI SERR PCI error occurred on device #6 dwordmask=0 bytemask=0 pcisch-2: PCI primary error (0):pcisch-2: PCI secondary error (0):pcisch-2: PBM AFAR 0.00000000: WARNING: pcisch2: PCI config space CSR=0x4280<signaled-system-error> pcisch-2: PCI fault log end.
SOLUTION SUMMARY:
- Troubleshooting: A. To determine if the panic is the X2222A card failure, execute the following commands at the OBP prompt: . Execute show-disks and probe-scsi-all to identify if there are missing scsi connections or disk targets. . Execute show-nets to identify if there are missing ce interfaces. ok show-nets a) /pci@1d,700000/pci@1/network@1 b) /pci@1c,700000/network@3,1 c) /pci@1c,700000/pci@1/network@1 d) /pci@1c,700000/pci@1/network@0 NOTE: /pci@1d,700000/pci@1/network@0 is the ce0 of this adapter and missing from the probe. B. Set the OBP variable diag-switch?=true to enable OBP device probing diagnostic output on the console: . The following is an example device probe of PCI B in a good state: Probing PCI B pci Probing /pci@1d,700000 Device 1 pci Probing /pci@1d,700000/pci@1 Device 0 network Probing /pci@1d,700000/pci@1 Device 1 network Probing /pci@1d,700000/pci@1 Device 2 scsi disk tape scsi disk tape Probing /pci@1d,700000/pci@1 Device 3 Nothing there Probing /pci@1d,700000/pci@1 Device 4 Nothing there Probing /pci@1d,700000/pci@1 Device 5 Nothing there Probing /pci@1d,700000/pci@1 Device 6 Nothing there Probing /pci@1d,700000/pci@1 Device 7 Nothing there Probing /pci@1d,700000/pci@1 Device 8 Nothing there Probing /pci@1d,700000/pci@1 Device 9 Nothing there Probing /pci@1d,700000/pci@1 Device a Nothing there Probing /pci@1d,700000/pci@1 Device b Nothing there Probing /pci@1d,700000/pci@1 Device c Nothing there Probing /pci@1d,700000/pci@1 Device d Nothing there Probing /pci@1d,700000/pci@1 Device e Nothing there Probing /pci@1d,700000/pci@1 Device f Nothing there Probing /pci@1d,700000 Device 2 bootbus-controller iosram Probing /pci@1d,700000 Device 3 pci108e,1100 network firewire usb . The following is an example device probe of PCI B in a failing state: Probing PCI B pci Probing /pci@1d,700000 Device 1 pci Probing /pci@1d,700000/pci@1 Device 1 network Probing /pci@1d,700000/pci@1 Device 2 scsi disk tape scsi disk tape Probing /pci@1d,700000/pci@1 Device 3 Nothing there Probing /pci@1d,700000/pci@1 Device 4 Nothing there Probing /pci@1d,700000/pci@1 Device 5 Nothing there Probing /pci@1d,700000/pci@1 Device 6 Nothing there Probing /pci@1d,700000/pci@1 Device 7 Nothing there Probing /pci@1d,700000/pci@1 Device 8 Nothing there Probing /pci@1d,700000/pci@1 Device 9 Nothing there Probing /pci@1d,700000/pci@1 Device a Nothing there Probing /pci@1d,700000/pci@1 Device b Nothing there Probing /pci@1d,700000/pci@1 Device c Nothing there Probing /pci@1d,700000/pci@1 Device d Nothing there Probing /pci@1d,700000/pci@1 Device e Nothing there Probing /pci@1d,700000/pci@1 Device f Nothing there Probing /pci@1d,700000 Device 2 bootbus-controller iosram Probing /pci@1d,700000 Device 3 pci108e,1100 network firewire usb NOTE: /pci@1d,700000/pci@1 Device 0 is missing from the probe. C. To determine that the panicked PCI device instance corresponds to the X2222A card with a missing ce device: . Perform a grep of a previously captured /etc/path_to_inst file for the pcisch instance = 2 (pcisch-2). Use explorer output if available. "/pci@1d,700000" 2 "pcisch" . Grep "pcisch2" from a previous successful start up in the /var/adm/messages file: pcisch2 at root: SAFARI 0x1d 0x700000 pcisch2 is /pci@1d,700000 . Alternatively, you can use the Solaris Device Path Decoder at http://kwyjibo.aus.sun.com. - Resolution: . Replace the identified X2222A card. This has resolved the problem in all previous instances of this bug. . Verify OBP sees all 5 PCI devices by setting diag-switch?=true: pci (bridge), 2x network, 2x scsi ports (where Device 2 represents the two port connections): Probing PCI B pci Probing /pci@1d,700000 Device 1 pci Probing /pci@1d,700000/pci@1 Device 0 network Probing /pci@1d,700000/pci@1 Device 1 network Probing /pci@1d,700000/pci@1 Device 2 scsi disk tape scsi disk tape . If the replacement card does not correct the panic, be certain to redo the troubleshooting steps above to confirm that the replacement card is not experiencing the same failure. . HPOST will be modified to drive the JTAG buss for the PCI adapters during the auto connect sequence to set TRST to low and change clock on. - Summary of part number and patch ID's X2222A - 501-5727-03 Dual FastEthernet + Dual SCSI PCI Adapter SMS 1.2 patch 112488-08 - References and bug IDs4723789 - PCI devices within Cauldron adapter intermittantly not seen4732416 - hpost needs to modify auto-connect to properly connect the Cauldron - Additional background information: None. - Meta-Data/Problem categorization: Product/Platform: SF15K/SF12K Category: hardware - Keywords ce missing panic PCI SERR pcisch adapter dual
INTERNAL SUMMARY:
SUBMITTER: Gino Valencia BUG REPORT ID: 4723789, 4732416 APPLIES TO: Hardware/Sun Fire /15000, Hardware/Sun Fire /12000 ATTACHMENTS: