SRDB ID   Synopsis   Date
47291   Sun Fire[TM] 12K/15K: WDC Event on CPU### at TL=0   25 Sep 2002

Status Issued

Description

The following error messages appear in the domain messages file:

NOTICE: [AFT0] WDC Event on CPU419 at TL=0, errID 0x0000002f.517933f1 
AFSR 0x00300440<ME,PRIV,UCC,WDC>.000000a8 AFAR 0x000001c1.fa859580 INVALID 
Fault_PC 0x100596ec Esynd 0x00a8 AMBIGUOUS SB13/P3/E0 J7400 
[AFT0] errID 0x0000002f.517933f1 Data Bit 95 was in error and corrected 
Mar 14 15:52:12 2002 NOTICE: [AFT0] UCC Event on CPU419 in Privileged mode at 
TL =0, errID 0x0000002f.51e2a326                                     
SOLUTION SUMMARY:

Explanation:

From Kevin Siebenthal:

A WDC event, is a correctable Data Cache Parity Error during a Write-back operation. Since we only detect PEs when reading out of the data cache, and a writeback operation is generated when a dirty (modified) cache line is being pre-empted for a new cache line, and therefore, the modified cache line has to be written back to memory, it is safe to assume that the CPU was not directly involved in the Parity Error other than the fact that it occurred in the ECache connected to it.

It is overwhelmingly likely that the error is in the ECache SRAMs (It is slightly possible the failure lies in the CPU itself), but we swap the same FRU (System Board) to correct this regardless of whether it is the CPU or the Ecache SRAM which caused the failure.

Action:

For multiple occurrences of such an error on the same Ecache SRAM and the same bit, replace the implicated SB. The actual bad part, as stated above is most likely the Ecache SRAM, but the SRAM is not a FRU. Therefore, the SB is the FRU which would be replaced.

For a single occurrence of a correctable Data Cache Parity Error, no action should be taken except to log the failure and monitor the domain to be sure it doesn't repeat on the same bit. This is similar to an ECC correctable memory error and should be treated as such.

Keywords: WDC, TL=0, event

INTERNAL SUMMARY:

SUBMITTER: Joshua Freeman APPLIES TO: AFO Vertical Team Docs/HAS, Hardware/Sun Fire /15000 ATTACHMENTS:


Copyright (c) 1997-2003 Sun Microsystems, Inc.