Document fins/I0779-1


FIN #: I0779-1

SYNOPSIS: Ultra 5/10 workstations may encounter uncorrectable data errors on
          IDE disks during boot after using shutdown -y -i5 -g0 command

DATE: Feb/22/02

KEYWORDS: Ultra 5/10 workstations may encounter uncorrectable data errors on
          IDE disks during boot after using shutdown -y -i5 -g0 command


---------------------------------------------------------------------
- Sun Proprietary/Confidential: Internal Use Only -
---------------------------------------------------------------------  
                            FIELD INFORMATION NOTICE
                  (For Authorized Distribution by SunService)



SYNOPSIS: Ultra 5/10 workstations may encounter uncorrectable data errors 
          on IDE disks during boot after using shutdown -y -i5 -g0 command.
          

SunAlert:           Yes

TOP FIN/FCO REPORT: Yes 

PRODUCT_REFERENCE:  Ultra 5/10 with IDE drives

PRODUCT CATEGORY:   Storage / Disks


PRODUCTS AFFECTED:

Systems Affected
----------------
Mkt_ID   Platform   Model   Description   Serial Number
------   --------   -----   -----------   -------------
  -        A21 	     ALL    Ultra 5             -
  -        A22       ALL    Ultra 10            -


X-Options Affected
------------------
Mkt_ID   Platform   Model   Description                           Serial Number
------   --------   -----   -----------                           -------------
X6174A      -         -     X-Opt Internal 20GB Drive for U5/U10 SB1	-
X5236A      -         -     X-Opt Internal 9.1GB Drive for U5/U10	-
X5227A      -         -     X-Opt Internal 4.3GB Drive for U10	        -



PART NUMBERS AFFECTED:

Part Number      Description                               Model
-----------      -----------                               -----
370-4327-03      20GB Disk 3 1/2" 7200 RPM Ultra ATA/100     -
370-3693-01      9.1GB Disk 3 1/2" 7200 RPM Ultra ATA/66     -
370-3176-01      4.3GB Disk 3 1/2" 4500 RPM Ultra ATA-3      -
370-3692-01      4.3GB Disk 3 1/2" 5400 RPM Ultra ATA-3      -
370-3863-02      8.4GB Disk 3 1/2" 5400 RPM Ultra ATA-4      -


REFERENCES:

BugId:   4380416 - init 5 corrupts filesystems on ultra-10 440MHz on 
                   2.5.1 systems.
         4435428 - darwin:EIDE disk with write-cache enabled should be 
                   flushed before power-off.
         4337637 - grover: IDE doesn't complete all writes.
         
PatchId: 106197 - SunOS 5.5.1: Jumbo patch for ide and atapi fixes.
         103640 - SunOS 5.5.1: kernel, nisopaccess, & libthread patch.
         106407 - SunOS 5.6: Jumbo patch for ide and atapi fixes.
         105181 - SunOS 5.6: Kernel update patch. 
         108974 - SunOS 5.8: dada, uata, dad, sd and scsi drivers patch.
         108528 - SunOS 5.8: kernel update patch.
 
ESC:     529449 - init 5 corrupts filesystems on ultra5.
         530062 - darwin:EIDE disk with write-cache enabled should be 
                  flushed before power-off.

	
PROBLEM DESCRIPTION:

On Sun Ultra 5/10 workstations, use of the following shutdown(1M)
command 'shutdown -y -i5 -g0' might result in the IDE disk drive not
getting "sync'ed" properly.  If the disk is not "sync'ed" (see
sync(1M))
properly, "Uncorrectable data Errors" are reported which may lead to
unnecessary disk replacement.

Failing Configuration: Ultra 5/10 systems with internal IDE drives.

Here are two different cases of error messages which may be seen in the
/var/sadm/messages files upon boot up after using the shutdown -y -i5
-g0 command:

    Case 1
    ------
        Jan 9 15:04:16 Ultra-10  unix: WARNING:
        /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):
        Jan 9 15:04:16 Ultra-10        Uncorrectable data Error: Block 289ed0
        Jan 9 15:04:18 Ultra-10  unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0

(dad1):
        Jan 9 15:04:18 Ultra-10        disk not responding to selection
        Jan 9 15:04:18 Ultra-10  unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@0,0

(dad0):
        Jan 9 15:04:18 Ultra-10   ATA transport failed: reason `reset`:
retrying 
command
        Jan 9 15:04:22 Ultra-10  unix: dad1:    disk okay
        Jan 9 15:42:06 Ultra-10  unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@2,0

(dad1):
        Jan 9 15:42:06 Ultra-10  Uncorrectable data Error: Block 8ca48
        Jan 9 15:42:07 Ultra-10  unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@2,0

(dad1):
        Jan 9 15:42:07 Ultra-10  disk not responding to selection
        Jan 9 15:42:07 Ultra-10  unix: dad1:      disk okay
        Jan 9 15:45:26 Ultra-10  unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@2,0

(dad1):
        Jan 9 15:45:26 Ultra-10  Uncorrectable data Error: Block 5201c4
        Jan 9 15:45:27 Ultra-10  disk not responding to selection
   
   Case 2
   ------     
        Mar 29 11:04:16 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0

(dad1):
        Mar 29 11:04:16 Ultra-10        Uncorrectable data Error: Block 289ed0
        Mar 29 11:04:18 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0

(dad1):
        Mar 29 11:04:18 Ultra-10        disk not responding to selection
        Mar 29 11:04:18 Ultra-10 unix: WARNING: /pci@1f,0/pci@1,1/ide@3/dad@0,0

(dad0):
        Mar 29 11:04:18 Ultra-10        ATA transport failed: reason 'reset': 
retrying command
        Mar 29 11:04:22 Ultra-10 unix: dad1:    disk okay
                                           
   NOTE: In some cases, the fsck(1M) command may need to be run manually
         to repair file systems when the system is booted.  But in most 
         cases fsck activation during reboot.

Unlike SCSI disks, IDE drives have their internal write cache enabled.
Whenever the system is powered down, the data in the disk cache is not
flushed causing possible data corruption.

The fix requires writing a new entry point in the IDE driver that will
send a disk flush command before powering down the disk.


IMPLEMENTATION:
 
        ---
       |   |  	MANDATORY (Fully Pro-Active)
  	---    
  	 
  
        ---
       | X | 	CONTROLLED PRO-ACTIVE (per Sun Geo Plan) 
        --- 
  	 
  	   			
        ---
       |   | 	REACTIVE (As Required)
        ---

	
CORRECTIVE ACTION:

The following recommendation is provided as a guideline for authorized
Enterprise Services Field Representatives who may encounter the above 
mentioned problem.

The recommended workaround is to use the "-i0" option instead of the
"-i5" option for shutdown:

    # shutdown -y -i0 -g0                        

Please install the following patches as a final solution:

     ---------------------------------------------------------
    | Solaris |  Solaris patchId       | IDE PatchId          |
    |=========================================================| 
    | 2.5.1   |  103640 (or later)  | 106197 (or later) |  
    |---------+------------------------+----------------------|
    | 2.6     |  105181 (or later)  | 106407 (or later) |
    |---------+------------------------+----------------------|
    | 8       |  108528 (or later)  | 108974 (or later) |
     ---------------------------------------------------------

     NOTE: For Solaris 7, patch numbers are not yet available.


COMMENTS:

There have been cases in the field that the disk drives were
inadvertently replaced.

The boot disk is less prone to data loss as this is not used normally
to store user data.  But any other internal IDE drives could cause
substantial damage to the customer as data perceived by applications to
have been written to the disks are lost.

--------------------------------------------------------------------------

Implementation Footnote:
________________________

i)   In case of MANDATORY FINs, Enterprise Services will attempt to contact   
     all affected customers to recommend implementation of the FIN. 
        

ii)  For CONTROLLED PROACTIVE FINs, Enterprise Services mission critical sup-  

     port teams will recommend implementation of the FIN (to their respective 
     accounts), at the convenience of the customer. 


iii) For REACTIVE FINs, Enterprise Services will implement the FIN as the need 

     arises.

--------------------------------------------------------------------------
 
All released FINs and FCOs can be accessed using your favorite network 
browser as follows:
 
SunWeb Access: 
______________
 
* Access the top level URL of http://cte.ebay/FIN_FCO/

* From there, select the appropriate link to query or browse the FIN and
  FCO Homepage collections.
 
SunSolve Online Access:
_______________________
 
* Access the SunSolve Online URL at http://sunsolve.Corp/

* From there, select the appropriate link to browse the FIN or FCO index.
 
Internet Access:
_______________
 
* Access the top level URL of https://infoserver.Sun.COM
 
--------------------------------------------------------------------------
    
General:
________

Send questions or comments to finfco-manager@cte.Corp

--------------------------------------------------------------------------


Copyright (c) 1997-2003 Sun Microsystems, Inc.