Changes between Version 7 and Version 8 of Internal/NodeFailureModes


Timestamp:
Feb 19, 2009, 9:26:38 PM (15 years ago)
Author:
ssugrim
  • Internal/NodeFailureModes

    v7 v8  
    1 {{{
    2 #!rst
     1= List of Node Failures =
    3 2
    4 List of Common Node Failures
    5 ============================
    6 
    7 
    8 +------------------------------------------------+--------+----------------------------------------+
    9 | Failure Mode                                   | List of|Solution - Notes                        |
    10 |                                                | Nodes  |                                        |
    11 +================================================+========+========================================+
    12 |Pxe Halt - Locks up during execution of PXE code|[1,5]   |- Multiple resets (more than 1)         |
    13 |                                                |        |  may be required                       |
    14 |                                                |        |- Might require node Change             |
    15 +------------------------------------------------+--------+----------------------------------------+
    16 |Dead Node ID box top LED (the blinking one)     |[1,5]   |- Power cycle Fixed it                  |
    17 |                                                |        |- Rabbit Issue?                         |
    18 +------------------------------------------------+--------+----------------------------------------+
    19 |First Power on Halt                             |[3,8]   |- Locks during the first attempt        |
    20 |                                                |[17,4]  |- [3,8] Post after reset                |
    21 |                                                |        |- [17,4] no serial console output       |
    22 |                                                |        |- Change node?                          |
    23 +------------------------------------------------+--------+----------------------------------------+
    24 |Disk Failure - Not detected on POST             |Fix Me  |- Change disk                           |
    25 +------------------------------------------------+--------+----------------------------------------+
    26 |Disk Failure -  Kernel throws write errors      |Fix Me  |- Change disk                           |
    27 +------------------------------------------------+--------+----------------------------------------+
    28 }}}
     3|| ''' Node ''' || ''' Failure Mode ''' || ''' Solution / Notes ''' ||
     4|| [1,5] ||Pxe Halt - Locks up during execution of PXE code ||Multiple resets (more than 1) [[BR]] may be required [[BR]] Might require node Change ||
     5|| [1,5] ||Dead Node ID box top LED (the blinking one) || Power cycle Fixed it [[BR]] Rabbit Issue? ||
     6|| [3,8] ||First Power on Halt || Locks during the first attempt [[BR]] [3,8] Post after reset ||
     7|| [17,4] ||First Power on Halt || Locks during the first attempt [[BR]] [17,4] no serial console output ||
    29 8