4 | | List of Common Node Failures |
5 | | ============================ |
6 | | |
7 | | |
8 | | +------------------------------------------------+--------+----------------------------------------+ |
9 | | | Failure Mode | List of|Solution - Notes | |
10 | | | | Nodes | | |
11 | | +================================================+========+========================================+ |
12 | | |PXE Halt - Locks up during execution of PXE code|[1,5] |- Multiple resets (more than 1) | |
13 | | | | | may be required | |
14 | | | | |- Might require node Change | |
15 | | +------------------------------------------------+--------+----------------------------------------+ |
16 | | |Dead Node ID box top LED (the blinking one) |[1,5] |- Power cycle fixed it | |
17 | | | | |- Rabbit Issue? | |
18 | | +------------------------------------------------+--------+----------------------------------------+ |
19 | | |First Power on Halt |[3,8] |- Locks during the first attempt | |
20 | | | |[17,4] |- [3,8] Post after reset | |
21 | | | | |- [17,4] no serial console output | |
22 | | | | |- Change node? | |
23 | | +------------------------------------------------+--------+----------------------------------------+ |
24 | | |Disk Failure - Not detected on POST |Fix Me |- Change disk | |
25 | | +------------------------------------------------+--------+----------------------------------------+ |
26 | | |Disk Failure - Kernel throws write errors |Fix Me |- Change disk | |
27 | | +------------------------------------------------+--------+----------------------------------------+ |
28 | | }}} |
| 3 | || ''' Node ''' || ''' Failure Mode ''' || ''' Solution / Notes ''' || |
| 4 | || [1,5] || PXE Halt - Locks up during execution of PXE code || Multiple resets (more than 1) may be required [[BR]] Might require node change ||
| 5 | || [1,5] || Dead Node ID box top LED (the blinking one) || Power cycle fixed it [[BR]] Rabbit issue? ||
| 6 | || [3,8] || First Power on Halt || Locks up during the first attempt [[BR]] POSTs after reset ||
| 7 | || [17,4] || First Power on Halt || Locks up during the first attempt [[BR]] No serial console output ||