    [PATCH] powerpc/pseries: clear PCI failure counter if no new failures · ac325acd
    Linas Vepstas authored
    The current PCI error recovery system keeps track of the number of PCI card
    resets, and refuses to bring a card back up if this number is too large.
    The goal of doing this was to avoid an infinite loop of resets if a card is
    obviously dead.  However, if the failures are rare, but the machine has a
    high uptime, this mechanism might still be triggered; this is too harsh.
    
    This patch avoids this problem by decrementing the fail count after an
    hour.  Thus, as long as a PCI card fails fewer than 6 times an hour, it
    will continue to be reset indefinitely.  If its failure rate is greater
    than that, it will be taken off-line permanently.
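    
    The decay rule described above can be modelled in a few lines of
    user-space C.  This is only a sketch, not the code from the patch: the
    names struct card, handle_failure() and expire_old_failures() are
    hypothetical, the limit of 5 is an assumption inferred from "fewer than
    6 times an hour", and time is passed in as plain seconds rather than
    measured by sleeping in the event handler.

```c
#include <assert.h>
#include <stdbool.h>

#define MAX_ALLOWED_FREEZES 5     /* assumed limit: a 6th failure within the hour is fatal */
#define DECAY_SECONDS       3600L /* each counted failure is forgiven after one hour */

/* Hypothetical per-card state; not the kernel's real data structure. */
struct card {
    long expiry[MAX_ALLOWED_FREEZES]; /* FIFO: when each counted failure is forgiven */
    int  head;                        /* index of the oldest live entry */
    int  freeze_count;                /* number of live entries in expiry[] */
    bool offline;                     /* true once the card is given up on */
};

/* Decrement the fail count once for every failure that is over an hour old. */
static void expire_old_failures(struct card *c, long now)
{
    while (c->freeze_count > 0 && c->expiry[c->head] <= now) {
        c->head = (c->head + 1) % MAX_ALLOWED_FREEZES;
        c->freeze_count--;
    }
}

/*
 * Record a failure at time `now` (in seconds).
 * Returns true if the card should be reset, false if it is (or was
 * already) taken off-line permanently.
 */
static bool handle_failure(struct card *c, long now)
{
    if (c->offline)
        return false;

    expire_old_failures(c, now);

    if (c->freeze_count >= MAX_ALLOWED_FREEZES) {
        c->offline = true;            /* too many failures in the last hour */
        return false;
    }

    /* Count this failure and remember when it should be forgiven. */
    int tail = (c->head + c->freeze_count) % MAX_ALLOWED_FREEZES;
    c->expiry[tail] = now + DECAY_SECONDS;
    c->freeze_count++;
    assert(c->freeze_count <= MAX_ALLOWED_FREEZES);
    return true;
}
```

    In this model a card that fails once every 30 minutes never has more
    than two failures counted against it and is reset every time, while a
    card that fails six times within five minutes is taken off-line on the
    sixth failure.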
    
    This patch is larger than it might otherwise be because it changes
    indentation by removing a pointless while-loop.  The while loop is not
    needed, as the handler is invoked once for each event (by schedule_work());
    the loop is leftover cruft from an earlier implementation.
    
    Signed-off-by: Linas Vepstas <linas@austin.ibm.com>
    Signed-off-by: Andrew Morton <akpm@osdl.org>
    Signed-off-by: Paul Mackerras <paulus@samba.org>
eeh_event.h 2.03 KB