[TriLUG] diagnostic/stress test software
Steve Litt
slitt at troubleshooters.com
Fri Jun 8 09:05:36 EDT 2007
On Thursday 07 June 2007 13:02, Jason Watts wrote:
> Gents,
>
> we have a linux server that keeps has hung twice on boot this month (its
> restarted 3times a week)
>
> we are looking for a self booting diagnostic tool that will do both
> diagnostic and stress testing.
>
> we have found one "
> http://www.smithmicro.com/default.tpl?group=product_full&sku=CKDWINEE&prodv
>iew=intro" , but i wanted to see if you guys had any other recomendations?
>
> jsn
If I had to fix this box, I'd try booting til it hangs, and see whether it
completes memory counting or not. If it doesn't complete counting memory,
it's not the OS.
I'd wait until I could reproduce it within 10 reboots. I would see whether it
counts memory on boot, when it hangs. If it doesn't complete counting of
memory, I'd do this:
I'd disconnect every peripheral from the motherboard, with only video card,
RAM and video card still connected, and boot it about 50 times to see if it
counts memory (obviously with the hard disks disconnected no OS will run). If
it does not hang I'd start putting back peripherals, booting many times for
each peripheral, until it hangs again. I'd keep a written log of the
reconnections and boot attempts. Obviously, when it hangs again, suspect the
last reconnected periperhal. If, after all periperhals were reconnected, it
still doesn't hang even with 50 boots, then assume it was a bad connection,
use electronic lubricant:
http://www.troubleshooters.com/tpromag/200310/200310.htm
If, with all peripherals disconnected, it still hangs, then it's either
memory, video card, processor or mobo. Swap the video card. Try booting with
one stick at a time and vary the stick. If it's the mobo or CPU, replace em.
If when hanging it always completes memory counting, throw in a different disk
with a tiny OS, and ascertain that it will boot 50 times out of 50 times. If
so, you've isolated it to your hard disk or your OS.
HTH
SteveT
Steve Litt
Author: Universal Troubleshooting Process books and courseware
http://www.troubleshooters.com/
More information about the TriLUG
mailing list