[TriLUG] Re: [Dulug] Opteron >4GB w/RHEL 3

Joshua Baker-LePain jlb17 at duke.edu
Mon Feb 16 11:10:49 EST 2004


On Mon, 16 Feb 2004 at 9:12am, Mark T. Voelker wrote

> I'm working on setting up some new lab hardware, among which is a shiny
> new dual Opteron server running RHEL 3.  The box has two Opteron 244
> CPUs and 6GB of DDR ECC RAM installed in six 1GB sticks (there are 8
> total DIMM slots).  The motherboard is a Tyan S2882 (a.k.a. Thunder K8S
> Pro).  Everything seems to run fine in the limited testing I've done so
> far, but every few seconds I see this appear in the syslog:
> 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel: CPU 0: Silent Northbridge MCE 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel: Northbridge status a40000000005001b
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     GART TLB error generic level generic
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     extended error gart error 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     link number 0 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     error address valid 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     error uncorrected 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     previous error lost 
> Feb 12 18:04:11 rtp-wbu-sh-m1 kernel:     error address 00000000fafe1a68

Have you tried flashing the board to the most recent BIOS?  I know, e.g., 
that the Fedora preview release had issues with the Tyan boards unless 
they were completely up to date regarding the BIOS.

FWIW, I'm running RHEL3 on an 8GB RAM, dual Opteron box with the Rioworks 
HDAMA motherboard with no such odd messages in the logs.

-- 
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University


