[TriLUG] System monitoring tool

matusiak dave at matusiak.org
Fri Apr 23 11:46:27 EDT 2004


Big Brother rocks!  http://bb4.com/

free for non-commercial use; otherwise a license applies.  you can do 
all you list below (don't know about print queues) and has a developer 
site where anyone can send useful code.  great community!!   
http://www.deadcat.net/

good luck!
/drm

On Apr 23, 2004, at 11:32 AM, Ron Joffe wrote:

> I'm looking for some suggestions for an open source tool (or set of 
> tools)
> which would allow me to monitor a number of customer systems.
>
> At each customer site I have a number of Linux servers. On each server 
> I
> currently run a number of shell scripts out of cron for the following
> processes:
>
> 1. Check disk space on given local partitions.
> 2. Check multiple types of on disk error logs (these are typically os 
> and
> application logs which I scan for keywords.
> 3. Check multiple application status (i.e. is an oracle process 
> currently
> running)
> 4. Check within oracle database for certain errors (using SQL 
> statements)
> 5. Validate status of print queues.
> 6. Ping other servers in the network
>
> Currently I have these processes running out of cron on a regular basis
> (timing depends on a number of factors but can be between every minute 
> to
> every hour).
>
> If a problem occurs, then I have set up a list of email's to which the 
> system
> mails the errors.
>
> What I am lacking is a process that allows me to use more of a 
> centralized
> approach, and a more hierarchy as to the email's that the alerts 
> generate.
>
> For example if disk space is filling up, I would like person #1 to get 
> a
> single email when it reaches a threshold, and when nobody responds 
> within X
> minutes and correct s the issue, then send email to person #2 etc.
>
> Also I am looking for a central "Dashboard" to give me an overview of 
> system
> status. However the client systems would have to connect to my "central
> server" to update. They would have to push info up, rather then my 
> central
> server querying them. This is due to network / firewall configurations.
>
> I have looked at sourceforge, etc and have found a number of 
> interesting
> projects (Zabbix, OpenNMS, OSSIM, etc etc). Does anyone have any 
> experience,
> suggestions as to which product would fit?
>
> I can spend more time and modify my scripts/code to do this, but just
> wondering what others are using for similar processes?
>
> Thanks
>
> Ron




More information about the TriLUG mailing list