Feature Request: Mechanism for monitoring LSF state via scout, nagios, etc #245

mipearson · 2014-08-17T03:04:10Z

We've had quite a few instances where LSF hasn't been able to ship logs to logstash. This is usually due to firewall errors, network interruptions or misconfiguration of LSF/logstash. Most recently this was because the certificate I created had an expiry of 1 month.

While it's possible to monitor the logstash end (did I receive any events from this host in the last 30 minutes?), it'd be much nicer to be able to get LSF's side of the story, too.

I recommend that this work in a similar way to "show slave status" in mysql: a human & machine readable state file which shows:

the datetime of the most recently read log line
the datetime of the most recently successfully shipped log line
the number of log lines in the send queue

A nagios or scout plugin could then easily error if the time between those two points is too large.

JPvRiel · 2015-02-27T15:03:53Z

At our organisation, we're also keen to logstash-forwarder gain more intelligent ways to be managed and monitored.

On Linux, I see it does output to syslog, but of course, unless I'm using something like rsyslog to output to elasticsearch directly or another syslog server, I'm not going to know from a 'central' location how my thousands of logash-forwarders (agents are doing).

Perhaps a heartbeat/health function or API could be added, just like there are plans for the main logstash service at elastic/logstash#2611

tbragin · 2015-07-31T17:42:11Z

Duplicate of #183

tbragin closed this as completed Jul 31, 2015

tbragin added the duplicate label Jul 31, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Mechanism for monitoring LSF state via scout, nagios, etc #245

Feature Request: Mechanism for monitoring LSF state via scout, nagios, etc #245

mipearson commented Aug 17, 2014

JPvRiel commented Feb 27, 2015

tbragin commented Jul 31, 2015

Feature Request: Mechanism for monitoring LSF state via scout, nagios, etc #245

Feature Request: Mechanism for monitoring LSF state via scout, nagios, etc #245

Comments

mipearson commented Aug 17, 2014

JPvRiel commented Feb 27, 2015

tbragin commented Jul 31, 2015