« Back to Blog

Behind the Scenes at Jungle Disk - Maintaining Uptime with a Log Management Platform

By Chris (Rain) Avila
Jan 23, 2017

Analyzing log data through a log management platform is a great way to maintain uptime of critical systems. At Jungle Disk, we recently migrated away from our previous log management platform and have begun using a new system called Logentries.

Logentries has many of the same features of our previous log management platform, however, it differentiated itself in a few major ways. To start, Logentries recognizes key:value pairs in logs. This means that if you can get a JSON output from your apps, you can take that and specifically search for red flags. If we identify a “level”:”fatal” error on any disks being deleted, we can quickly and easily use a compound where clause to identify the name of the object(s) which had an error and the reason. If your app cannot provide you a login JSON, Logentries also supports regex.

The most frequent use we see is when used in conjunction with their “Alerts” system. Logentries has three different types of alerts including Basic Tag, Anomaly and Inactivity.

Alerts generated in Logentries can be sent to e-mail. I don’t find e-mail to be a method that helps you be proactive about alerts. We already use PagerDuty as a method of notifying us of any issues with servers and, fortunately, Logentries hooks right in. Now, any alerts we receive will give us a call letting us know an alert was triggered. If you don’t use PagerDuty, you can receive that alert from a few other methods such as Campfire, HipChat, a custom webhook or Slack. Being able to get alerts in this way from logs really helps our team stay on top of what’s going on in our environment.

Logentries ends up being much more than a log aggregator and filter for our team given its powerful alerting capabilities and integrations. We can see it as a crucial part in helping maintain uptime for our business.