When Monolog meets ELK


Or: How to take care of your logs?


For this first article in two years (I know, it's been a while, did you miss me? :D), I'm going to talk about Monolog, Gelf and ELK. It's just a quick introduction, but you will find a lot of resources in this article.

Monolog

I'm sure you already know Monolog, the (almost) perfect logging library for PHP. :)

I strongly suggest you read the core concepts of Monolog if you're not familiar with channels, handlers and processors. In a few words: the channel is the name of the logger, handlers are its outputs, and processors add extra information to your log records.

As we will see at the end of this article, you can make really interesting filters with your channels and the extra data added by your processors.
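
Here is what these three concepts look like in code. A minimal sketch, assuming a Monolog 1.x setup (as in the Symfony2 era of this article) installed via Composer; the channel name and file path are arbitrary:

&lt;?php

require 'vendor/autoload.php';

use Monolog\Logger;
use Monolog\Handler\StreamHandler;

// Channel: "app" is simply this logger's name.
$logger = new Logger('app');

// Handler: an output; this one writes every record >= INFO to a file.
$logger->pushHandler(new StreamHandler('/tmp/app.log', Logger::INFO));

// Processor: a callable that enriches every record with extra data.
$logger->pushProcessor(function (array $record) {
    $record['extra']['hostname'] = gethostname();

    return $record;
});

$logger->info('Hello from the "app" channel');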

Gelf

Gelf stands for Graylog Extended Log Format. This format was created by Graylog to avoid syslog's inconveniences, like the message length limit and the lack of data types and compression.

Gelf messages can be sent over UDP (fortunately!) and, of course, the awesome Monolog provides a GelfHandler.

Here is an example of a Gelf message (it can be found in the specs):

{
  "version": "1.1",
  "host": "example.org",
  "short_message": "A short message that helps you identify what is going on",
  "full_message": "Backtrace here\n\nmore stuff",
  "timestamp": 1385053862.3072,
  "level": 1,
  "_user_id": 9001,
  "_some_info": "foo",
  "_some_env_var": "bar"
}

As described in the specs, some fields are mandatory (but don't worry about it, just let Monolog deal with that) and you can add as many additional fields as you wish.
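
Outside of any framework, wiring a GelfHandler by hand only takes a few lines. A minimal sketch, assuming the graylog2/gelf-php package is installed and that Logstash listens on logstash.example.com (a placeholder hostname):

&lt;?php

require 'vendor/autoload.php';

use Gelf\Publisher;
use Gelf\Transport\UdpTransport;
use Monolog\Handler\GelfHandler;
use Monolog\Logger;

// Send Gelf messages over UDP, on the default port (12201).
$transport = new UdpTransport('logstash.example.com', 12201);
$publisher = new Publisher($transport);

$logger = new Logger('app');
$logger->pushHandler(new GelfHandler($publisher, Logger::INFO));

// The default GelfMessageFormatter turns the context into
// additional fields (here, "_ctxt_user_id").
$logger->error('Something went wrong', ['user_id' => 9001]);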

In a Symfony application, here is a small example of a handler configuration that sends Gelf messages to Logstash:

#config_dev.yml
monolog:
    ...
    handlers:
        my_logstash_handler:
            type: gelf
            publisher:
                hostname: %logstash_host%
                port: %logstash_port%
            formatter: monolog.formatter.gelf_message
            level: INFO
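
Once this handler is enabled, every record of level INFO or above ends up in Logstash. For instance, from a Symfony2 controller (a sketch; the bundle, controller and message are hypothetical):

&lt;?php

namespace AppBundle\Controller;

use Symfony\Bundle\FrameworkBundle\Controller\Controller;

class DemoController extends Controller
{
    public function indexAction()
    {
        // The "logger" service dispatches to all configured handlers,
        // including my_logstash_handler defined above.
        $this->get('logger')->info('Someone hit the homepage');

        // ...
    }
}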

ELK

ELK is an acronym for ElasticSearch / Logstash / Kibana.

ElasticSearch

ElasticSearch is a very powerful distributed search engine which provides a RESTful API. In the ELK stack, ElasticSearch is the storage backend: all our logs will be stored inside an index.
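
Since everything goes through this RESTful API, you can query your logs with any HTTP client. A quick sketch in plain PHP, assuming ElasticSearch listens on localhost:9200 and Logstash writes to its default daily indices:

&lt;?php

// Search today's index for the last records containing "error".
$index = 'logstash-' . date('Y.m.d');
$query = json_encode([
    'query' => ['match' => ['message' => 'error']],
    'size'  => 10,
]);

$response = file_get_contents(
    sprintf('http://localhost:9200/%s/_search', $index),
    false,
    stream_context_create([
        'http' => [
            'method'  => 'POST',
            'header'  => "Content-Type: application/json\r\n",
            'content' => $query,
        ],
    ])
);

print_r(json_decode($response, true));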

Logstash

Logstash was created to manage logs: it collects, parses and stores them. There are a lot of existing inputs (41), filters (50) and outputs (55). For example, look at this configuration file:

input {
    gelf {
        codec => "json"
    }
}

output {
    elasticsearch {
        hosts => "elasticsearch:9200"
    }
}

We have configured a single input: gelf. As we saw, Gelf logs are sent through UDP on port 12201 by default and, of course, Logstash knows it.

There is no filter in this configuration as we don't really need one for this example; the Gelf messages will be sent directly to ElasticSearch.

And finally, there is an elasticsearch output. Logstash will call the ElasticSearch API to insert logs into an index generated per day (logstash-YYYY.MM.DD by default).

You can take a look at the full documentation for the gelf input and the elasticsearch output for more information.

Kibana

Kibana is a very powerful tool to explore and interact with your data. It's very easy to use and you can create a lot of dashboards to visualize all your logs. Take a look at the project homepage to see some examples.

Tips

Create dashboards for everything!

Is there a particular error in your production environment? Create a dashboard just for it! You will have all the available information in a single place, making it easier to aggregate information and understand when the error occurred.

Context is your friend, bro!

Of course, you all know PSR-3, which defines the standard Psr\Log\LoggerInterface (obviously used by Monolog). But did you read the "Context" section? Did you notice that all methods defined in the interface take a $context array as second argument? Do you use it? No? You should!

This context is the best way to provide more information with your logs. You can easily add all the information needed to know WHEN (for example) an error occurred. And once you send all this context to the ELK stack, you can easily filter your logs according to it. Does this error occur every time with the same user? Is it only with this particular entity? Anyway, just add context and you will be able to group your logs accordingly. :)
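
Here is a small sketch of the difference (the $user, $invoice and $filename variables are hypothetical; any PSR-3 logger works):

&lt;?php

// Without context: you only know that something failed.
$logger->error('Import failed');

// With context: in Kibana, you can now filter and group
// by user, entity or file.
$logger->error('Import failed', [
    'user_id'   => $user->getId(),
    'entity'    => 'Invoice',
    'entity_id' => $invoice->getId(),
    'file'      => $filename,
]);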

Tags: symfony2, monolog, elk, elasticsearch, logstash, kibana, log