Skip to content

vanilla/fluent-plugin-elasticsearch

 
 

Repository files navigation

Fluent::Plugin::Elasticsearch, a plugin for Fluentd

Gem Version Dependency Status Build Status Coverage Status Code Climate

I wrote this so you can search logs routed through Fluentd.

Installation

$ gem install fluent-plugin-elasticsearch

Usage

In your fluentd configuration, use type elasticsearch. Additional configuration is optional, default values would look like this:

host localhost
port 9200
index_name fluentd
type_name fluentd

More options:

hosts host1:port1,host2:port2,host3:port3

You can specify multiple elasticsearch hosts with separator ",".

If you specify multiple hosts, plugin writes to elasticsearch with load balanced. (it's elasticsearch-ruby's feature, default is round-robin.)

If you specify this option, host and port options are ignored.

time_key event_time # defaults to nil

By default, records are inserted into Elasticsearch as-is. This command allows fluentd to dynamically create a key containing the event time and merge it into the record because sending it to Elasticsearch. If this command is non-nil, use it as the name of this time key.

time_format %Y.%m.%d # defaults to nil (ISO-8601)

If time_key is being used, control the format of the dynamic time key's value. This will be passed as an argument to strftime().

Sharding settings

fluent-plugin-elasticsearch allows built-in sharding of records into time-boxed indexes. The following settings control this behavior.

shard true # defaults to false

This allows index_name rewriting to split data into shards by date.

shard_format %{prefix}-%{date}

Adjust the format of the resulting index name while sharding. Possible replacement strings are:

prefix, date, index, type

shard_prefix logs # defaults to index_name

Store records under this prefix. If this setting is not configured (left as nil), the value of index_name is used. To prevent this, explicitly define shard_prefix as false.

shard_dateformat %Y.%m.%d

Format the date part of the shard_format according to this date replacement string. This will be passed as an argument to strftime().

utc_index true

By default, the records inserted into index logstash-YYMMDD with utc (Coordinated Universal Time). This option allows to use local time if you set utc_index to false.

Logstash Pre-configuration

fluent-plugin-elasticsearch comes with some logstash setting pre-configured to allow easy integration with tools like kibana.

logstash_format true # defaults to false

Enabling this setting causes records to be sharded by date, in an index prefixed by "logstash".

logstash_prefix logstash

By default, with logstash_format=true, records are inserted into index logstash-YYMMDD. This option allows to insert into specified index like mylogs-YYMMDD.

logstash_dateformat %Y.%m.%d

By default, with logstash_format=true, a custom key called @timestamp is dynamically created inside each record using the event's time at the point of log ingestion. Control its format using this key. This will be passed as an argument to strftime().

{"@timestamp":"2014-04-07T000:00:00-00:00"}

include_tag_key true # defaults to false
tag_key tag # defaults to tag

This will add the fluentd tag in the json record. For instance, if you have a config like this:

<match my.logs>
  type elasticsearch
  include_tag_key true
  tag_key _key
</match>

The record inserted into elasticsearch would be

{"_key":"my.logs", "name":"Johnny Doeie"}

id_key request_id # use "request_id" field as a record id in ES

By default, all records inserted into elasticsearch get a random _id. This option allows to use a field in the record as an identifier.

This following record {"name":"Johnny","request_id":"87d89af7daffad6"} will trigger the following ElasticSearch command

{ "index" : { "_index" : "logstash-2013.01.01, "_type" : "fluentd", "_id" : "87d89af7daffad6" } }
{ "name": "Johnny", "request_id": "87d89af7daffad6" }

fluentd-plugin-elasticsearch is a buffered output that uses elasticseach's bulk API. So additional buffer configuration would be (with default values):

buffer_type memory
flush_interval 60
retry_limit 17
retry_wait 1.0
num_threads 1

Please consider using fluent-plugin-forest to send multiple logs to multiple ElasticSearch indices:

<match my.logs.*>
  type forest
  subtype elasticsearch
  remove_prefix my.logs
  <template>
    logstash_prefix ${tag}
    # ...
  </template>
</match>

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

If you have a question, open an Issue.

About

No description or website provided.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Ruby 100.0%