Skip to content

Latest commit

 

History

History
213 lines (172 loc) · 8.72 KB

File metadata and controls

213 lines (172 loc) · 8.72 KB

About

This plugin is a substitute for the logstash-filter-elapsed plugin. The elapsed-plugin requires a transaction to be executed in a specified order and then decorates the last part of the transaction (or creates a new event) with the elapsed time. The order of which the parts of a transaction is received cannot always be predicted when using multiple workers for a pipeline. Hence the need for this plugin. This plugin, like elapsed, uses a unique identifier to pair events in a transaction. But instead of defining a start and an end for a transaction - only the unique identifier is used. Per default the transaction time is stored together with the unique identifier in a new event, which may be stored in the same or another index. The information from the first, last, oldest or newest event may be attached with the new transaction_time event.

Usage

The TransactionTime filter measures the time between two events in a transaction

This filter is supposed to be used instead of logstash-filters-elapsed when you know that the order of a transaction cannot be guaranteed. Which is most likely the case if you are using multiple workers and a big amount of events are entering the pipeline in a rapid manner.

The configuration:

    filter {
      transaction_time {
        uid_field => "Transaction-unique field"
        ignore_uid => []
        timeout => seconds
        timestamp_tag => "name of timestamp"
        replace_timestamp => ['keep', 'oldest', 'newest']
        filter_tag => "transaction tag"
        attach_event => ['first','last','oldest','newest','none']
        release_expired => [true,false]
        store_data_oldest => []
        store_data_newest => []
        periodic_flush => [true,false]
      }
    }
  • uid_field The only required parameter is "uid_field" which is used to identify the events in a transaction. A transaction is concidered complete when two events with the same UID has been captured. It is when a transaction completes that the transaction time is calculated.

  • ignore_uid The ignore_uid field takes an array of strings. These strings represent specific UIDs that should be ignored. This can be useful for ignoring parsing errors. Example:

    ignore_uid => ["%{[transactionUID][0]}", ""] 

    Will ignore events having empty string or "%{[transactionUID][0]}" in the uid_field.

  • timeout The timeout parameter determines the maximum length of a transaction. It is set to 300 (5 minutes) by default. The transaction will not be recorded if timeout duration is exceeded. The value of this parameter will have an impact on the memory footprint of the plugin.

  • timestamp_tag The timestamp_tag parameter may be used to select a specific field in the events to use when calculating the transaction time. The default field is @timestamp.

  • replace_timestamp The new event created when a transaction completes may set its own timestamp to when it completes (default) or it may use the timestamp of one of the events in the transaction. The parameter replace_timestamp is used to specify this behaviour.

  • filter_tag Since this plugin exclusivly calculates the time between events in a transaction, it may be wise to filter out the events that are infact not transactions. This will help reduce both the memory footprint and processing time of this plugin, especially if the pipeline receives a lot of non-transactional events. You could use grok and/or mutate to apply this filter like this:

     filter {
       grok{
         match => { "message" => "(?<message_type>.*)\t(?<msgbody>.*)\t+UID:%{UUID:uid}" }
       }
       if [message_type] in ["MaterialIdentified","Recipe","Result"."ReleaseMaterial"]{
         mutate {
           add_tag => "Transaction"
         }
       }
       transaction_time {
         uid_field => "UID"
         filter_tag => "Transaction"
       }
     }

In the example, grok is used to identify the message_type and then the tag "transaction" is added for a specific set of messages. This tag is then used in the transaction_time as filter_tag. Only the messages with this tag will be evaluated.

Note: Do not use reserved name "TransactionTime" which is added to all events created by this plugin

  • attach_event The attach_event parameter can be used to append information from one of the events to the new transaction_time event. The default is to not attach anything. The memory footprint is kept to a minimum by using the default value.

  • release_expired The release_expired parameter determines if the first event in an expired transactions should be released or not. Defaults to true

  • store_data_oldest/store_data_newest The parameters store_data_oldest and store_data_newest are both used in order to attach specific fields from oldest respectively newest event. An example of this could be:

   store_data_oldest => ["@timestamp", "work_unit", "work_center", "message_type"]
   store_data_newest => ["@timestamp", "work_unit", "work_center", "message_type"]

Which will result in the genereated transaction event inluding the specified fields from oldest and newest events in a hashmap named oldest/newest under the hash named "transaction_data" Example of output data:

"transaction_data" => {
       "oldest" => {
           "message_type" => "MaterialIdentified",
             "@timestamp" => 2018-10-31T07:36:23.072Z,
              "work_unit" => "WT000743",
            "work_center" => "WR000046"
       },
       "newest" => {
           "message_type" => "Recipe",
             "@timestamp" => 2018-10-31T07:36:28.188Z,
              "work_unit" => "WT000743",
            "work_center" => "WR000046"
       }
   }

Logstash Plugin

This is a plugin for Logstash.

It is fully free and fully open source. The license is Apache 2.0, meaning you are pretty much free to use it however you want in whatever way.

Documentation

Logstash provides infrastructure to automatically generate documentation for this plugin. We use the asciidoc format to write documentation so any comments in the source code will be first converted into asciidoc and then into html. All plugin documentation are placed under one central location.

Need Help?

Need help? Try #logstash on freenode IRC or the https://discuss.elastic.co/c/logstash discussion forum.

Developing

1. Plugin Developement and Testing

Code

  • To get started, you'll need JRuby with the Bundler gem installed.

  • Create a new plugin or clone and existing from the GitHub logstash-plugins organization. We also provide example plugins.

  • Install dependencies

bundle install

Test

  • Update your dependencies
bundle install
  • Run tests
bundle exec rspec

2. Running your unpublished Plugin in Logstash

2.1 Run in a local Logstash clone

  • Edit Logstash Gemfile and add the local plugin path, for example:
gem "logstash-filter-awesome", :path => "/your/local/logstash-filter-awesome"
  • Install plugin
bin/logstash-plugin install --no-verify
  • Run Logstash with your plugin
bin/logstash -e 'filter {awesome {}}'

At this point any modifications to the plugin code will be applied to this local Logstash setup. After modifying the plugin, simply rerun Logstash.

2.2 Run in an installed Logstash

You can use the same 2.1 method to run your plugin in an installed Logstash by editing its Gemfile and pointing the :path to your local plugin development directory or you can build the gem and install it using:

  • Build your plugin gem
gem build logstash-filter-awesome.gemspec
  • Install the plugin from the Logstash home
bin/logstash-plugin install /your/local/plugin/logstash-filter-awesome.gem
  • Start Logstash and proceed to test the plugin

Contributing

All contributions are welcome: ideas, patches, documentation, bug reports, complaints, and even something you drew up on a napkin.

Programming is not a required skill. Whatever you've seen about open source and maintainers or community members saying "send patches or die" - you will not see that here.

It is more important to the community that you are able to contribute.

For more information about contributing, see the CONTRIBUTING file.