Added line and column to plugin name #11155

Closed

@andsel wants to merge 21 commits from the feature/better_plugin_name branch

Conversation

@andsel (Contributor) commented Sep 20, 2019

Note
this PR is superseded by #11288 for the Java pipeline part; for the Ruby pipeline we still have to check whether it is worthwhile

[Feature Discussion]

We want to give the plugin a more useful name/id, so that it can easily be mapped back to the point in the pipeline it refers to. I think the best option would be to include a line number referring to the position in the pipeline config file, like input-L5. What do you think?
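
For example (a hypothetical config, just to illustrate the naming scheme):

    input {
      generator {}   # defined on line 2, so the generated id could be "input-L2"
    }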

solves #11154

@yaauie (Member) commented Oct 3, 2019

IIRC, the line_and_column isn't quite as helpful as we would expect it to be, because it represents the position of the element after concatenating all source configuration files together, and it is an especially common practice in large pipelines to split up the configuration into many source files.

@andsel (Contributor, Author) commented Oct 7, 2019

@yaauie thanks for the hint. We could do this:

  • in the management HTTP API for pipelines (localhost:9600/_node/stats/pipelines?pretty), expose the triple source file, line, column as a code reference
  • in the logs, keep the plugin.id
  • provide an HTTP API (e.g. localhost:9600/_node/pipelines/main/<plugin.id>) that returns the plugin's definition metadata (a sketch of a possible response follows)
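
Purely as an illustration, the response of such an endpoint might look like this (the field names and values are assumptions, not an existing Logstash API shape):

    {
      "id": "input-L5",
      "source": {
        "file": "/etc/logstash/conf.d/10-input.conf",
        "line": 5,
        "column": 3
      }
    }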

@yaauie (Member) commented Oct 7, 2019

IIRC, the source files for a pipeline are concatenated together into a single string, which is fed through the grammar to get an AST, the nodes of which are used to create the plugins and control structure. Because the AST contains line and column metadata from the concatenated source string and not the original files, there is no way presently for a plugin instance to be mapped back to a specific file source.

We would either need to change how the AST is built (e.g., build multiple independent ASTs and allow them to be concatenated after parsing to produce a net-same resulting AST with improved source metadata), or somehow map the line_and_column through a utility that is aware of which lines came from which files (e.g., if we know that lines 0-72 of the concatenated source came from file X and lines 73-97 from file Y, an error on line 77 of the concatenated source is actually line 4 of file Y).
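
A minimal sketch of that second option in Ruby (hypothetical names, using the zero-based numbers from the example above):

    # file X owns lines 0-72 of the merged source, file Y owns lines 73-97
    SOURCE_MAP = [
      { file: "X", offset: 0,  lines: 73 },
      { file: "Y", offset: 73, lines: 25 },
    ].freeze

    # map a line of the concatenated source back to [file, line-within-file]
    def remap(merged_line)
      entry = SOURCE_MAP.find { |e| merged_line < e[:offset] + e[:lines] }
      entry && [entry[:file], merged_line - entry[:offset]]
    end

    remap(77) #=> ["Y", 4]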

@jsvd (Member) commented Oct 7, 2019

I agree that providing this information means carrying new metadata from much earlier in the compilation process.

The goal of this suggestion is to give users a way to correlate logs and metrics from plugins with the actual plugins, without requiring the user to assign an id to them all. Linking auto-generated IDs to file+line+column seems the most user-friendly option, but I'm happy to go back to the drawing board at any time.

@andsel (Contributor, Author) commented Oct 7, 2019

Merging ASTs could be a mess, because there is no formal rule about how the pipeline config files may be sliced; we could have the filter section split across files like:

filter {
    geoip {
    }

and

    grok {}
}

At this point, remapping lines and columns after the global AST is created is the only viable solution.

@andsel (Contributor, Author) commented Oct 10, 2019

Added the source file and source line to the configuration reference for each metric node (Ruby pipeline).

@yaauie (Member) left a comment

I think we are going to have to find a way to get this metadata to plugin instances without routing through their initialize methods (e.g., a map on the pipeline indexed by plugin id?). Changing the method signatures of classes that are extended by plugins in external codebases (including customer-owned private plugins) is going to put us in a position where we cannot be confident that our changes do not break real-world plugins.
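
One way to read that suggestion, as a rough sketch (a hypothetical class, not code from this PR):

    # keep source metadata on the pipeline side, keyed by plugin id, so that
    # plugin initialize signatures stay untouched
    class PluginSourceMap
      def initialize
        @by_plugin_id = {} # plugin_id => [source_file, line, column]
      end

      def record(plugin_id, file, line, column)
        @by_plugin_id[plugin_id] = [file, line, column].freeze
      end

      def lookup(plugin_id)
        @by_plugin_id[plugin_id]
      end
    end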

Resolved review threads (now outdated) on:

  • logstash-core/lib/logstash/config/pipeline_config.rb
  • logstash-core/lib/logstash/outputs/base.rb
  • logstash-core/lib/logstash/codecs/delegator.rb
  • logstash-core/lib/logstash/config/mixin.rb
  • logstash-core/lib/logstash/compiler.rb
  • logstash-core/lib/logstash/pipeline.rb
@@ -44,5 +44,19 @@ def display_debug_information
logger.debug("Merged config")
logger.debug("\n\n#{config_string}")
end

def lookup_source_and_line(merged_config_line)
remaining_lines = merged_config_line
@yaauie (Member) commented:

Iterating over all of the parts for each lookup, and re-doing the work of counting the lines of each config part every time, seems like it would have performance side-effects.

I think we can do the bulk of the work just once, by caching a mapping of file names, offsets, and sizes, and then using that mapping at lookup time instead of going all the way to the config parts.

Below is untested, and may be subject to off-by-one errors

    # maps a line number in the merged config back to [part_id, line-within-part]
    def lookup_source_and_line(merged_line_number)
      source_map.each do |part_offset, part_lines, part_id|
        rebased_line_number = merged_line_number - part_offset
        next if rebased_line_number > part_lines

        return [part_id, rebased_line_number]
      end

      raise IndexError
    end

    private

    # lazily-built list of [offset, line-count, part-id] triples, one per config part
    def source_map
      @source_map ||= begin
        offset = 0
        source_map = []
        config_parts.each do |config_part|
          part_lines = config_part.getLinesCount()
          source_map << [offset, part_lines, config_part.id]
          offset += part_lines
        end
        source_map.freeze
      end
    end
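
With config parts of 73 and 25 lines (as in the earlier file X / file Y example), a lookup would go like this (subject to the same off-by-one caveat at part boundaries):

    lookup_source_and_line(77) #=> [id_of_second_part, 4]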

@andsel (Contributor, Author) replied:

Thanks for all the suggestions. I've also introduced a class to express the concept of segments, which makes it more readable.

@andsel force-pushed the feature/better_plugin_name branch from a0061d1 to 2e8cba9 on October 21, 2019
@andsel (Contributor, Author) commented Oct 22, 2019

Jenkins test this please

@andsel requested a review from yaauie on October 22, 2019
@yaauie (Member) left a comment

In general, I think this works as-is, but the complexity of routing everything through serialisation, and the changes to the delegators and to what they must extract from the config arguments, feel like they will be fragile.

I also do not immediately see a path forward that is less fragile. I'm going to take a stab at reworking this, and will circle back in the next day or so with a more concrete review and/or suggestions for a path forward.

context "when pipeline is constructed from multiple files" do
let (:pipeline_conf_string_part1) { 'input {
generator1
}' }
@yaauie (Member) commented:

TODO: test case where first file has trailing newline to ensure it doesn't offset the line numbers from subsequent files
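
A minimal sketch of such a case (hypothetical; the assertion checks the raw string arithmetic rather than the real pipeline helpers):

    context "when the first file ends with a trailing newline" do
      let(:part1) { "input {\n  generator {}\n}\n" } # 3 lines, newline-terminated
      let(:part2) { "output {\n  stdout {}\n}" }     # 3 more lines once merged

      it "keeps the line numbers of the second file stable" do
        merged = part1 + part2
        # line 5 of the merged config must still be line 2 of part2
        expect(merged.lines[4]).to eq(part2.lines[1])
      end
    end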

@@ -39,6 +39,10 @@ public String getText() {
return text;
}

public int getLinesCount() {
@andsel (Contributor, Author) commented:

This counter could be cached and recalculated only when the text field changes.

@andsel (Contributor, Author) commented Oct 31, 2019

I've created PR #11288, which contains the common parts and the Java execution pipeline logic.
The files not ported, which concern the Ruby execution, are:

  • logstash-core/lib/logstash/config/config_ast.rb
  • logstash-core/lib/logstash/pipeline.rb
  • logstash-core/spec/logstash/pipeline_spec.rb
  • logstash-core/lib/logstash/inputs/base.rb
  • logstash-core/spec/logstash/inputs/base_spec.rb

@andsel (Contributor, Author) commented Jan 15, 2020

Closed the new work in #11288 for the reason given in #11497 (comment).

@andsel closed this on Jan 15, 2020