
[Logs UI] Shorten the logs ML job ID prefixes #47477

Closed
weltenwort opened this issue Oct 7, 2019 · 17 comments · Fixed by #168234
Labels: Feature:Logs UI (Logs UI feature) · impact:high (Addressing this issue will have a high level of impact on the quality/strength of our product) · needs-refinement (A reason and acceptance criteria need to be defined for this issue) · Team:obs-ux-logs (Observability Logs User Experience Team)

Comments

@weltenwort
Member

weltenwort commented Oct 7, 2019

Summary

The static parts of the log rate job IDs should be as short as possible.

Rationale

The log rate jobs are assigned human-readable IDs that contain the static parts as well as the Kibana space and logs source IDs: `kibana-logs-ui-${spaceId}-${sourceId}-log-entry-rate`. Since the Kibana space ID is set by the user, there is a risk of exceeding the 64-character limit on the length of the ML job ID. Shortening the static parts reduces that risk by leaving more room for the user-defined space ID.

In the long term, the space awareness of ML jobs will remove the need to include the space ID in the job ID.

Because the ID is used to find the jobs belonging to a source configuration, this will be a breaking change.
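
To make the failure mode concrete, here is a minimal TypeScript sketch of the current ID construction; the space and source IDs are made-up examples, not values from the issue:

```typescript
// Simplified reconstruction of how the job ID is assembled today.
const ML_JOB_ID_MAX_LENGTH = 64; // ML rejects job IDs longer than this

const buildLogEntryRateJobId = (spaceId: string, sourceId: string) =>
  `kibana-logs-ui-${spaceId}-${sourceId}-log-entry-rate`;

// A perfectly reasonable space name already exceeds the budget:
const jobId = buildLogEntryRateJobId('observability-production-eu-west', 'default');
console.log(jobId.length); // 70 > 64, so the ML job cannot be created
```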

Acceptance criteria

  • The ID assigned to jobs is shortened to `logs-${spaceId}-${sourceId}-rate`.
  • The ID assigned to a datafeed matches the job ID.
  • Jobs that were created before this change are still handled correctly.
weltenwort added the Feature:Logs UI, Team:Infra Monitoring UI - DEPRECATED, and v7.5.0 labels Oct 7, 2019
@elasticmachine
Contributor

Pinging @elastic/infra-logs-ui (Team:infra-logs-ui)

@jasonrhodes
Member

This will no longer be a big deal when we can drop space ID from the Job ID, right? cc @weltenwort

@weltenwort
Member Author

weltenwort commented Jun 12, 2020

True: if there is a migration in place to move already existing jobs into their respective spaces, we should remove the space ID as part of that migration.

@jasonrhodes
Member

@elastic/machine-learning What's the ETA on ML space awareness?

@jgowdyelastic
Member

ML job space awareness is planned for 7.11

@sophiec20
Contributor

Meta ticket for ref #64172

jasonrhodes added the impact:low label Jul 29, 2021
@jasonrhodes
Member

Refinement update: we no longer need to put the space ID in the index name, but we need to make sure we understand how to query with backwards compatibility if we remove it.

jasonrhodes added the needs-refinement label Jul 29, 2021
smith removed the needs-refinement label Jul 26, 2022
miltonhultgren self-assigned this Sep 6, 2022
@miltonhultgren
Contributor

miltonhultgren commented Sep 7, 2022

After digging into the code for this I have some questions.
Maybe @elastic/machine-learning are best suited to answer.

  1. What needs to be in the job ID to make it unique?
    Today we include a prefix, the space ID, the logs/metrics source configuration ID, and the job name.

  2. Can I somehow migrate old jobs so that their old IDs are renamed to use the new pattern?
    That would make the UI code a lot simpler, since otherwise I need to check for both the new and the old format in every place where we refer to a job by its ID.

  3. Are there any general guidelines for what to put into the group field when registering a job?
    For the log rate and categorisation jobs we set the group to "logs-ui" (meaning the app that created them), but for our metric jobs we grouped them by "metrics" (the type of data they use) and "host"/"k8s" (the dataset they work on).

@miltonhultgren
Contributor

Gentle re-ping; I suspect my edit didn't fire the notification, @elastic/machine-learning :)

@sophiec20
Contributor

sophiec20 commented Sep 12, 2022

What needs to be in the job ID to make it unique?

This is a question for the logs team. ML jobs can be shared between spaces, and I don't know what source_id is.

It's a good time to consider whether the Logs UI should stick with only ever being able to link to one job (with a hard-coded, fixed ID). It is reasonable to think that customers would want the flexibility to see results from multiple jobs.

Alternatively (and imho probably more flexibly), if sticking to one job, the Logs UI could allow the user to override the job_id in advanced settings.

You might want to add a version number in the job_id. See the next question.

Can I somehow migrate old jobs so that their old ids are renamed to use the new pattern?

The job_id cannot be changed; jobs cannot be renamed. You would need to clone them and start new jobs. Other solutions have done post-upgrade checks and offered users the option to upgrade their ML jobs via app banners (where the upgrade creates a new job, stops the old one, and points to the new one).
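
A rough sketch of that clone-and-recreate flow, assuming the @elastic/elasticsearch v8 JavaScript client; a real implementation would copy an explicit whitelist of fields and clone the datafeed as well:

```typescript
import { Client } from '@elastic/elasticsearch';

const client = new Client({ node: 'http://localhost:9200' });

// "Rename" an ML job by re-creating its config under a new ID.
// The old job and its results are left in place.
async function cloneJob(oldJobId: string, newJobId: string) {
  const { jobs } = await client.ml.getJobs({ job_id: oldJobId });
  // Drop server-assigned fields before re-creating; a production version
  // would whitelist the cloneable fields instead of spreading the rest.
  const { job_id, create_time, job_version, model_snapshot_id, ...config } =
    jobs[0] as Record<string, any>;
  await client.ml.putJob({ job_id: newJobId, ...config } as any);
}
```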

Are there any general guidelines for what to put into the group field when registering a job?

The "group" field mainly allows filtering in the ML UI -- this allows users to view results from multiple jobs together and to manage multiple jobs together .. e.g. bulk stop. As a general guideline, define fewer groups ... otherwise the filtering causes a lot of groups of 1 which isn't the best user experience as job_id is already unique.

Also, in the job config, include "custom_settings.managed": true. This gives the job a badge in the ML UI, and there are warnings if you try to delete or edit it. This is already used by Metrics.
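
Putting both suggestions together, a sketch of the relevant slice of a job config; the ID and group values here are illustrative, not decided in this issue:

```typescript
// Only the fields discussed above; analysis_config, data_description, etc. omitted.
const jobConfig = {
  job_id: 'logs-default-default-rate', // hypothetical shortened ID
  groups: ['logs'], // few, broad groups keep ML UI filtering useful
  custom_settings: {
    managed: true, // "managed" badge in the ML UI plus delete/edit warnings
  },
};
```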

@miltonhultgren miltonhultgren removed their assignment Sep 13, 2022
@miltonhultgren
Contributor

@smith I think this will need more thought before we proceed.

@smith smith added the needs-refinement A reason and acceptance criteria need to be defined for this issue label Sep 13, 2022
@smith
Contributor

smith commented Sep 13, 2022

Thanks for looking @miltonhultgren. I'll put this back in the backlog for now.

@sophiec20
Contributor

Sorry, I did not mean to put you off... imho changing to `logs-${spaceId}-${sourceId}-rate` would be a useful incremental change.

@smith
Contributor

smith commented Sep 13, 2022

Sorry, I did not mean to put you off... imho changing to `logs-${spaceId}-${sourceId}-rate` would be a useful incremental change.

`sourceId` is usually `default`, but `spaceId` could be anything, so we would still face the problem of easily exceeding the 64-character limit. IMO this is only worth picking up right now if we can prevent exceeding the limit without too much effort.

Danouchka added the impact:high label and removed the impact:low label Oct 6, 2022
@Danouchka

Experiencing the same issue on 8.4.3. Just a question: why don't we allow job names of 255 chars?

@adjenks

adjenks commented Apr 14, 2023

I recently ran into this issue creating an ML job and seeing "The job id cannot contain more than 64 characters."
That led me here: https://discuss.elastic.co/t/kibana-ml-the-job-id-cannot-contain-more-than-64-characters/303047
Which led me here: #112938
Which led me here.
I look forward to a fix.
Thank you and good luck.

miltonhultgren self-assigned this Sep 5, 2023
miltonhultgren added a commit that referenced this issue Oct 5, 2023
While working on #47477, I found that attempting to re-create an ML job faces a 404 because it uses an endpoint that has been removed or changed.

This PR updates the code to use the newer endpoint to find which tasks are blocking in the ML system (like job deletion) and changes the types to match the new API.
kibanamachine pushed a commit to kibanamachine/kibana that referenced this issue Oct 5, 2023
(cherry picked from commit 48b66d7)
kibanamachine added a commit that referenced this issue Oct 5, 2023
…168075)

# Backport

This will backport the following commits from `main` to `8.11`:
- [[infra] Use correct ML API to query blocking tasks (#167779)](#167779)


Co-authored-by: Milton Hultgren <milton.hultgren@elastic.co>
miltonhultgren added a commit to miltonhultgren/kibana that referenced this issue Oct 6, 2023
miltonhultgren changed the title from "[Logs UI] Shorten the log rate job ID prefixes" to "[Logs UI] Shorten the logs ML job ID prefixes" Oct 13, 2023
dej611 pushed a commit to dej611/kibana that referenced this issue Oct 17, 2023
gbamparop added the Team:obs-ux-logs label and removed the Team:Infra Monitoring UI - DEPRECATED label Nov 9, 2023
@elasticmachine
Contributor

Pinging @elastic/obs-ux-logs-team (Team:obs-ux-logs)

botelastic bot added and removed the needs-team label Nov 9, 2023
pull bot pushed a commit to MrLiukang/kibana that referenced this issue Nov 19, 2023
Closes elastic#47477

### Summary

ML job IDs have a limit of 64 characters. For the log ML jobs we add the string `kibana-logs-ui` plus the space and log view IDs as a prefix to the job names (`log-entry-rate` and `log-entry-categories-count`), which can quickly eat up the 64-character limit (even our own Stack Monitoring log view hits it). This prevents users from creating ML jobs; renaming a space or log view is hard, and the limit is not hinted at during space creation (because the two are unrelated in some sense).

To achieve a more stable ID length, this PR introduces a new prefix format: a UUID v5 seeded with the space and log view IDs (with the dashes removed so the categorization job ID still fits within the size limit).
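
A minimal sketch of that hashing idea using the `uuid` npm package; the namespace UUID below is made up for illustration, not the one the PR uses:

```typescript
import { v5 as uuidv5 } from 'uuid';

// Hypothetical namespace; v5 UUIDs are deterministic for a given name + namespace.
const LOGS_ML_NAMESPACE = '2f6c94f7-7b4d-4f55-9c3b-3a64e7b1a001';

const jobPrefix = (spaceId: string, logViewId: string) =>
  uuidv5(`${spaceId}-${logViewId}`, LOGS_ML_NAMESPACE).replace(/-/g, '');

// Always 32 hex characters, no matter how long the space or log view IDs are:
const jobId = `logs-${jobPrefix('my-very-long-space-name', 'default')}-log-entry-rate`;
```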

Since there is no technical difference between the new and old format, this PR makes an effort to keep supporting the old format and to allow migration of old jobs as needed. The old jobs work and may contain important data, so the user should not feel forced to migrate.

The main addition is a small new API that checks whether any ML jobs are available and which ID format they use, so that the app can request data accordingly; the existing APIs have been modified to take the ID format into account (except during creation, which always uses the new format).
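
The shape of that resolution step might look roughly like this (the function and type names are hypothetical, not taken from the PR):

```typescript
type IdFormat = 'legacy' | 'hashed';

// Decide which naming scheme to use when requesting job data.
async function resolveIdFormat(
  jobExists: (jobId: string) => Promise<boolean>,
  legacyJobId: string,
  hashedJobId: string
): Promise<IdFormat> {
  // Prefer the new format if such a job exists (e.g. after re-creation).
  if (await jobExists(hashedJobId)) return 'hashed';
  if (await jobExists(legacyJobId)) return 'legacy';
  return 'hashed'; // no job yet; creation always uses the new format
}
```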

The solution applied is not ideal. It simply passes the ID format, along with the space and log view IDs, to each point where the ID is re-created (of which there are several). The ideal solution would be to store the job data in the store and pass that around instead, but that seemed like a considerably larger effort. This PR does introduce some functional tests around the ML job creation process, so such a future refactor should be a bit safer than before.

### How to test

* Start from `main`
* Start Elasticsearch
* Start Kibana
* Load the Sample web logs (Kibana home -> Try sample data -> Other sample data sets)
* Visit the Anomalies page in the Logs UI
* Set up either or both of the two ML jobs and wait for some results to show up
* Check out the PR branch
* Visit the Anomalies page and verify that it still works (requests go out to resolve the ID format, which should return 'legacy' and then load the data for the legacy job)
* Recreate the ML job and verify that the new job works and results still show up (new requests should go out with the new format, which may be a mixed mode if you have two jobs and only migrate one of them)

---------

Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>