[CI] Language analyzer docs failure #30557
Comments
Pinging @elastic/es-search-aggs
The first failure is related to the Italian analyzer:
The second is the same input token, but relates to the Catalan analyzer:
It's a bit tricky to debug this since the context is missing from the error, but I think I managed to isolate the part where the two analyzer outputs begin to differ. I can reproduce it in Kibana; I'm not sure whether this copy/paste action preserves all the "hidden" characters that the test string contains, but anyway:
The first analyzes to:
The second to
So the original Italian analyzer seems to swallow one more token. This part is surrounded by many characters that seem to get dropped during analysis, which also makes this hard to debug.
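To narrow down where two analyzer outputs begin to differ, the token lists can be compared position by position. This is a minimal sketch: the helper name `first_divergence` and the sample tokens are hypothetical, and it assumes the tokens have already been extracted from each analyzer's response as plain strings.

```python
from itertools import zip_longest


def first_divergence(tokens_a, tokens_b):
    """Return (index, token_a, token_b) for the first position where the
    two token lists differ, or None if they are identical.

    A missing token (one list is shorter) is reported as None.
    """
    for index, (a, b) in enumerate(zip_longest(tokens_a, tokens_b)):
        if a != b:
            return index, a, b
    return None


# Hypothetical token lists, not taken from the failing test.
built_in = ["uno", "due", "quattro"]
rebuilt = ["uno", "due", "tre", "quattro"]
print(first_divergence(built_in, rebuilt))
```

Reporting the index makes it easier to cut the input string down to the few characters around the first mismatch.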
I think it's caused by the elision filter, which is case insensitive in the built-in analyzer but not in the rebuilt one. Adding
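To illustrate why case sensitivity in an elision filter changes the token stream, here is a toy model in Python. It is not the Lucene implementation: the function `elide`, the `ignore_case` flag, and the small article set are assumptions made for this sketch, and only the plain ASCII apostrophe is handled.

```python
# Illustrative subset of elided articles, not the full Italian list.
ARTICLES = {"l", "dell", "un"}


def elide(token, articles=ARTICLES, ignore_case=False):
    """Strip a leading elided article such as "l'" from a token.

    With ignore_case=False, only a prefix that matches an article
    exactly (lowercase here) is removed.
    """
    head, sep, tail = token.partition("'")
    if not sep or not tail:
        return token  # no apostrophe, or nothing after it
    key = head.lower() if ignore_case else head
    return tail if key in articles else token


print(elide("l'albero"))                    # lowercase article is stripped
print(elide("L'albero"))                    # case-sensitive: left unchanged
print(elide("L'albero", ignore_case=True))  # case-insensitive: stripped
```

In this model, a capitalized article survives the case-sensitive filter, so later filters in the chain see a different token than they would after a case-insensitive one.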
This commit fixes the docs failure on the rebuilt language analyzers when they are compared to the built-in analyzers. The `elision` filters used by the rebuilt language analyzers should be case insensitive to match the definition of the prebuilt analyzers. Closes elastic#30557
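As a rough sketch of what a case-insensitive `elision` filter definition can look like, the snippet below builds index settings as a Python dictionary and prints them as JSON. It assumes the `articles_case` parameter of the Elasticsearch `elision` token filter; the filter name `italian_elision` and the shortened article list are illustrative and are not taken from the commit.

```python
import json

# Hypothetical filter name and a shortened article list, for illustration only.
italian_elision = {
    "type": "elision",
    "articles": ["c", "l", "all", "dall", "dell", "nell", "sull", "un"],
    "articles_case": True,  # match articles regardless of case
}

settings = {"analysis": {"filter": {"italian_elision": italian_elision}}}

# The JSON output can be used as the "settings" body when creating an index.
print(json.dumps(settings, indent=2))
```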
These both reproduce: