Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add max_token_length setting to the CharGroupTokenizer #56860

Merged
merged 7 commits into from
May 20, 2020
Merged

Add max_token_length setting to the CharGroupTokenizer #56860

merged 7 commits into from
May 20, 2020

Conversation

ADBalici
Copy link
Contributor

Adds max_token_length option to the CharGroupTokenizer.
Updates documentation as well to reflect the changes.

Solves #56676

@cbuescher cbuescher self-assigned this May 18, 2020
@cbuescher cbuescher added the :Search Relevance/Analysis How text is split into tokens label May 18, 2020
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-search (:Search/Analysis)

@elasticmachine elasticmachine added the Team:Search Meta label for search team label May 18, 2020
Copy link
Member

@cbuescher cbuescher left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @ADBalici, thanks for opening this PR, looks great. I left one super small comment but I'd also merge this if you don't get to changing it. Let me know, I will need to run out CI tests on this before merging in any case.

@cbuescher
Copy link
Member

@elasticmachine test this please

@ADBalici
Copy link
Contributor Author

Thanks for reviewing this @cbuescher
All points that you raised were addressed. 😄

@cbuescher
Copy link
Member

@elasticmachine test this please

@cbuescher
Copy link
Member

@ADBalici I took the liberty to change another checkstyle issue that appeared when running the test, hope you don't mind
@elasticmachine test this please

@cbuescher
Copy link
Member

@elasticmachine update branch

@cbuescher
Copy link
Member

@elasticmachine test this please

@cbuescher cbuescher merged commit da31b4b into elastic:master May 20, 2020
cbuescher pushed a commit that referenced this pull request May 20, 2020
Adds `max_token_length` option to the CharGroupTokenizer.
Updates documentation as well to reflect the changes.

Closes #56676
@cbuescher
Copy link
Member

@ADBalici test are all good now, I merged to master and the upcoming 7.9 branch. Thanks for your contribution!

russcam added a commit to elastic/elasticsearch-net that referenced this pull request Jul 31, 2020
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Aug 4, 2020
github-actions bot pushed a commit to elastic/elasticsearch-net that referenced this pull request Aug 4, 2020
github-actions bot pushed a commit to elastic/elasticsearch-net that referenced this pull request Aug 4, 2020
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Aug 4, 2020
russcam added a commit to elastic/elasticsearch-net that referenced this pull request Aug 4, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>enhancement :Search Relevance/Analysis How text is split into tokens Team:Search Meta label for search team v7.9.0 v8.0.0-alpha1
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants