Normalize the search term #1239
Conversation
Kudos, SonarCloud Quality Gate passed! 0 Bugs. No Coverage information.
Codecov Report

@@            Coverage Diff            @@
##             master    #1239   +/-   ##
==========================================
  Coverage     69.07%   69.08%
  Complexity     1638     1638
==========================================
  Files            32       32
  Lines          4016     4017     +1
==========================================
+ Hits           2774     2775     +1
  Misses         1242     1242

Continue to review full report at Codecov.
I tested this and it works as it should. The only issue is that highlighting the matching parts (with bold text) in the autocomplete box does not work when the input string is decomposed (composed on the left, decomposed on the right). I tried to fix this by adding a normalization call in Skosmos/resource/js/docready.js (line 804 in 30ce5d0), but it didn't help; I think it would require changes within typeahead.js, as it seems to read the search string directly from the text field, so there is no easy opportunity to normalize it. Anyway, this is no big deal; there are other similar problems with missing highlights (e.g. in the case of accent folding), and these should be pretty rare cases anyway. I think it's reasonable to place the normalization call in ConceptSearchParameters.getSearchTerm(), as that method also performs other types of search term normalization, such as stripping whitespace.
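As a rough illustration (a hypothetical sketch, not the actual Skosmos or typeahead.js code), the kind of normalization call discussed here could look like this in JavaScript, assuming the query could be intercepted before typeahead.js uses it for matching:

```javascript
// Hypothetical helper (not the real docready.js code): normalize a
// user-typed query to NFC so that decomposed input compares equal to
// the composed (NFC) strings usually found in RDF data.
function normalizeQuery(query) {
  // String.prototype.normalize('NFC') composes base characters with
  // their combining marks, e.g. 'a' + U+0308 becomes U+00E4 'ä'.
  return query.normalize('NFC');
}

// A decomposed "ä" pasted from another system now matches NFC data:
const decomposed = 'la\u0308nsi'; // "länsi" with NFD 'ä'
const composed = 'l\u00e4nsi';    // "länsi" with NFC 'ä'
console.log(normalizeQuery(decomposed) === composed); // true
```

The catch described above is that typeahead.js reads the raw text-field value itself, so there is no convenient hook to apply such a function to the string it uses for highlighting.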
I think #1182 also updates typeahead. I can try to test it with the latest version and comment there if we have a follow-up issue for that, @osma 👍 (I will see if I have some text with surrogate characters, or just manually edit a dataset and upload it to my test Fuseki.)
You don't need very special data for this, just text with non-ASCII characters (e.g. åäöéñ) that is stored as composed Unicode characters (NFC), which is the usual case for RDF data. The problem arises when the user enters a search string that contains decomposed (NFD) characters, typically copied and pasted from some other system (in our case an Aleph ILS). This comment in the original issue shows how to trigger it with Linux command line tools and the KANTO/FINAF data set in Finto.
Thanks for spotting this defect, and thanks for the valuable input. :) I'll make a new issue for the autocomplete highlighting bug and call this one an incremental improvement on the situation (and I'll link this discussion there).
Fixes #1184