Searching for "digital" finds "digit" #2484

Closed
eaquigley opened this issue Aug 28, 2015 · 5 comments

@eaquigley
Contributor

eaquigley commented Aug 28, 2015

Pointed out by someone on Twitter:

"try search "digital" and you find files labelled "digit". add phrase support."

In production, I searched for "digital" and narrowed the results to files. "Digital" did appear in the description of some files (on page 66; I was logged in), but not in any file name as far as I could see from a quick look. The first page of results (and many after it) had "Digit" in the title but not "Digital". We should look at the relevancy of search results: if the exact term appears anywhere in an object's metadata, that object should be among the first results displayed.
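
To make the expected ordering concrete, here is a minimal sketch in Python. It is not Dataverse's search code and it does not talk to Solr; it only post-processes a list of result titles so that any title containing the exact query term moves to the front. The function name `exact_first` and the sample titles are hypothetical, made up for illustration.

```python
import re

def exact_first(query, results):
    """Stable re-sort: results containing the exact query term come first.

    `results` is a list of strings (hypothetical titles or descriptions).
    The original engine order is preserved within each group.
    """
    term = query.lower()

    def has_exact(text):
        # Split on non-word characters and compare whole tokens only,
        # so "digit" does not count as an exact match for "digital".
        return term in re.findall(r"\w+", text.lower())

    # sorted() is stable; False sorts before True, so exact matches lead.
    return sorted(results, key=lambda text: not has_exact(text))

# Hypothetical result list in the order a stemmed search might return it.
titles = ["Digit span task", "Digit recall data", "Digital code archive"]
print(exact_first("digital", titles))
# ['Digital code archive', 'Digit span task', 'Digit recall data']
```

In a real search engine this ordering would more likely come from relevance scoring at query time than from re-sorting afterwards; the sketch only shows the behavior being asked for.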

@pdurbin
Member

pdurbin commented Sep 4, 2015

Ah, this tweet: https://twitter.com/jonintweet/status/637223429175947264

Screen Shot 2022-10-05 at 10 13 09 PM

Hmm. We worked a bit on search relevance in #1928. Maybe the comment from @tercer at #1928 (comment) should be copied to this issue, since that was for Beta 15 (but is still open in QA).

@mercecrosas mercecrosas modified the milestone: In Review Nov 30, 2015
@scolapasta scolapasta removed this from the Not Assigned to a Release milestone Jan 28, 2016
@pdurbin pdurbin added the User Role: Guest Anyone using the system, even without an account label Jul 4, 2017
@pdurbin
Member

pdurbin commented Oct 6, 2022

> try search "digital" and you find files labelled "digit"

Easy to reproduce at https://dataverse.harvard.edu/dataverse/andrewleigh/?q=digital (running 5.11.1):

Screen Shot 2022-10-05 at 10 16 28 PM

@pdurbin pdurbin added Type: Bug a defect and removed Type: Suggestion an idea labels Oct 6, 2022
@qqmyers
Member

qqmyers commented Oct 6, 2022

FWIW - there is phrase support - searching for "digital code" only returns the first hit above. However, stemming is enabled, so the various forms of a word (digital/digit, code/codes) are matched, even within a phrase.
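
The effect of stemming described above can be illustrated with a toy example in Python. This is only a sketch: the suffix rules in `SUFFIXES` and the names `toy_stem`, `analyze`, and `phrase_matches` are hypothetical, and the stemmer actually configured in Dataverse's Solr schema is not stated in this thread and will behave differently in detail. The point is only that once both the query and the indexed text are reduced to stems, "digital" and "digit" become the same token, so even a phrase query matches.

```python
# Toy suffix-stripping stemmer. NOT the analyzer Dataverse or Solr uses;
# the rule set below is an assumption chosen to reproduce this issue.
SUFFIXES = ("al", "s")

def toy_stem(word):
    """Lowercase the word and strip the first matching suffix,
    keeping at least three letters of the stem."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def analyze(text):
    """Whitespace-tokenize and stem, as applied to both index and query."""
    return [toy_stem(token) for token in text.split()]

def phrase_matches(query, title):
    """True if the stemmed query tokens occur as a contiguous run
    in the stemmed title tokens."""
    q, t = analyze(query), analyze(title)
    return any(t[i : i + len(q)] == q for i in range(len(t) - len(q) + 1))

print(toy_stem("digital"), toy_stem("digit"))                # digit digit
print(phrase_matches("digital code", "Digit codes survey"))  # True
print(phrase_matches("digital code", "code digit"))          # False
```

The last line shows that phrase order is still respected; only the word forms are collapsed.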

@pdurbin
Member

pdurbin commented Oct 6, 2022

Yes, stemming. Exactly. What we should probably do is first write some tests that make assertions about how our search works. Then try to fix this issue and hopefully not cause other problems. 😄

If anyone wants to pick this up, here is where we keep the search tests: https://github.com/IQSS/dataverse/blob/v5.12/src/test/java/edu/harvard/iq/dataverse/api/SearchIT.java

@pdurbin pdurbin changed the title Search: Complete Term in Metadata of Object but Isn't Appearing in First Set of Results Searching for "digital" finds "digit" Oct 8, 2023
@DS-INRAE DS-INRAE moved this to 🔍 Interest in Recherche Data Gouv Jul 10, 2024
@cmbz

cmbz commented Aug 20, 2024

To focus on the most important features and bugs, we are closing issues created before 2020 (version 5.0) that are not new feature requests with the label 'Type: Feature'.

If you created this issue and you feel the team should revisit this decision, please reopen the issue and leave a comment.

@cmbz cmbz closed this as completed Aug 20, 2024
@github-project-automation github-project-automation bot moved this from 🔍 Interest to Done in Recherche Data Gouv Aug 20, 2024
6 participants