Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increase HotPotQA lookup range to a full paragraph #96

Closed
wants to merge 2 commits into from

Conversation

albertbou92
Copy link
Contributor

@albertbou92 albertbou92 commented Oct 27, 2024

Currently, the behavior is to return the two sequences before and after the lookup term. However, this can cause issues in cases like "Milhouse was named after U.S. president Richard Milhous Nixon", where the important information is not returned if action is Lookup("Milhouse"). This PR proposes returning the full paragraph instead.

@dosubot dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Oct 27, 2024
Copy link
Collaborator

@sidnarayanan sidnarayanan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Ryan-Rhys is it important that we split on sentences instead of paragraphs? The period-based splitting was getting tripped up on things like "U.S." (which would hit the two-sentence window)

@jamesbraza
Copy link
Collaborator

Bonus points for unit test

@albertbou92
Copy link
Contributor Author

closing as this change is also included in #99

@jamesbraza jamesbraza deleted the hotpotqa_lookup_range branch November 8, 2024 19:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
size:XS This PR changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants