fix: filter empty value in xlsx to improve vector similarity hit #422

saifeiLee · 2023-06-20T07:47:51Z

Exclude empty values during xlsx parsing to make hit-test results more accurate.
More generally, we should exclude any redundant text from the original dataset so that indexing and query results are more accurate.
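
For illustration, here is a minimal sketch of the idea, not the actual parser code in this PR. It assumes each row is rendered as `column: value` pairs before embedding; the function name `rows_to_text` and the separator format are hypothetical.

```python
from typing import Any, Iterable, List, Sequence


def rows_to_text(header: Sequence[str], rows: Iterable[Sequence[Any]]) -> List[str]:
    """Render each row as 'column: value' pairs, skipping empty cells.

    Hypothetical helper for illustration; the real parser may format rows differently.
    """
    documents = []
    for row in rows:
        pairs = [
            f"{name}: {value}"
            for name, value in zip(header, row)
            # Skip None and blank strings so the literal text "None"
            # never ends up in the text that gets embedded.
            if value is not None and str(value).strip() != ""
        ]
        if pairs:  # drop rows in which every cell is empty
            documents.append("; ".join(pairs))
    return documents


header = ["question", "answer", "note"]
rows = [
    ("What is Dify?", "An LLM app platform", None),
    (None, "", "  "),
    ("Count", 0, ""),
]
result = rows_to_text(header, rows)
```

Note that this sketch keeps `0` and `False`, whereas a plain truthiness check (`if value`) would drop them as well. Which behavior is preferable depends on whether numeric zeros carry meaning in the dataset.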

Fix: #388

takatost · 2023-06-21T03:19:14Z

Btw, how much of an impact does `None` have on dataset retrieval? Is there any comparison we can refer to?

saifeiLee added 3 commits June 20, 2023 15:41

fix: list render should contain unique key

409a335

fix: xlsx parser content should exclude 'None' value to improve vector similarity

a7fc65f

fix: xlsx parser should exclude falsy value

8cb51b0

takatost approved these changes Jun 21, 2023

JohnJyong approved these changes Jun 21, 2023

JohnJyong merged commit 23ef226 into langgenius:main Jun 21, 2023

Octivian pushed a commit to Octivian/dify that referenced this pull request Aug 8, 2023

fix: filter empty value in xlsx to improve vector similarity hit (langgenius#422)

97ec306

HuberyHuV1 pushed a commit to HuberyHuV1/dify that referenced this pull request Jul 22, 2024

fix: filter empty value in xlsx to improve vector similarity hit (langgenius#422)

bca013e