Add cuvs_bench python folder, config files and constraints #244

Merged
merged 4 commits into from
Jul 31, 2024

dantegd
Member

@dantegd dantegd commented Jul 23, 2024

No description provided.

@dantegd dantegd added feature request New feature or request non-breaking Introduces a non-breaking change labels Jul 23, 2024
@dantegd dantegd self-assigned this Jul 23, 2024
Member

@cjnolet cjnolet left a comment

My comments are somewhat minor, but relevant. I think we want to keep the params separate from the datasets. I'm also okay with removing the files that tried to combine everything if they are only being left around for reference.

base_file: nytimes-256-inner/base.fbin
query_file: nytimes-256-inner/query.fbin
groundtruth_neighbors_file: nytimes-256-inner/groundtruth.neighbors.ibin
distance: euclidean

Probably my fault, but there are several places in the file that say Euclidean when they should say inner.

file: index/deep-image-96-inner/raft_ivf_pq/dimpq64-cluster1024-float-half
name: raft_ivf_pq.dimpq64-cluster1024-float-half
search_params:
- internalDistanceDtype: float

Does this work like this? Is the intention here to group these 3 args together?

@@ -0,0 +1,26 @@
dataset:

I don't think we want to couple the param sweeps directly to the datasets. The whole reason for separating them is that the sweeps should generalize across all datasets (and because it's more manageable to specify the sweeps once, instead of per dataset).
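
A minimal sketch of the separation described above, not the actual cuvs_bench implementation. The names `DATASETS`, `SWEEPS`, `expand_sweep`, and `expand_runs`, as well as the `nlist` and `pq_dim` parameter names and values, are hypothetical and chosen only for illustration. The sweep is declared once per algorithm and crossed with every dataset when the runs are generated:

```python
from itertools import product

# Dataset names as they appear in this PR; the list itself is illustrative.
DATASETS = ["nytimes-256-inner", "deep-image-96-inner"]

# Hypothetical sweep, declared once per algorithm and independent of any dataset.
SWEEPS = {
    "raft_ivf_pq": {
        "build": {"nlist": [1024, 2048], "pq_dim": [32, 64]},
    },
}


def expand_sweep(grid):
    """Expand a {param: [values]} grid into a list of concrete parameter dicts."""
    keys = sorted(grid)
    return [dict(zip(keys, values)) for values in product(*(grid[k] for k in keys))]


def expand_runs(datasets, sweeps):
    """Cross every dataset with every build-parameter combination of every algorithm."""
    runs = []
    for dataset in datasets:
        for algo, groups in sweeps.items():
            for build in expand_sweep(groups["build"]):
                runs.append({"dataset": dataset, "algo": algo, "build_param": build})
    return runs


runs = expand_runs(DATASETS, SWEEPS)
```

Under this layout, adding a dataset means appending one entry to `DATASETS`; the sweep definition does not change.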

@cjnolet cjnolet marked this pull request as ready for review July 30, 2024 23:37
@cjnolet cjnolet requested a review from a team as a code owner July 30, 2024 23:37
@cjnolet
Member

cjnolet commented Jul 30, 2024

/merge

@rapids-bot rapids-bot bot merged commit 812fffd into rapidsai:branch-24.08 Jul 31, 2024
54 checks passed
divyegala pushed a commit to divyegala/cuvs that referenced this pull request Jul 31, 2024
Labels
feature request New feature or request non-breaking Introduces a non-breaking change Python
2 participants