Commit

Update README.md
utils folder is one level up from the python folder
shahrokhDaijavad authored Nov 19, 2024
1 parent 9c82fe0 commit ed4e9c1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion transforms/universal/fdedup/python/README.md
@@ -39,7 +39,7 @@ shingles.
`num_minhashes_per_band` minhashes. For each document, generate a unique signature for every band.

The values for `num_bands` and `num_minhashes_per_band` determine the likelihood that documents with a certain Jaccard
-similarity will be marked as duplicates. A Jupyter notebook in the [utils](utils) folder generates a graph of this
+similarity will be marked as duplicates. A Jupyter notebook in the [utils](../utils) folder generates a graph of this
probability function, helping users explore how different settings for `num_bands` and `num_minhashes_per_band` impact
the deduplication process.
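
For readers without the notebook at hand, the relationship described above can be sketched in a few lines. This is a minimal illustration that assumes the standard MinHash LSH banding formula, where two documents with Jaccard similarity `s` match in at least one band with probability `1 - (1 - s^r)^b`, with `b = num_bands` and `r = num_minhashes_per_band`. The README excerpt does not state the formula, and the function name `duplicate_probability` and the sample parameter values are hypothetical, chosen only for illustration.

```python
def duplicate_probability(jaccard: float, num_bands: int, num_minhashes_per_band: int) -> float:
    """Probability that two documents agree on at least one full band.

    Assumes the textbook MinHash LSH banding model; this is not taken
    from the fdedup source code.
    """
    if not 0.0 <= jaccard <= 1.0:
        raise ValueError("jaccard must be between 0 and 1")
    # Chance that every minhash in a single band matches.
    band_match = jaccard ** num_minhashes_per_band
    # Chance that at least one of the bands matches in full.
    return 1.0 - (1.0 - band_match) ** num_bands


if __name__ == "__main__":
    # Illustrative settings only, not necessarily the transform's defaults.
    for s in (0.5, 0.7, 0.8, 0.9):
        p = duplicate_probability(s, num_bands=14, num_minhashes_per_band=8)
        print(f"jaccard={s:.1f} -> probability={p:.3f}")
```

Under this model, raising `num_minhashes_per_band` makes each band harder to match and so pushes the curve toward higher similarities, while raising `num_bands` gives more chances to match and pushes it toward lower ones.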
