feat(clean): add clean_duplication() function
Implements the clean_duplication() function for clustering and standardizing duplicate values.
ryanwdale committed Mar 31, 2021
1 parent 7a8af96 commit 98ff38d
Showing 6 changed files with 1,177 additions and 408 deletions.
2 changes: 2 additions & 0 deletions dataprep/clean/__init__.py
@@ -21,6 +21,7 @@

from .clean_date import clean_date, validate_date

from .clean_duplication import clean_duplication

__all__ = [
"clean_lat_long",
@@ -40,4 +41,5 @@
"validate_address",
"clean_date",
"validate_date",
"clean_duplication",
]
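
The commit message describes clean_duplication() as clustering and standardizing duplicate values. The sketch below illustrates that general idea with a simple fingerprint (key-collision) approach using only the standard library. It is not the dataprep implementation: the helper names `fingerprint`, `cluster_duplicates`, and `standardize` are hypothetical, and the actual function's signature and clustering methods are not shown in this diff.

```python
import string
import unicodedata
from collections import Counter, defaultdict


def fingerprint(value: str) -> str:
    """Hypothetical helper: build a normalized key for a string."""
    # Decompose accented characters, then drop anything that is not ASCII.
    ascii_value = (
        unicodedata.normalize("NFKD", value)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    # Lowercase and strip punctuation.
    cleaned = ascii_value.lower().translate(
        str.maketrans("", "", string.punctuation)
    )
    # Sort the unique tokens so word order does not matter.
    return " ".join(sorted(set(cleaned.split())))


def cluster_duplicates(values):
    """Group values that share a fingerprint; keep groups with >1 distinct spelling."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [members for members in groups.values() if len(set(members)) > 1]


def standardize(values):
    """Replace each value with the most frequent spelling in its group."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    # Counter.most_common breaks ties by first occurrence.
    canonical = {
        key: Counter(members).most_common(1)[0][0]
        for key, members in groups.items()
    }
    return [canonical[fingerprint(value)] for value in values]


cities = ["New York", "new york", "New York", "York, New", "Boston"]
clusters = cluster_duplicates(cities)
cleaned = standardize(cities)
```

Here the four spellings of "New York" collapse onto the same key, so they form one cluster and are all rewritten to the most frequent spelling, while "Boston" is left alone. Picking the most frequent member as the canonical value is one possible design choice; a real tool may let the user choose the replacement instead.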