Add option to skip false positive checks during Fuzzy Deduplication (#199)

* Initial logic for buckets to cc input
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Update Fuzzy Duplicates to use no fp check codepath
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Allow no fp check in fuzzy dedup config
  Signed-off-by: Ayush Dattagupta <[email protected]>
* update semdedup test class name
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Rename class and method names
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Update tests
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Rename deprecated shuffle arg to shuffle_method
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Use Pairwise instead of Combinations
  Co-authored-by: yury-tokpanov <[email protected]>
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Fix bad rebase
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Fix type annotations and update __init__
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Add false positive % in warning
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Add CLI script for buckets to edges
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Apply suggestions from code review
  Co-authored-by: Sarah Yurick <[email protected]>
  Signed-off-by: Ayush Dattagupta <[email protected]>
* Update use of groupby collect with agg(list)
  Signed-off-by: Ayush Dattagupta <[email protected]>

---------

Signed-off-by: Ayush Dattagupta <[email protected]>
Co-authored-by: yury-tokpanov <[email protected]>
Co-authored-by: Sarah Yurick <[email protected]>
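The "buckets to edges" and "Use Pairwise instead of Combinations" items can be illustrated with a small pure-Python sketch. This is not the code from this commit: `buckets_to_edges` and `connected_components` are hypothetical helpers, and the input layout (rows of `(doc_id, bucket_id)`) is an assumption. It only shows why linking consecutive members of a bucket produces the same connected components as linking every pair, with far fewer edges.

```python
from collections import defaultdict
from itertools import combinations


def buckets_to_edges(doc_to_bucket, pairwise=True):
    """Turn (doc_id, bucket_id) rows into edges between docs sharing a bucket.

    Hypothetical helper for illustration only.
    pairwise=True links consecutive members (n - 1 edges per bucket);
    pairwise=False links every pair (n * (n - 1) / 2 edges per bucket).
    """
    members = defaultdict(list)
    for doc_id, bucket_id in doc_to_bucket:
        members[bucket_id].append(doc_id)

    edges = set()
    for docs in members.values():
        docs = sorted(set(docs))
        if pairwise:
            pairs = zip(docs, docs[1:])
        else:
            pairs = combinations(docs, 2)
        edges.update(pairs)
    return sorted(edges)


def connected_components(edges):
    """Group nodes into connected components with a small union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in edges:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[max(root_a, root_b)] = min(root_a, root_b)

    groups = defaultdict(list)
    for node in parent:
        groups[find(node)].append(node)
    return sorted(sorted(group) for group in groups.values())


# Assumed toy input: document "c" appears in two buckets and bridges them.
rows = [("a", 0), ("b", 0), ("c", 0), ("c", 1), ("d", 1), ("e", 2), ("f", 2)]
chain_edges = buckets_to_edges(rows, pairwise=True)
all_edges = buckets_to_edges(rows, pairwise=False)
```

Without a false positive check, every pair of documents in a bucket is treated as duplicates, so the connected components alone define the duplicate groups. The chained edges are enough to recover them.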
1 parent ad13c61, commit 982e7ec
Showing 6 changed files with 383 additions and 78 deletions.