agustingenoud/swoosh-algorithm

git-swoosh


Swoosh algorithm implementation in R. ETL match and merge functions for the entity resolution (ER) model.

Usage

match()

Finds duplicate identities that are not exact matches (for example, records that differ in email address, phone number, or use of abbreviations).
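As a rough illustration of what a non-exact match test might look like, here is a minimal sketch in R. It assumes each record is a named list with `name`, `email` and `phone` fields, and that two records match when they share a normalized name, an email, or a phone number. The helper names (`normalize_name`, `normalize_phone`, `records_match`) and the matching rule itself are hypothetical and are not taken from this repository.

```r
# Hypothetical helpers for illustration; not the repository's actual code.

# Lower-case a name and strip punctuation and repeated spaces,
# so "Smith, J." and "smith j" compare equal.
normalize_name <- function(x) {
  x <- tolower(x)
  x <- gsub("[^a-z0-9 ]", "", x)
  x <- gsub(" +", " ", x)
  trimws(x)
}

# Keep only digits so "(555) 010-1234" and "555.010.1234" compare equal.
normalize_phone <- function(x) gsub("[^0-9]", "", x)

# Assumed rule: records match when they share a normalized name,
# an email (case-insensitive), or a phone number.
# Missing values should be character(0), not "", to avoid false matches.
records_match <- function(a, b) {
  same_name  <- length(intersect(normalize_name(a$name),
                                 normalize_name(b$name))) > 0
  same_email <- length(intersect(tolower(a$email), tolower(b$email))) > 0
  same_phone <- length(intersect(normalize_phone(a$phone),
                                 normalize_phone(b$phone))) > 0
  same_name || same_email || same_phone
}

r1 <- list(name = "Smith, J.",  email = "js@example.com",  phone = "(555) 010-1234")
r2 <- list(name = "John Smith", email = "JS@example.com",  phone = character(0))
r3 <- list(name = "Ann Lee",    email = "ann@example.com", phone = "555 010 9999")

records_match(r1, r2)  # TRUE: the emails agree once case is ignored
records_match(r1, r3)  # FALSE: no shared name, email, or phone
```

The function is named `records_match` rather than `match` in this sketch so that it does not mask base R's `match()`.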

merge()

Merges records that match into a single record that represents the same entity.
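To show how matching and merging can fit together, the following is a minimal sketch in R of a loop in the style of R-Swoosh: each pending record is compared against the resolved set, and a merged record is put back in the queue because it may now match records it did not match before. All names (`records_match`, `merge_records`, `resolve_entities`), the record layout, and the simplified match rule are assumptions made for this example, not the repository's implementation.

```r
# Hypothetical names throughout; the repository's functions may differ.

# Simplified match rule (assumption): shared email or shared phone.
records_match <- function(a, b) {
  length(intersect(a$email, b$email)) > 0 ||
    length(intersect(a$phone, b$phone)) > 0
}

# Merge by taking the union of values per field, so no value is dropped.
merge_records <- function(a, b) {
  fields <- union(names(a), names(b))
  merged <- lapply(fields, function(f) union(a[[f]], b[[f]]))
  names(merged) <- fields
  merged
}

# Compare each pending record with the resolved set. On a match, remove the
# matched record from the resolved set and queue the merged record again.
resolve_entities <- function(records) {
  pending  <- records
  resolved <- list()
  while (length(pending) > 0) {
    current <- pending[[1]]
    pending <- pending[-1]
    hit <- 0
    for (i in seq_along(resolved)) {
      if (records_match(current, resolved[[i]])) {
        hit <- i
        break
      }
    }
    if (hit == 0) {
      resolved <- c(resolved, list(current))
    } else {
      pending  <- c(pending, list(merge_records(current, resolved[[hit]])))
      resolved <- resolved[-hit]
    }
  }
  resolved
}

records <- list(
  list(name = "J. Smith",    email = "js@example.com",   phone = "555-0101"),
  list(name = "John Smith",  email = "john@example.com", phone = "555-0101"),
  list(name = "Ann Lee",     email = "ann@example.com",  phone = "555-0199"),
  list(name = "Smith, John", email = "john@example.com", phone = character(0))
)

entities <- resolve_entities(records)
length(entities)  # 2: one entity for Ann Lee, one for the three Smith records
```

Note that the first and fourth records share neither an email nor a phone; they end up in the same entity only because the merge of the first two records carries both emails forward.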

> info & paper

About

A Swoosh algorithm implementation in R, based on the entity resolution (ER) model from Stanford.
