This is a dummy repo for practicing some of the tasks involved in principled data processing.
The datasets in this repo come from CompERBench: Complementing Entity Matching Benchmark Tasks. You'll see the same data again in the training-docs, in the demo record-linkage task.