dedupeio / dedupe
Stars: 4.1k
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Topics: python, clustering, dedupe, record-linkage, python-library, entity-resolution, datamade, dedupe-library, de-duplicating
Updated Jul 8, 2024
Language: Python

datamade / data-making-guidelines
Stars: 286
📘 Making Data, the DataMade Way
Topics: etl, makefile, principles, datamade
Updated Feb 3, 2021
Language: HTML