Roast topics
Find topics
Roast it!
Roast topics
Find topics
Find it!
Login
From:
Your Site's RSS Feed
(Uncensored)
subscribe
Super-fast deduplication of large datasets using Splink and DuckDB
https://www.robinlinacre.com/fast_deduplication/
links
backlinks
Evaluating 1 billion record comparisons to deduplicate 7 million records in two minutes