![Building a Scalable Record Linkage System with Apache Spark, Python 3, and Machine Learning - YouTube Building a Scalable Record Linkage System with Apache Spark, Python 3, and Machine Learning - YouTube](https://i.ytimg.com/vi/iQiaZKU3n0Y/hqdefault.jpg)
Building a Scalable Record Linkage System with Apache Spark, Python 3, and Machine Learning - YouTube
![Robin Linacre · Splink: A free source package for record linkage at scale using Apache Spark · SlidesLive Robin Linacre · Splink: A free source package for record linkage at scale using Apache Spark · SlidesLive](https://cdn.slideslive.com/data/presentations/38932351/slideslive_niander-assis_pedro-o-s-vazdemelo_renato-assuncao_stop-the-clock-are-timeout-effects-real__small.jpg?1599164702)
Robin Linacre · Splink: A free source package for record linkage at scale using Apache Spark · SlidesLive
GitHub - moj-analytical-services/splink: Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
GitHub - ropeladder/record-linkage-resources: Resources for tackling record linkage / deduplication / data matching problems
![Robin Linacre on Twitter: "We have recently released splink_comparison_viewer on PyPi. It produces an interactive .html dashboard to help you rapidly understand and quality assure the results of record linkage. Demo using Robin Linacre on Twitter: "We have recently released splink_comparison_viewer on PyPi. It produces an interactive .html dashboard to help you rapidly understand and quality assure the results of record linkage. Demo using](https://pbs.twimg.com/tweet_video_thumb/FE9K4ThXEA4OITY.jpg)
Robin Linacre on Twitter: "We have recently released splink_comparison_viewer on PyPi. It produces an interactive .html dashboard to help you rapidly understand and quality assure the results of record linkage. Demo using
GitHub - moj-analytical-services/splink: Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
![GitHub - moj-analytical-services/splink_cluster_studio: Create interactive dashboards to visualise and analyse the outputs of data linking GitHub - moj-analytical-services/splink_cluster_studio: Create interactive dashboards to visualise and analyse the outputs of data linking](https://user-images.githubusercontent.com/2608005/138103796-1d1e6795-d3f9-4518-bb25-ffc98ef436b4.png)