From 152ebbe6a0c8fb55d042bba2bd77631192bab17d Mon Sep 17 00:00:00 2001 From: Simon Eskildsen Date: Tue, 21 Jun 2022 13:22:21 -0400 Subject: [PATCH] readme: add performance graph --- README.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/README.md b/README.md index 8e92f3d7..39b396b8 100644 --- a/README.md +++ b/README.md @@ -24,6 +24,13 @@ there are few/no changes, but is able to output each differing row! By pushing the compute into the databases, it's _much_ faster than querying for and comparing every row. +![Performance for 100M rows](https://user-images.githubusercontent.com/97400/174860361-35158d2b-0cad-4089-be66-8bf467058387.png) + +**†:** The implementation for downloading all rows that `data-diff` and +`count(*)` is compared to is not optimal. It is a single Python multi-threaded +process. The performance is fairly driver-specific, e.g. Postgres' performs 10x +better than MySQL. + ## Table of Contents - [Common use-cases](#common-use-cases)