tidb-lightning
lightning import failed when tikv net delay 100ms
Bug Report
Please answer these questions before submitting your issue. Thanks!
- What did you do? We simulated the 100ms network delay of a "two locations, three data centers" deployment with remote machine rooms, and imported a CSV file with the lightning tool (importer mode).
- What did you expect to see?
- What did you see instead?
- What version of TiDB are you using (tidb-server -V or run select tidb_version(); on TiDB)?
- Which tool are you using?
- What version of the tool are you using (pump -V or tidb-lightning -V or syncer -V)?
Hi, lightning's repo is https://github.com/pingcap/tidb-lightning/
@King-Dylan @lance6716 please let us transfer the issue rather than close and reopen a new one.
Would this design be better: once the majority of replicas have been imported successfully, lightning returns success, so that the network of the remote data center does not affect the import speed?
According to the other logs, the error occurred because the upload timed out (no response within 30 seconds, it seems), and it failed 5 times in a row.
An internal test shows that, with 100ms latency, some of the ranges may take up to 40 seconds to upload.
This may cause the importer backend to fail, because it sets a 30s timeout for the upload RPC. (As the gRPC site says, the timeout is the longest time an RPC can stay alive, but I found no more detailed docs.) As the latency grows, the time cost may NEVER drop below 30s, so the upload eventually exhausts all of its retries.
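To illustrate why retrying does not help here, below is a minimal sketch of a retry loop with a fixed per-attempt timeout. It is not the importer's actual code: the function name `upload_with_retries` and its parameters are hypothetical, and the defaults (30s timeout, 5 attempts) are only taken from the numbers mentioned in this thread. No real RPC is made; each attempt is represented by the number of seconds it would need.

```python
def upload_with_retries(attempt_durations, timeout_s=30.0, max_attempts=5):
    """Return the 1-based index of the first attempt that finishes in time.

    attempt_durations: seconds each successive attempt would need (simulated).
    timeout_s: per-attempt deadline; an attempt needing longer is cancelled.
    max_attempts: how many attempts are made before giving up.
    Returns None when every attempt hits the deadline.
    """
    for attempt, duration in enumerate(attempt_durations[:max_attempts], start=1):
        if duration <= timeout_s:
            return attempt  # this attempt completed before the deadline
        # Otherwise the attempt is cancelled at timeout_s and all of its
        # progress is thrown away; the next attempt starts from scratch.
    return None


# A range that always needs 40s can never pass a 30s deadline,
# no matter how many times it is retried.
always_slow = upload_with_retries([40.0] * 5)

# Retries only help when the duration varies and sometimes drops below 30s.
sometimes_fast = upload_with_retries([40.0, 35.0, 20.0])
```

The point of the sketch is that a retry restarts the whole upload, so a deadline shorter than the steady-state upload time turns a slow upload into a guaranteed failure.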
For now, using the local backend can probably resolve this.
But the key problem is why latency slows down uploading, and what can we do about it?
An experiment shows that, as RTT grows, the throughput of raw TCP becomes limited: each time RTT doubles, throughput drops to about half or less.
Detailed info:
./test-result/ping-0ms.log :
[ 4] 0.00-10.00 sec 9.92 GBytes 8.52 Gbits/sec 47 sender
[ 4] 0.00-10.00 sec 9.91 GBytes 8.52 Gbits/sec receiver
./test-result/ping-100ms.log :
[ 4] 0.00-10.00 sec 268 MBytes 225 Mbits/sec 6 sender
[ 4] 0.00-10.00 sec 267 MBytes 224 Mbits/sec receiver
./test-result/ping-200ms.log :
[ 4] 0.00-10.00 sec 123 MBytes 103 Mbits/sec 15 sender
[ 4] 0.00-10.00 sec 123 MBytes 103 Mbits/sec receiver
./test-result/ping-400ms.log :
[ 4] 0.00-10.00 sec 50.5 MBytes 42.3 Mbits/sec 14 sender
[ 4] 0.00-10.00 sec 49.7 MBytes 41.7 Mbits/sec receiver
./test-result/ping-800ms.log :
[ 4] 0.00-10.00 sec 13.7 MBytes 11.5 Mbits/sec 0 sender
[ 4] 0.00-10.00 sec 13.7 MBytes 11.5 Mbits/sec receiver
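One way to read these numbers is through the general TCP bound that a single connection cannot send more than one window of data per round trip, i.e. throughput <= window / RTT. The sketch below only applies that relation to the sender-side figures above. The helper names are hypothetical, and it assumes the label in each log file name is the RTT, which the thread does not state explicitly. The 0ms run is left out because its real RTT is unknown.

```python
# Sender-side throughput in Mbit/s from the iperf logs above,
# keyed by the delay in ms (assumed here to be the RTT).
measured = {100: 225.0, 200: 103.0, 400: 42.3, 800: 11.5}


def implied_window_mbit(rtt_ms, throughput_mbps):
    """Data in flight per round trip implied by throughput * RTT, in Mbit."""
    return throughput_mbps * rtt_ms / 1000.0


def max_throughput_mbps(window_mbit, rtt_ms):
    """Upper bound on throughput for a fixed window: window / RTT."""
    return window_mbit / (rtt_ms / 1000.0)


def doubling_ratios(samples):
    """Throughput ratio between each RTT and the next larger one."""
    rtts = sorted(samples)
    return [samples[b] / samples[a] for a, b in zip(rtts, rtts[1:])]


# Window implied by each measurement, and the step-to-step ratios.
windows = {rtt: implied_window_mbit(rtt, mbps) for rtt, mbps in measured.items()}
ratios = doubling_ratios(measured)
```

With a constant window the ratios would all be exactly 0.5. Here they come out below 0.5, so the effective window itself shrinks as the delay grows, which would be consistent with something beyond the pure window / RTT limit (for example losses and retransmissions) also being involved.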
Since #400 is merged, I think this issue can be closed? @YuJuncen