bladebit icon indicating copy to clipboard operation
bladebit copied to clipboard

Tracking performance discrepancies on Windows vs Linux

Open harold-b opened this issue 3 years ago • 8 comments

User Reythia on reddit provided timings for several timings for tests he ran, showing severe performance deficiencies, especially during Phase 1 sort, when running on windows as compared to Linux (even on WSL better results were obtained):

Time - seconds Windows (v1.2.0) WSL (v1.2.0) WSL (v1.1.0 control)
Phase 1 954 324 324
of which sorting 848 194 191
Phase 2 32 32 32
Phase 3 372 175 175
Phase 4 0.7 0.7 0.7
Total 1363 560 560
    -m --no-cpu... -m --no-cpu WSL WSL -m
Phase 1 954 468 604 649 295 302
sorting 848 358 481 525 172 177
other 106 110 123 124 123 125
Phase 2 32 32 32 32 32 32
Phase 3 372 228 177 176 172 172
Phase 4 0.7 0.7 0.7 0.7 0.7 0.7
Total 1363 730 817 860 526 533

Tests performed on:

Threadripper 3995wx with RAM @ 3200MHz. Output drive is an NVME RAID 0. Windows Enterprise.

harold-b avatar Oct 04 '21 15:10 harold-b

can you share hardware specs?

cemalefendi avatar Oct 05 '21 09:10 cemalefendi

can you share hardware specs?

I don't have them, but did ask the reddit user to see if he's willing to share them.

harold-b avatar Oct 08 '21 21:10 harold-b

Updated with test machine info.

harold-b avatar Oct 14 '21 03:10 harold-b

same specs

3995wx

total 390 seconds

3995

cemalefendi avatar Oct 14 '21 14:10 cemalefendi

Thanks for the info on your system. That is Ubuntu, however. Have you happened to have run this on Windows? Interesting that you are getting 46 second faster times on Linux, though.

harold-b avatar Oct 14 '21 15:10 harold-b

harold-b something (?) is off on your Threadripper 3995wx with ?? RAM @ 3200MHz.. Would seem you should get far better results. With a straight Win 10 Pro 3955WX setup (16C/32T 128GB 3200MHz Sam 980 Pro) my best time for k32 has been 840 sec (vs your 1340 sec). I posted about it here w/logs https://github.com/Chia-Network/bladebit/discussions/186#discussioncomment-3743524 ... showing more typical ave result over time of 14.8 minutes (886 sec). Have done many days' of these plots approx 100TB. J Pls be aware plots are made with Windows BB Disk v2 for MMX, not Chia, but should be similar, if not identical for Chia AFAIK.

FuzeGuy avatar Oct 28 '22 14:10 FuzeGuy

These are Reythia's windows-specific times w/ BB RAM. There's an apparent performance issue with memory bandwidth on windows machines. I have the same setup which gets ~300 second plot times on CentOS Stream 8 BB RAM. BB disk comes out at around 8-9 minutes, same OS and system.

harold-b avatar Oct 28 '22 20:10 harold-b

@harold-b this is just some sort of system performance issue? Can we close?

jmhands avatar Aug 03 '23 00:08 jmhands