PhoenixGo icon indicating copy to clipboard operation
PhoenixGo copied to clipboard

AVX + AVX2 + FMA + AVX512 : Windows release does not support these instructions, please release windows precompiled versions for modern computers

Open wonderingabout opened this issue 6 years ago • 7 comments

Can you release 2 windows releases for modern computers please ?

Computer from google cloud : Windows Server 2016 Tesla V100 Intel Xeon Phi (supports AVX512F)

cpu instructionsv2

after that the games are played no problem, but it is slower because no AVX / AVX2 / FMA / AVX512F

  • release 1 : supports AVX and AVX2 and FMA
  • release 2 : supports AVX512F (in linux it is supported by the compiler)

on ubuntu no problem to use these modern CPU instructions

Since tensorRT does not support batch size higher than 4, there is little benefit to use ubuntu instead of windows (harder to configure too) So i think these 2 releases for windows and mac would be greatly appreciated :+1:

can you make these 2 releases for windows (and mac) ?

big thanks ! @wodesuck

wonderingabout avatar Dec 14 '18 21:12 wonderingabout

I would try building an avx/avx2 version. But since I don't have any PC with avx512, you may need to build it yourself if really want.

wodesuck avatar Dec 17 '18 09:12 wodesuck

avx/avx2 should cover most usages

for avx512, i use avx512 in ubuntu 16.04 after compiling with bazel, so no i dont need it i just thought it was a good idea to add it

most important is avx/avx2 release for windows (and mac)

wonderingabout avatar Dec 17 '18 11:12 wonderingabout

any update on avx/avx2 builds for windows and mac @wodesuck ?

wonderingabout avatar Dec 27 '18 07:12 wonderingabout

Not yet. I got some problem while building, but don't have time to fix it yet.

wodesuck avatar Jan 03 '19 11:01 wodesuck

ok, when you try this again, can you also add fma too ?

(my r7 1700 has avx/avx2/fma support)

wonderingabout avatar Jan 03 '19 11:01 wonderingabout

@fiskerhuang @funionguo

wonderingabout avatar Jan 07 '19 17:01 wonderingabout

some benchmark on my gtx1060 :

it is much slower than on ubuntu here 3000 sims per move with default settings (all time manage settings disabled)

47s per move versus 21s per move on ubuntu with tensorrt (and arround 25s per move without tensorrt)

see : avxv1 avxv2

compare it to my ubuntu benchmarks (4000 sims per move with same settings) here : https://github.com/wonderingabout/PhoenixGo/blob/faqv2-bazel-master/docs/benchmark-gtx1060.md

i understand that it will take time, and i am not asking the developers to hurry,

but the windows release is indeed much slower than the linux release, due to the lack of avx avx2 fma (and possibly also because nvidia drivers are said to be arround 10% more powerful on ubuntu)

wonderingabout avatar Jan 07 '19 20:01 wonderingabout