Laurent Wirz

Results 9 comments of Laurent Wirz

I verified my statement about recompiling test.cpp. When running on the driver and the bitstream generated by the dev branch and only recompiling test.cpp: - dev test.cpp does not work...

~~The default count of 16~~ Count of 24

I now initialized using: accl.get()->initialize(ranks, mpi_rank, mpi_size, 64, 1024, options.seg_size); I still get the error. Maybe it's sheer chance, but I had to repeat it 7 times to produce the...

Results(1 means test succeeded): ### accl.get()->initialize(ranks, mpi_rank, mpi_size, 64, 1024, options.seg_size); - dev~1 1111111111 - docu for vadd exampl 1111111111 - hls code compatibility with Vitis 2023+ 1111111011 ### accl.get()->initialize(ranks,...

Can confirm, that I observe similar behaviour when running Allreduce in isolation. I tried to run Allreduce with a size of just 2. The first run succeeded. On the secnd...

I can also confirm, that the issues are not present on the commit before the 196 merge. [Merge pull request](https://github.com/Xilinx/ACCL/commit/01f49d2dd01b6f2cc754f1e73e5a0f302c3f0431)

I mean to say they are probably introduced in the 196-fix. The commit right before is the 194 merge(01f49d2), on which the issue is not present.

It's the test/host/Coyote/runscripts/run.sh with ``` TEST_MODE=(5) N_ELEMENTS=(1048576) # 128 256 512 1024 2048 4096 8192 16384 32768 65536 131072 262144 524288 1048576 ```

I didn't try the equivalent as a isolated testcase. But the emulator works with the ProcessGroup with different sizes and repetitions, while in hardware it shows behaviour like this very...