Nitin Bhat

22 comments by Nitin Bhat

*Original date: 2019-03-15 14:59:20* --- I will try to reproduce this issue with a simple example.

*Original date: 2019-04-03 15:22:59* --- Added a test case: https://charm.cs.illinois.edu/gerrit/c/charm/+/5055, but haven't been able to reproduce the bug so far.

> Still in progress, but I put it here to get input from @nitbhat since the primary thing it affects is some ZC stuff.
>
> The main question I...

> 1. Why is ckJustMigrated called here? And on all chares?

Since `ckJustMigrated` is called at the end of migration, for chares migrated with ZC Pup, I don't call that...

With just `export UCX_ZCOPY_THRESH=-1`, I saw a hang during the initial domain decomposition:
```
Charm++> Running in SMP mode: 64 processes, 55 worker threads (PEs) + 1 comm threads per...
```

> @nitbhat, can you please try UCX_IB_RX_MAX_BUFS=32768 without any other settings?
> if it does not help, smaller value is worth trying (say 8192)

Okay, I'll try that.

@brminich: I haven't been able to test that setting yet. (Frontera was down for maintenance on Tuesday and now for some reason, I'm getting weird errors while launching the MPI...

@brminich: I tried different values for UCX_IB_RX_MAX_BUFS from 32k to 2k, and I got the same error. For the case when I set UCX_IB_RX_MAX_BUFS to 2048, I saw this warning:...

@trquinn: How can I get access to the dwf1b benchmark? Is it the same as dwf1.6144 as listed in https://github.com/N-BodyShop/changa/wiki/ChaNGa-Benchmarks?

I tried the `h148.cosmo50PLK.6144g3HbwK1BH.param` benchmark on 64 nodes with 2 processes/node, built on charm that was built using ucx master, and I see the crash which is the same as...