Zhang Junda comments

Results 6 comments of


                                            Zhang Junda

encounter nan error in finding neg idx by torch.multinomial

Thanks for your answer! I'll have a try

encounter nan error in finding neg idx by torch.multinomial

Hi, I tried what you suggested in this way: Old code: ```python3 for b in range(bs): neg_idx = torch.multinomial(weights_t2i[b], 1).item() image_embeds_neg.append(image_embeds[neg_idx]) ``` Current code: ```python3 for b in range(bs): nan_idx...

encounter nan error in finding neg idx by torch.multinomial

Sorry for the late reply, it's little sensitive in our data... I'll try to squeeze time to experiment in some public datasets and let you know...

CRIU Cuda support

In the Eurosys 20, a paper named "Balancing Efficiency and Fairness in Heterogeneous GPU Clusters for Deep Learning" said they implemented this by using CRIU.

CRIU Cuda support

> Indeed, thanks for pointing out that paper. > > I just had a look and they write that they do not checkpoint the GPU part only the CPU part....

CRIU Cuda support

> > > Indeed, thanks for pointing out that paper. > > > I just had a look and they write that they do not checkpoint the GPU part only...