ldd91

Results 22 comments of ldd91

Here i will show two scenario in our platform: First: # Linux flie system quota ## support - Limit the maximum disk quota for a user (using user limits) -...

Second: # Lustre quota Each user has its own user name, uid, GID, and the group to which it belongs. In our scenario, we create a common directory for each...

I deployed MPIJob using Role in my own namespaces however it failed,but when i used the clusterrole it succeed.My question is how to set RBAC in cluster when using MPIJob.

Thank you for timely reply.I will have a try.@rongou @everpeace

@chunhanl Hi, I change the code to support multi-GPU,however I meet the same error:output shape [] doesn‘t match the boradcast shape [1,1],would you share how you reshape latency_to_accumulate

@latifisalar Thank you very much for your help,I will have a try

@latifisalar I meet a new issue,with the log shows:AssertionError: Gradients were computed more than backward_passes_per_step times before call to step(). Increase backward_passes_per_step to accumulate gradients locally,

@chunhanl @latifisalar ,Hi, Have you test this project in ImageNet?Can this method reach the resoult of the paper,I test this in ImageNet but only get 20% accurency and loss is...