Bairen Yi
I don't think sharing one GPU between multiple applications is generally a good idea, given that TF takes control of the device's global memory allocation and all the other resources that require...
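(Not an endorsement of sharing, but for completeness, a minimal sketch of the allocator knobs TF exposes; the 4096 MB cap below is purely illustrative.)

```python
import tensorflow as tf

# By default TF grabs (nearly) all device memory up front, which is what makes
# sharing one GPU between applications painful. Two common mitigations:
gpus = tf.config.list_physical_devices('GPU')
if gpus:
    # Option 1: allocate memory on demand instead of all at once.
    tf.config.experimental.set_memory_growth(gpus[0], True)

    # Option 2: hard-cap this process to a fixed slice of the device
    # (4096 MB is just an illustrative number; don't combine with option 1).
    # tf.config.set_logical_device_configuration(
    #     gpus[0],
    #     [tf.config.LogicalDeviceConfiguration(memory_limit=4096)])
```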
Since pthread_once fires the function call twice, it is reasonable for us to file the issue twice. (Credit: https://www.zhihu.com/people/zhu-xiao-e/)
I haven't looked into this for a long time. You're free to use my code as is.
This PR should be raised in core PyTorch, as `make_fx` has been moved there.
A little more context: this issue hit me when trying to trace through batch norm in training mode, as the number of tracked batches is incremented twice during...
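For reference, a minimal repro sketch of what I mean; the module, shapes, and the exact counter behaviour under tracing are illustrative assumptions, not a verified test case.

```python
import torch
from torch.fx.experimental.proxy_tensor import make_fx

# BatchNorm in training mode updates the num_batches_tracked buffer on every call.
bn = torch.nn.BatchNorm1d(4).train()

def step(x):
    return bn(x)

x = torch.randn(8, 4)
print(int(bn.num_batches_tracked))  # 0 before tracing

# make_fx executes the function once to record the graph; the expectation is
# that the buffer advances by exactly one per traced call, not two.
gm = make_fx(step)(x)
print(int(bn.num_batches_tracked))
```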
In HCA mode, a veth is created separately from the IB device (using Calico or Flannel), and the interface bound to the netdev, e.g. eth0, is bridged to the host...
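To make the HCA-to-netdev mapping concrete, a small sketch that walks the Linux sysfs tree; the `/sys/class/infiniband/<hca>/device/net` layout and device names like mlx5_0 are assumptions about a typical Mellanox setup.

```python
import os

SYS_IB = "/sys/class/infiniband"          # standard sysfs location for IB/RoCE HCAs

for hca in sorted(os.listdir(SYS_IB)):    # e.g. mlx5_0, mlx5_1 (names are examples)
    net_dir = os.path.join(SYS_IB, hca, "device", "net")
    if os.path.isdir(net_dir):
        # netdev(s) backed by the same PCI function as this HCA, e.g. eth0
        print(hca, "->", ", ".join(sorted(os.listdir(net_dir))))
    else:
        print(hca, "-> no associated netdev found under sysfs")
```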
> @byronyi If we are going to contribute to addons first, do we need an RFC here?

I guess the design was originally targeted at TF core. As @alextp said,...
> I think TensorFlow can provide a way to extend optimizers so that you can extend existing optimizers to handle your sparse weights. cc @omalleyt12 who proposes the new customizable...
@rz001 Hi, author of GDR here. Sorry that I only just saw this issue. Are you still running into this error?
> Hello, do you have any information about when optimizers will be allowed to perform global gradient clipping across several GPUs? Some of my work quite critically depends on this...
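For context, a hedged sketch of what manual global-norm clipping looks like today on a single worker; the toy model, data, and the 1.0 clip norm are placeholder assumptions.

```python
import tensorflow as tf

# Toy model, data, and the 1.0 clip norm are placeholders for illustration.
model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
optimizer = tf.keras.optimizers.SGD(learning_rate=0.1)

@tf.function
def train_step(x, y):
    with tf.GradientTape() as tape:
        loss = tf.reduce_mean(tf.square(model(x) - y))
    grads = tape.gradient(loss, model.trainable_variables)
    # Clip by the global norm computed across *all* gradients, then apply.
    grads, _ = tf.clip_by_global_norm(grads, 1.0)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss

loss = train_step(tf.random.normal([32, 8]), tf.random.normal([32, 1]))
```

Note that under a distribution strategy, `apply_gradients` aggregates per-replica gradients itself, so a clip applied inside the replica function only sees the unaggregated, per-replica gradients; clipping on the truly global (cross-GPU) norm is exactly the hook being asked about here.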