tcourat
Hi. I had the same issue. It was caused by a mismatch between the CUDA version used by PyTorch and the native CUDA version on my machine. I got the...
Same issue here.
> When I want to run train.py in the tools directory, it prompts me that mmcv=1.0.5 is required. Does the training of this program only work in this environment? I...
I am interested in the training configs for DepthFormer and PackNet too.
> Probably I understood paper wrong, but thought it was mentioned linear classification over features patch-wise. If that is so, 1x1 convolution on unrolled patches is mathematically equivalent to linear...
Google just released a paper very similar to this one (diffusion models for depth prediction). They claim to be able to predict **metric depth** if you know the camera FOV: https://diffusion-vision.github.io/dmd/
> I had a follow up question about this - the loss function specified in the paper for semantic preservation (equation 9) computes the sum of all the cosine similarity...
Thanks for the quick answer. This is odd because the DMD paper seemed to obtain results consistent with those from ZoeDepth (they report AbsRel=0.091 on SunRGBD).
Hi, you can keep the embeddings during inference by using forward hooks. For instance, if you want to store the image encoder features while running the mask generator:...
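The idea above can be sketched as follows. This is a minimal, hedged example of `register_forward_hook` on a toy model; the attribute names (`encoder`, `head`) are placeholders and will differ in the actual mask generator's model:

```python
# Sketch: capturing intermediate features with a PyTorch forward hook.
# `encoder` and `head` are illustrative stand-ins, not real library attributes.
import torch
import torch.nn as nn

class TinyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Linear(4, 8)   # stand-in for an image encoder
        self.head = nn.Linear(8, 2)      # stand-in for a mask decoder/head

    def forward(self, x):
        return self.head(self.encoder(x))

features = {}

def save_features(module, inputs, output):
    # Store a detached copy of the encoder output for later use
    features["encoder"] = output.detach()

model = TinyModel()
handle = model.encoder.register_forward_hook(save_features)

with torch.no_grad():
    out = model(torch.randn(1, 4))

print(features["encoder"].shape)  # torch.Size([1, 8])
handle.remove()  # remove the hook when you no longer need it
```

The hook fires on every forward pass of the hooked submodule, so the dict always holds the features from the most recent call.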
I guess you need to train a new segmentation head on top of the upsampled features from a frozen backbone ?