Shilong Liu

90 comments by Shilong Liu

Yes, it is. By default, ```num_feature_levels=4```, but only ```3``` different scales are extracted from the backbone. Hence the highest feature map (C4) is 2x upsampled to serve as the 4th feature level.
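For illustration, here is a rough sketch of the idea (the shapes, names, and the `F.interpolate` call are illustrative assumptions, not the repo's actual implementation):

```python
import torch
import torch.nn.functional as F

# Illustrative backbone outputs: 3 feature maps at decreasing resolution.
feats = [
    torch.randn(1, 256, 100, 100),
    torch.randn(1, 256, 50, 50),
    torch.randn(1, 256, 25, 25),  # highest-level map in this sketch
]

num_feature_levels = 4
if num_feature_levels > len(feats):
    # Derive the missing level from the last backbone map by 2x resampling,
    # following the idea described in the comment above.
    extra = F.interpolate(feats[-1], scale_factor=2.0, mode="nearest")
    feats.append(extra)

print([tuple(f.shape) for f in feats])
```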

Thanks for your suggestions. We will clean up this part and update it soon.

@fernandorovai Thanks for your attention. Our new work DINO, which is based on DAB-DETR, is available now: https://github.com/IDEACVR/DINO. You can refer to that repo for Swin Transformer support.

For the first question, you are right; it seems to be a bug in our implementation. For the second, we only use PE(xy) as the positional queries, see [this line](https://github.com/IDEA-opensource/DAB-DETR/blob/main/models/DAB_DETR/transformer.py#L238), which will...
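For reference, here is a minimal sketch of what a sinusoidal PE over (x, y) anchor coordinates typically looks like in DETR-style models; the function name, shapes, and defaults are illustrative assumptions, not the repo's exact code:

```python
import math
import torch

def sine_embed_xy(xy, num_feats=128, temperature=10000):
    """Sinusoidal embedding of normalized (x, y) anchor centers.

    xy: (..., 2) with values in [0, 1]; returns (..., 2 * num_feats).
    """
    scale = 2 * math.pi
    dim_t = torch.arange(num_feats, dtype=torch.float32, device=xy.device)
    dim_t = temperature ** (2 * torch.div(dim_t, 2, rounding_mode="floor") / num_feats)
    pos_x = (xy[..., 0:1] * scale) / dim_t  # (..., num_feats)
    pos_y = (xy[..., 1:2] * scale) / dim_t
    # Interleave sin/cos over the feature dimension.
    pos_x = torch.stack((pos_x[..., 0::2].sin(), pos_x[..., 1::2].cos()), dim=-1).flatten(-2)
    pos_y = torch.stack((pos_y[..., 0::2].sin(), pos_y[..., 1::2].cos()), dim=-1).flatten(-2)
    return torch.cat((pos_y, pos_x), dim=-1)

# Example: 300 anchor centers for a batch of 2 images.
anchors_xy = torch.rand(300, 2, 2)     # (num_queries, batch, 2)
pos_query = sine_embed_xy(anchors_xy)  # (300, 2, 256)
```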

Can you provide more information about your device, environment (e.g., the CUDA version and PyTorch version), and the commands you used?

It seems like a problem with your environment. Can you share your environment details and the command you used?

Thanks for your question. This discussion may be helpful: https://github.com/facebookresearch/detr/issues/101

The Transformer architecture is GPU-memory intensive, so you may need GPUs with more memory. As for the RAM, it seems 32GB is not enough. One way to alleviate this is to...

See #23 for fine-tuning details. You can ignore the pretrained checkpoints if you want to train DINO on your custom dataset from scratch.

Thanks for pointing out the problem. We will correct it and provide a manual for custom training later.