David Fan comments

low mIOU in train with coco and ResNet as backbone

The issue is with how images are padded in the [RandomScaleCrop](https://github.com/jfzhang95/pytorch-deeplab-xception/blob/master/dataloaders/custom_transforms.py#L88) data augmentation when the shorter side after rescaling is too small to be cropped. As is, the...
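To make the padding situation concrete, here is a minimal sketch of the arithmetic behind a "rescale, then pad up to the crop size" step. It is not the repository's code, and it is not the fix this comment goes on to describe (the excerpt is cut off); the function names `rescale_short_side` and `pad_to_crop` and the example sizes are assumptions chosen for illustration.

```python
def rescale_short_side(width, height, short_size):
    """Scale (width, height) so the shorter side equals short_size,
    keeping the aspect ratio. Hypothetical helper for illustration."""
    if height > width:
        new_w = short_size
        new_h = int(height * new_w / width)
    else:
        new_h = short_size
        new_w = int(width * new_h / height)
    return new_w, new_h


def pad_to_crop(width, height, crop_size):
    """Return (pad_w, pad_h): how many pixels must be added on each axis
    so that both sides are at least crop_size. Hypothetical helper."""
    pad_w = max(crop_size - width, 0)
    pad_h = max(crop_size - height, 0)
    return pad_w, pad_h


# Example: a 640x480 image rescaled so its shorter side is 300,
# then padded so that a 513x513 crop is possible.
new_w, new_h = rescale_short_side(640, 480, short_size=300)
pad_w, pad_h = pad_to_crop(new_w, new_h, crop_size=513)
padded_size = (new_w + pad_w, new_h + pad_h)
```

In this example both axes need padding, so a large share of the final crop is fill value rather than image content, which is the kind of case where the padding strategy can matter for training.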

How to deal with Long time video data which has 1000 frames

Could also try substituting in a sparse / linear self-attention mechanism

GPU version support?

https://github.com/harritaylor/torchvggish/pull/19 Sending the model to GPU works fine, but PyTorch will complain `RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same` unless the audio tensor is also...
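The error quoted above is the generic PyTorch device mismatch: the model's weights live on one device while the input tensor lives on another. A minimal sketch of the usual remedy follows, using a single `nn.Conv2d` as a stand-in for the real network; the layer, the input shape, and the variable names are assumptions for illustration and are not taken from torchvggish or the linked pull request.

```python
import torch
import torch.nn as nn

# Use the GPU when one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Stand-in model (assumption): any module with weights behaves the same way.
model = nn.Conv2d(1, 4, kernel_size=3, padding=1).to(device)

# Hypothetical batch of audio patches, created on the CPU by default.
audio = torch.randn(2, 1, 96, 64)

# Move the input to the same device as the model's weights before the
# forward pass; skipping this while the model sits on a GPU raises the
# "Input type ... and weight type ... should be the same" RuntimeError.
audio = audio.to(device)

with torch.no_grad():
    out = model(audio)
```

Keeping a single `device` variable and calling `.to(device)` on both the model and every input keeps the two in sync, and the same script still runs on a machine without a GPU.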

Memory leaks from address sanitizer

I observe an increase in memory usage over the course of training when using the AudioReader during distributed PyTorch training. This doesn't happen when using the VideoReader.

Separate index generation necessary?

I just copied what Browser Genome did, although it's true they might not have worried about efficiency.

Changed panel toggles from collapse to tabs, implemented variance filter and Browser Genome functionality, made buttons and inputs Bootstrap compliant, added recursive upload and code for .2bit upload (but not the files because git lfs didn't work for me)

Uncommented the .2bit download code and everything should work now! My copy is working at https://dfan97.github.io/ubit2/ with all the features (except the .2bit download).

Implement Locking of Text Tower for `CLIP` Models

Added support for downloading PNG file along with SVG file.

Add close button and/or Esc keyboard shortcut

Support GPU Training