bhack comments

Results 1417 comments of


                                            bhack

Add support for large `num_items` to `device_select.cuh`

For `nonzero` as `sum` seems already covered in the table at: https://github.com/NVIDIA/cccl/issues/50#issuecomment-1956325564 `cub::DeviceSelect::Flagged` is the only one still needed for large `N`: ```cuda cub::DeviceSelect::Flagged(nullptr, temp_storage_bytes, counting_itr, itr, out_temp.mutable_data_ptr(), (int*)num_nonzeros.get(), N,...

Add support for large `num_items` to `device_select.cuh`

@elstehle We are having another problem related to this with the just release (but popular= model by Meta SAM2: https://github.com/facebookresearch/segment-anything-2/issues/44 Any progress on this? Basically pytorch `nonzero` ops rely on...

Pretrained Weights for Pytorch Version of Online Tapir/BootsTapir

Do you have any example of the Online Tapir/BootsTapir inference?

Bad Performance on Visual Odometry Image Sequences?

I have also found many of the described problems and false positives matching in these type of sequences. I suppose that for an odometry like camera movement it would also...

T 0 visibility at higer resoultion

> One sanity check is how the model predicts 't 0' points at 256x256 resolution, is it working mostly of the time? I've still not experimented on the same sequence...

T 0 visibility at higer resoultion

> The model is (approximately) estimating whether the result is within an 8 pixel threshold at every resolution; relative to the image size, this threshold gets smaller and smaller later...

T 0 visibility at higer resoultion

Or are you talking about only the "pyramid" effect in iterative approach at section 4 and not about the "feature" backbone pyramid? > 4. Extension to High-Resolution Videos When running...

T 0 visibility at higer resoultion

I also have and extra doubt about section 4. As you are going to work with iterative refinements at 2x iter you are still locked in the "receptive field" of...

OOM error

@yangyi02 Can this trick used also for training? I want to ask if you also a raw estimate about the memory O of the `build_model_init` feature extractor/backbone vs the tracking...

OOM error

> Feature backbone is relatively small (similar to ResNet-18), but it's applied across the whole video, so for long videos can be problematic, especially if the video is long. Sorry...