segmentation_models.pytorch
segmentation_models.pytorch copied to clipboard
Mix ViT in_channels restriction
What's the reasoning behind limiting the Mix Visual Transformer encoder #632 to 3 input channels?
https://github.com/qubvel/segmentation_models.pytorch/blob/master/segmentation_models_pytorch/encoders/mix_transformer.py#L468
I couldn't spot anything in the paper or the original SegFormer implementation.
This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days.
I've removed the restriction and the model appears to learn. Is there any argument for this restriction?
This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days.
This issue was closed because it has been stalled for 7 days with no activity.