[Feature Request] Groups added to Conv2d
🚀 What is the purpose of this issue?
According to the documentation, these are the parameters available for Conv2d (a minimal usage sketch follows the list):
- in_channels (int) – The number of input channels.
- out_channels (int) – The number of output channels.
- kernel_size (int or tuple) – The size of the convolution filters.
- stride (int or tuple, optional) – The size of the stride when applying the filter. Default: 1.
- padding (int or tuple, optional) – How many positions to 0-pad the input with. Default: 0.
- bias (bool, optional) – If True, add a learnable bias to the output. Default: True.
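For reference, here is a minimal usage sketch of the parameters listed above (shapes and values are illustrative only; mlx expects channels-last NHWC input):

```python
# Minimal Conv2d sketch with the currently documented parameters.
# Assumes mlx's channels-last (NHWC) layout.
import mlx.core as mx
import mlx.nn as nn

conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3,
                 stride=1, padding=1, bias=True)
x = mx.random.normal((1, 32, 32, 3))   # (N, H, W, C_in)
y = conv(x)                            # (1, 32, 32, 16)
print(y.shape)
```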
However, the dilation and groups parameters, as defined in the torch implementation of Conv2d, would be desirable for implementing well-known architectures, e.g. ASPP. The following is the description of these two parameters according to torch:
- dilation controls the spacing between the kernel points; also known as the à trous algorithm. It is harder to describe, but this link has a nice visualization of what dilation does.
- groups controls the connections between inputs and outputs. in_channels and out_channels must both be divisible by groups. For example, at groups=1, all inputs are convolved to all outputs. At groups=2, the operation becomes equivalent to having two conv layers side by side, each seeing half the input channels and producing half the output channels, with both subsequently concatenated (see the sketch after this list).
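To make the groups semantics concrete, here is a small sketch in torch (where the feature already exists) showing that a groups=2 convolution matches two half-width convolutions run side by side and concatenated, given matching weights:

```python
# groups=2 is equivalent to two independent convs over half the channels each.
import torch
import torch.nn.functional as F

x = torch.randn(1, 4, 8, 8)      # (N, C_in=4, H, W)
w = torch.randn(6, 2, 3, 3)      # (C_out=6, C_in/groups=2, kH, kW)

grouped = F.conv2d(x, w, padding=1, groups=2)

# Manual equivalent: split channels and filters in half, convolve, concatenate.
y0 = F.conv2d(x[:, :2], w[:3], padding=1)
y1 = F.conv2d(x[:, 2:], w[3:], padding=1)
manual = torch.cat([y0, y1], dim=1)

print(torch.allclose(grouped, manual, atol=1e-5))  # True
```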
Thank you for your attention 👍
@jagrit06 In the conv2d op call, there is an assert that checks whether dilation is set to 1, but from a quick look through the implementation it seems to support dilations. I removed this check, messed with it a little while comparing against pytorch, and I am seeing the same results for different dilation values. Is there a specific case where dilation isn't supported?
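Roughly the kind of check described (a sketch, not exactly what was run; it assumes mx.conv2d exposes a dilation argument once the assert is lifted, and remembers that mlx is channels-last while torch is channels-first):

```python
# Compare mlx and torch conv2d outputs with dilation > 1.
# Assumes mx.conv2d accepts a `dilation` argument; mlx uses NHWC input and
# weights of shape (C_out, kH, kW, C_in), torch uses NCHW.
import numpy as np
import mlx.core as mx
import torch
import torch.nn.functional as F

x = np.random.randn(1, 16, 16, 4).astype(np.float32)   # NHWC
w = np.random.randn(8, 3, 3, 4).astype(np.float32)      # (C_out, kH, kW, C_in)

y_mlx = mx.conv2d(mx.array(x), mx.array(w), padding=2, dilation=2)

y_torch = F.conv2d(
    torch.from_numpy(x).permute(0, 3, 1, 2),             # NHWC -> NCHW
    torch.from_numpy(w).permute(0, 3, 1, 2),             # -> (C_out, C_in, kH, kW)
    padding=2, dilation=2,
).permute(0, 2, 3, 1)                                    # back to NHWC

print(np.allclose(np.array(y_mlx), y_torch.numpy(), atol=1e-4))
```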
Well, at the moment convolutions with dilation revert to a fairly naive and slow kernel on the GPU (though I think we do fine on the CPU).
That restriction was added basically to sidestep that performance hit.
Should the assert be removed and a separate issue created to improve convolution speed on the GPU when dilation is not 1? That would give the core issue more visibility and open up the possibility of a contributor helping out.
agree with @dc-dc-dc, this would be really helpful for implementations that require dilation
FWIW this PR now also depends on a groups param being present.
Added dilation in #766
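For anyone landing here, a short sketch of using the newly added dilation (assuming #766 exposes it on nn.Conv2d in addition to the op-level mx.conv2d; layout is NHWC):

```python
# Atrous/dilated convolution via nn.Conv2d, assuming a `dilation` parameter.
import mlx.core as mx
import mlx.nn as nn

atrous = nn.Conv2d(in_channels=16, out_channels=16, kernel_size=3,
                   padding=2, dilation=2)   # effective 5x5 receptive field
x = mx.random.normal((1, 32, 32, 16))
print(atrous(x).shape)                      # (1, 32, 32, 16)
```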
Is anyone working on groups for convolution? E.g. @Stealeristaken?
I am closing https://github.com/ml-explore/mlx/pull/637 as it's inactive but it might be useful as a starting point.
Been a bit busy these days. I tried adding groups to convs, but it didn't work very well. So, I closed that pull request. I'll try again in the next few days.
I'm eager to get this working as well. I can pick this up if no one is working on it.
I don't think anyone is actively working on it, would be great to have you work on it @Rifur13 !
Feel free to take it. gl!