As the title says.

The error is:
Traceback (most recent call last):
  File "/ai/home/project/sd-scripts-main/train_network.py", line 538, in <module>
    train(args)
  File "/ai/home/project/sd-scripts-main/train_network.py", line 408, in train
    noise_pred = unet(noisy_latents, timesteps, encoder_hidden_states).sample
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1130, in _call_impl
    return forward_call(*input, **kwargs)
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/accelerate/utils/operations.py", line 490, in __call__
    return convert_to_fp32(self.model_forward(*args, **kwargs))
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/torch/amp/autocast_mode.py", line 12, in decorate_autocast
    return func(*args, **kwargs)
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/diffusers/models/unet_2d_condition.py", line 375, in forward
    sample = self.conv_in(sample)
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1130, in _call_impl
    return forward_call(*input, **kwargs)
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/torch/nn/modules/conv.py", line 457, in forward
    return self._conv_forward(input, self.weight, self.bias)
  File "/ai/home/project/sd-scripts-main/venv/lib/python3.10/site-packages/torch/nn/modules/conv.py", line 453, in _conv_forward
    return F.conv2d(input, weight, bias, self.stride,
RuntimeError: Given groups=1, weight of size [320, 9, 3, 3], expected input[2, 4, 64, 64] to have 9 channels, but got 4 channels instead
So how can I train stabilityai/stable-diffusion-2-inpainting?
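
For context, here is a minimal sketch of what the error seems to be saying, using only PyTorch and random stand-in tensors (no real model or data). As far as I understand, the inpainting UNet's first conv layer expects 9 input channels: the 4 noisy latent channels, a 1-channel mask, and 4 channels of masked-image latents, concatenated in that order. That layout is my assumption based on how the diffusers inpainting pipeline builds its input; train_network.py does not do this, and the `mask` and `masked_image_latents` names below are hypothetical.

```python
import torch
import torch.nn as nn

# Stand-in for the inpainting UNet's first layer: the weight shape
# [320, 9, 3, 3] in the error means in_channels=9, out_channels=320.
conv_in = nn.Conv2d(in_channels=9, out_channels=320, kernel_size=3, padding=1)

# Random stand-ins with the shapes from the error message (batch 2, 64x64 latents).
noisy_latents = torch.randn(2, 4, 64, 64)         # what the training script passes today
mask = torch.rand(2, 1, 64, 64).round()           # hypothetical binary mask at latent resolution
masked_image_latents = torch.randn(2, 4, 64, 64)  # hypothetical latents of the masked image

# Passing only the 4-channel latents reproduces the channel-mismatch RuntimeError.
try:
    conv_in(noisy_latents)
    reproduced = False
except RuntimeError as err:
    reproduced = True
    print(err)

# Concatenating along the channel axis gives 4 + 1 + 4 = 9 channels,
# which the 9-channel conv layer accepts.
unet_input = torch.cat([noisy_latents, mask, masked_image_latents], dim=1)
out = conv_in(unet_input)
print(tuple(unet_input.shape), tuple(out.shape))
```

So it looks like the training loop would need to build the mask and masked-image latents per batch and concatenate them before calling the UNet, but I don't know whether sd-scripts supports that.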