defensetongxue
defensetongxue
I think in this line of code, x is processed by mutil-attention. Each att(x,adj) calculate one head of attention score. And these heads are concated and sent to another layer...
May be it can refer to the `torchvision.transforms.Grayscale(num_output_channels=1)` ```python def to_gray(img, num_output_channels=3): gray = cv2.cvtColor(img, cv2.COLOR_RGB2GRAY) if num_output_channels==3: return cv2.cvtColor(gray, cv2.COLOR_GRAY2RGB) elif num_output_channel==1: return gray else: raise ImplementError# rasie an...