HoP
Why can channel reduction prune the height information?
The paper claims that "Accordingly, we first employ a channel reduction operation to the input set Brem to prune the height information and achieve better training efficiency". But why is the height information represented in the channel dimension, such that channel reduction can prune it? And how do you perform the channel reduction?