HyperHuman
Section 3.3 is confusing about the network architecture
As you describe it, you use a separate conv for each control to convert them all to the same input size of 128*128, add them element-wise, and feed the sum to an encoder of SDXL. So is this the same architecture as ControlNet, only with a different base model and more control types? Why do you name it a refiner?
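To make the question concrete, here is a minimal sketch of how I read that step. It only does the shape arithmetic and the element-wise sum in plain Python. The control names, input resolutions, and conv settings (kernel, stride, padding) are my own assumptions for illustration and are not taken from the paper or the code.

```python
def conv_out_size(size, kernel, stride, padding):
    """Output size along one axis for a standard 2D convolution."""
    return (size + 2 * padding - kernel) // stride + 1


def embedded_size(size, layers):
    """Apply a stack of (kernel, stride, padding) conv layers to a spatial size."""
    for kernel, stride, padding in layers:
        size = conv_out_size(size, kernel, stride, padding)
    return size


def fuse(feature_maps):
    """Element-wise sum of equally sized 2D feature maps (nested lists)."""
    rows, cols = len(feature_maps[0]), len(feature_maps[0][0])
    for fmap in feature_maps:
        if len(fmap) != rows or any(len(row) != cols for row in fmap):
            raise ValueError("all control embeddings must share one spatial size")
    return [[sum(vals) for vals in zip(*row_group)] for row_group in zip(*feature_maps)]


# Hypothetical controls: (input resolution, per-control conv stack).
# Each control gets its own convs so that all of them end up at 128*128.
CONTROLS = {
    "depth": (1024, [(3, 2, 1)] * 3),   # 1024 -> 512 -> 256 -> 128
    "normal": (1024, [(3, 2, 1)] * 3),  # 1024 -> 512 -> 256 -> 128
    "pose": (512, [(3, 2, 1)] * 2),     # 512 -> 256 -> 128
}

for name, (resolution, layers) in CONTROLS.items():
    print(name, embedded_size(resolution, layers))
```

If this reading is right, the fused 128*128 map is the only thing the SDXL encoder copy sees, which is why it looks to me like ControlNet with several conditions summed at the input.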