gen-vlkt
gen-vlkt copied to clipboard
zero-shot setting
Hi,
In hico-det, 117 actions and 80 categories make up 600 interactions. In zero-shot setup (UC), a part of the interactions are invisible. Are invalid interactions excluded from the prediction? (e.g. man eating car)
Also, how is the data preprocessed in the zero-shot setup, and the inference process?