Multimodal-Transformer
Multimodal-Transformer copied to clipboard
No data preprocessing
I noticed there are data stored in pkl files but there is no implementations of how these pkl files have been made after preprocessing. How can i reproduce the results using different datasets as there is no clear preprocessing steps?