gyulaaa
gyulaaa
In my understanding, VQA is similar with the ability of zero-shot image-to-text generation mentioned in the BLIP2 paper. They all give the answer about prompt(question / natural language instructions) conditioned...
I got the "RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling `cublasSgemmStridedBatched" error when running the infer.py. Is there anyone else who's had this problem. Thanks
在PairedImageDataset的docs中写了: Paired image dataset for image restoration. Read LQ (Low Quality, e.g. LR (Low Resolution), blurry, noisy, etc) and GT image pairs. 但是后面的GT没有解释,方便解释一下吗?谢谢
when I run the command 'python main.py --config config/base.yaml --experiment experiment_5x1 --signature smile --target figures/smile.png --log_dir log/', It takes about 15 minutes to generate the result svg. It need more...
### Describe the problem I create a collection with every document composed by page_content and a metadata 'style'. the text of style is some style types and joined by comma,...
### Describe the problem I want to extract embeddings for one hundred thousand images by OpenCLIPEmbeddingFunction. But I found the images can only be encoded one by one because the...
when reading training code, I found a learnable positional_embedding which is passed to the first block of Unet1D's downs, mid_blocks, ups. such as the code of Unet1D's downs: for block0,...
initially, the load_checkpoints is imported at the line 20 of generate_objautoencoder by `from utils import yield_forever, load_checkpoints, save_checkpoints` but there is no load_checkpoints. After I change the utils to training_utils,...
how long to train a model for a room type ,like livingrooms_uncond.pt? Thanks for any help
I‘d like to know the reason why why did you train a checkpoint for each room type? Is this the best approach after some trial and error? Another way, room...