gyulaaa issues

Results: 10 issues of gyulaaa

What is the difference between Instructed Zero-shot Image-to-Text Generation and Visual Question Answering in BLIP2?

In my understanding, VQA is similar to the zero-shot image-to-text generation ability mentioned in the BLIP2 paper. Both give an answer to a prompt (a question or natural language instructions) conditioned...

RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED

I got the error "RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling `cublasSgemmStridedBatched`" when running infer.py. Has anyone else had this problem? Thanks.

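What does GT mean in the dataset?

The docs for PairedImageDataset say: "Paired image dataset for image restoration. Read LQ (Low Quality, e.g. LR (Low Resolution), blurry, noisy, etc) and GT image pairs." However, GT is not explained anywhere after that. Could you explain what it means? Thanks.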

speed issue

When I run the command 'python main.py --config config/base.yaml --experiment experiment_5x1 --signature smile --target figures/smile.png --log_dir log/', it takes about 15 minutes to generate the resulting SVG. It needs more...

[Feature Request]: add more operators in where filter by metadata

### Describe the problem I created a collection in which every document consists of page_content and a metadata field 'style'. The value of style is a list of style types joined by commas,...

enhancement
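For readers facing the same situation, one possible workaround is to filter on the client side after fetching the documents. The following is only a minimal sketch in plain Python, not the library's API: the document layout (a dict with `page_content` and `metadata`), the helper names `split_styles` and `filter_by_style`, and the sample data are all assumptions made for illustration.

```python
def split_styles(style_field):
    """Split a comma-joined style string into a set of trimmed, non-empty names."""
    return {part.strip() for part in style_field.split(",") if part.strip()}


def filter_by_style(documents, wanted):
    """Keep documents whose 'style' metadata lists `wanted` as a whole item.

    Matching whole items (not substrings) avoids treating 'post-modern'
    as a match for 'modern'.
    """
    return [
        doc
        for doc in documents
        if wanted in split_styles(doc["metadata"].get("style", ""))
    ]


# Hypothetical sample data, shaped like the collection described in the issue.
docs = [
    {"page_content": "a", "metadata": {"style": "modern, minimalist"}},
    {"page_content": "b", "metadata": {"style": "rustic"}},
    {"page_content": "c", "metadata": {"style": "post-modern,industrial"}},
]

matches = filter_by_style(docs, "modern")
print([d["page_content"] for d in matches])
```

Filtering after retrieval does not scale as well as a server-side operator, which is presumably why the issue asks for one.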

[Feature Request]: is there a plan to support batch processing in OpenCLIPEmbeddingFunction?

### Describe the problem I want to extract embeddings for one hundred thousand images with OpenCLIPEmbeddingFunction. However, I found that the images can only be encoded one at a time because the...

enhancement
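The general idea behind the request, grouping inputs into fixed-size batches before encoding, can be sketched as follows. This is a generic illustration under stated assumptions, not OpenCLIPEmbeddingFunction's actual behavior: `embed_batch` is a hypothetical callable that takes a list of inputs and returns one embedding per input, and `batched` and `embed_in_batches` are names invented here.

```python
from itertools import islice


def batched(items, batch_size):
    """Yield successive lists of at most `batch_size` items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def embed_in_batches(images, embed_batch, batch_size=256):
    """Call the (hypothetical) batch encoder once per batch and collect the results in order."""
    embeddings = []
    for batch in batched(images, batch_size):
        embeddings.extend(embed_batch(batch))
    return embeddings


# Stand-in encoder for demonstration only: maps each input to a 1-element "embedding".
def fake_embed_batch(batch):
    return [[float(len(str(item)))] for item in batch]


result = embed_in_batches(["img_1", "img_22", "img_333"], fake_embed_batch, batch_size=2)
print(result)
```

With a real model, the batch size would be chosen to fit the available GPU memory.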

What is the function of the learnable positional_embedding in the class DiffusionSceneLayout_DDPM?

When reading the training code, I found a learnable positional_embedding that is passed to the first block of Unet1D's downs, mid_blocks, and ups. For example, in the code of Unet1D's downs: for block0,...

The load_checkpoints function is confusing

Initially, load_checkpoints is imported at line 20 of generate_objautoencoder by `from utils import yield_forever, load_checkpoints, save_checkpoints`, but utils contains no load_checkpoints. After I changed utils to training_utils,...

How long does it take to train a model for one room type, such as livingrooms_uncond.pt?

How long does it take to train a model for one room type, such as livingrooms_uncond.pt? Thanks for any help.

Why is a separate checkpoint trained for each room type?

I'd like to know why you trained a separate checkpoint for each room type. Was this found to be the best approach after some trial and error? Another way, room...