Blake

Results 22 issues of Blake

What is the recommended video length for the target video?

In the context of NLP models, I understand what top k top p, and temperature do with regard to sampling. Could someone explain what they mean(and cond_scale) with regards to...

I have been blessed to have been able to get an RTX 3090, and thus I can run this model with many layers and large batch size. I have tried...

**Describe the bug** When using GPTJ or GPT Neo 2.7B with DeepSpeed inference if you give it the short simple "DeepSpeed is" like the tutorial shows, and generate only 50...

bug
inference

**Describe the bug** My understanding of model parallelism is that the model is split over multiple GPUs to lower memory usage per GPU allowing larger models and speeding up inference....

bug
inference

**Describe the bug** I am working on a task where I finetune GPT models. I have tried GPT Neo and GPTJ models. If I try to add custom token to...

bug

Rather than giving the expected result of an array with the detected changepoints, the function is returning an empty array. Why is that?

**Describe the bug** Currently when using DeepSpeed Inference for GPTJ(but probably other models too) and when using contrastive search with Huggingface the results are very poor. Apparently, the results are...

bug
inference

GPT models without KV cache have to recalculate values and thus time to compute grows exponentially given a longer input. Thus, for your benchmarks, how many tokens were generated, and...

question