Kolesh jr
Is `format: json` on by default? Because using LangChain with ChatOllama, it also hangs even without the `format: json` option.
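A minimal sketch of the call I mean, with the model name as a placeholder and `langchain_community` assumed as the import path:

```python
# Minimal sketch: explicitly setting format="json" on ChatOllama.
# Model name is a placeholder; langchain_community is assumed as the integration package.
from langchain_community.chat_models import ChatOllama

llm = ChatOllama(model="mistral", format="json")  # omit format=... to use the default (plain text)
print(llm.invoke("Return a JSON object with a single key 'answer'.").content)
```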
I am also getting the same error. Did you find a fix?
I am having the same issue even on the new version, 0.1.28. It happens after about 200 iterations on a custom fine-tuned 4-bit Mistral on Colab's free-tier T4.
Hey @danielhanchen, I am facing this issue during inference: NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs: query: shape=(1, 2327, 8, 4, 128) (torch.float16), key: shape=(1, 2327, 8,...
Yes, I did. It fails on the free-tier T4 when you call model.generate, but it passes on a V100.
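Roughly the pattern I am running, as a sketch; the model name, prompt, and sequence length are placeholders, not the exact values from my notebook:

```python
# Sketch of the inference path that hangs on the free-tier T4 (names/values are placeholders).
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/mistral-7b-bnb-4bit",  # assumed 4-bit checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)  # enable inference mode

inputs = tokenizer("Hello, world", return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=64)  # the call that hangs on the T4
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```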
@danielhanchen These are the new imports that you suggested in this thread:
import torch
major_version, minor_version = torch.cuda.get_device_capability()
!pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
if major_version >= 8: !pip install...
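For reference, a sketch of what that Colab install cell typically expands to; the package lists inside the branches are an assumption based on the public Unsloth Colab notebooks, not something confirmed in this thread:

```python
# Colab/IPython cell sketch. Branch package lists are assumed, not confirmed here.
import torch

major_version, minor_version = torch.cuda.get_device_capability()

# Install Unsloth itself from GitHub.
!pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"

if major_version >= 8:
    # Newer GPUs (Ampere/Ada, e.g. A100, RTX 30/40 series): include flash-attn.
    !pip install --no-deps packaging ninja einops flash-attn xformers trl peft accelerate bitsandbytes
else:
    # Older GPUs (e.g. T4, V100): skip flash-attn.
    !pip install --no-deps xformers trl peft accelerate bitsandbytes
```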
@danielhanchen Apparently, for some reason, it is now fixed. Sorry for the noise. I appreciate your feedback though. Thanks!
This bug is so frustrating and it doesn't seem to be fixed even in the newer versions
Could someone help us? This issue still persists: I have updated to the latest release, v0.1.28, and it still gets stuck after around 200 iterations on Google Colab...