Chansung Park
Sorry about this, guys. I was granted a free GPU for 7B and 13B by the HuggingFace team, but that was withdrawn today.
Looks like deploying LLaMA in any form is not allowed: https://huggingface.co/spaces/chansung/LLaMA-7B/discussions/5
Yeah, but I didn't expose a hard link to the weights. So it looks like the weights should be used solely for personal purposes.
Any tips to speed up inference?
It runs the model shared by @tloen on a 7-core CPU / 32GB RAM machine with a single RTX 5000. I am hosting it on jarvislabs.ai.
`padding_side="left"` does the trick.
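For example, something like this (the checkpoint id here is just a placeholder; point it at your own converted weights):

```python
from transformers import LlamaTokenizer

# placeholder checkpoint id; substitute your own converted LLaMA weights
tokenizer = LlamaTokenizer.from_pretrained("decapoda-research/llama-7b-hf")

# decoder-only models continue from the last token, so batched prompts
# must be padded on the left to keep each prompt flush with its generation
tokenizer.padding_side = "left"
tokenizer.pad_token_id = 0  # LLaMA ships without a pad token; 0 (unk) is a common choice
```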
@benob You could do something like below:

```python
def evaluate(instructions, input=None):
    # build one prompt per instruction and tokenize them as a single padded batch
    prompts = [generate_prompt(instruction) for instruction in instructions]
    encodings = tokenizer(prompts, return_tensors="pt", padding=True).to("cuda")
    # the generation arguments are a sketch; tune them for your setup
    generation_outputs = model.generate(**encodings, max_new_tokens=256)
    return tokenizer.batch_decode(generation_outputs, skip_special_tokens=True)
```
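Then you can call it with a batch of instructions (these two are just illustrative):

```python
outputs = evaluate([
    "Tell me about alpacas.",
    "Write a Python program that prints the first 10 Fibonacci numbers.",
])
for text in outputs:
    print(text)
```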
Created a Gradio app for this: https://github.com/deep-diver/Alpaca-LoRA-Serve
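At its core it just wraps the batched `evaluate` above in a Gradio interface; a minimal sketch (the actual app in the repo does more than this):

```python
import gradio as gr

def infer(instruction: str) -> str:
    # reuse the batched evaluate() for a single instruction
    return evaluate([instruction])[0]

demo = gr.Interface(fn=infer, inputs="text", outputs="text")
demo.launch()
```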
A bigger model, and more data with much better quality than now.
More examples, with 13B this time.