cphoward

Results 1 comments of cphoward

After playing around with models, I've found something like ``` python def prepare_preprompt_kv_cache(preprompt): inputs = tokenizer(preprompt, return_tensors="np", add_special_tokens=False) model_inputs = { "input_ids": inputs["input_ids"], "attention_mask": inputs["attention_mask"] } # Generate position ids...