andreaKim
andreaKim
Hello, I am trying to test my data in Control Prefix manner. While I am testing, I always encounter error like below. ``` Validation sanity check: 0it [00:00, ?it/s] Validation...
``` model = AutoGPTQForCausalLM.from_quantized( model_name, #use_triton=True, #warmup_triton=False, trainable=True, inject_fused_attention=False, inject_fused_mlp=False, #**kwargs ) #model.warmup_triton() print(model) print("model is loaded.") model.resize_token_embeddings(32008) peft_config = GPTQLoraConfig( lora_alpha=lora_alpha, lora_dropout=lora_dropout, r=lora_r, bias="none", task_type="CAUSAL_LM", target_modules = [ "q_proj",...
  Hello, I am so impressed by your models. I tried fine tuning your models with my data and the evaulation_loss is not optimized as shown in the image...
Hello, I am struggling to download fused_adam pre build of deepspeed. I found nothing that solve my problem. Here are the situations. ``` DS_BUILD_FUSED_ADAM=1 pip install deepspeed ds_report ``` still...
Hi, I found interesting bug(maybe I could be wrong) that is in from_pretrained. below are the code that i produce my bug. ``` model = T5ForConditionalGeneration.from_pretrained( model, local_files_only=True ) model.to(device)...
### System Info - `transformers` version: 4.40.0 - Platform: Linux-4.18.0-425.3.1.el8.x86_64-x86_64-with-glibc2.28 - Python version: 3.9.0 - Huggingface_hub version: 0.20.1 - Safetensors version: 0.4.1 - Accelerate version: 0.30.1 - Accelerate config: not...
https://github.com/huggingface/transformers/issues/8739#issuecomment-732412022 Can you update the name of arguments according to newer transformers library? Thank you for reading this issue :)
When I use deepeval with massive data without web application, It is difficult to track every results from the printed logs. To address this, I hope to add file writing...
I add my resource paper about knowledgegraph-text pair ```Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model``` (https://arxiv.org/abs/2409.07088) is introducing the largest, high quality knowledge graph-to-text pair dataset...