andreaKim issues

Results 9 issues of andreaKim

I cannot reproduce the finetuning procedure by following README

Hello, I am trying to test my data in the Control Prefix manner. While testing, I always encounter an error like the one below.
```
Validation sanity check: 0it [00:00, ?it/s] Validation...
```

why "target_modules" does not recognize any parameters?

```
model = AutoGPTQForCausalLM.from_quantized(
    model_name,
    #use_triton=True,
    #warmup_triton=False,
    trainable=True,
    inject_fused_attention=False,
    inject_fused_mlp=False,
    #**kwargs
)
#model.warmup_triton()
print(model)
print("model is loaded.")
model.resize_token_embeddings(32008)
peft_config = GPTQLoraConfig(
    lora_alpha=lora_alpha,
    lora_dropout=lora_dropout,
    r=lora_r,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules = [
        "q_proj",...
```
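When `target_modules` appears to match nothing, one way to narrow it down is to compare the requested names against the module names the loaded model actually exposes. The sketch below assumes a PyTorch-style `named_modules()` iterator and a suffix-matching rule (a target matches a module whose full name equals it or ends with `.` plus it), which is my understanding of how PEFT treats a list of targets. The helper names `matches` and `unmatched_targets` are hypothetical and are not part of PEFT or AutoGPTQ.

```python
def matches(module_name, target):
    """Assumed matching rule: exact name, or a dotted suffix of the name."""
    return module_name == target or module_name.endswith("." + target)


def unmatched_targets(model, target_modules):
    """Return the requested targets that match no module name in `model`.

    `model` only needs a PyTorch-style `named_modules()` method that
    yields (name, module) pairs.
    """
    names = [name for name, _module in model.named_modules() if name]
    return [
        target
        for target in target_modules
        if not any(matches(name, target) for name in names)
    ]
```

Calling `unmatched_targets(model, ["q_proj", "k_proj"])` right after `print(model)` would show which entries have no counterpart. Quantized wrappers may nest or rename layers, so a non-empty result points at a naming mismatch rather than at the LoRA settings.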

Is there available SFT fine tuning for zephyr-7B families?

![image](https://github.com/huggingface/alignment-handbook/assets/40377750/d664a446-88cc-408a-b8f0-73595902a7d9) ![image](https://github.com/huggingface/alignment-handbook/assets/40377750/705aa2a0-d489-4fbd-9923-d2a8ee31bd3d) Hello, I am very impressed by your models. I tried fine-tuning your models on my data, and the evaluation_loss is not optimized, as shown in the image...

Failed to install Fused_adam op on CPU

Hello, I am struggling to download the fused_adam pre-build of deepspeed. I have found nothing that solves my problem. Here is the situation.
```
DS_BUILD_FUSED_ADAM=1 pip install deepspeed
ds_report
```
still...

build

"from_pretrained" read wrong config file. not "tokenizer_config.json", but "config.json"

Hi, I found an interesting bug (though I could be wrong) in from_pretrained. Below is the code that reproduces the bug.
```
model = T5ForConditionalGeneration.from_pretrained(
    model,
    local_files_only=True
)
model.to(device)...
```

Autotokenizer."from_pretrained" read wrong config file. not "tokenizer_config.json", but "config.json"

### System Info
- `transformers` version: 4.40.0
- Platform: Linux-4.18.0-425.3.1.el8.x86_64-x86_64-with-glibc2.28
- Python version: 3.9.0
- Huggingface_hub version: 0.20.1
- Safetensors version: 0.4.1
- Accelerate version: 0.30.1
- Accelerate config: not...
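A quick way to see why a loader might consult `config.json` for a tokenizer is to check what the local directory actually contains. The sketch below is a diagnostic only and rests on an assumption: as far as I know, `AutoTokenizer` looks for a `tokenizer_class` entry in `tokenizer_config.json` and falls back to the model config when that entry is absent. The function name `inspect_tokenizer_files` is hypothetical.

```python
import json
from pathlib import Path


def inspect_tokenizer_files(model_dir):
    """Report, per config file, the declared tokenizer class (if any).

    Each value is the `tokenizer_class` string, "no tokenizer_class key",
    or "missing" when the file does not exist in `model_dir`.
    """
    report = {}
    for filename in ("tokenizer_config.json", "config.json"):
        path = Path(model_dir) / filename
        if not path.is_file():
            report[filename] = "missing"
            continue
        data = json.loads(path.read_text(encoding="utf-8"))
        report[filename] = data.get("tokenizer_class") or "no tokenizer_class key"
    return report
```

If `tokenizer_config.json` is reported as missing or without a `tokenizer_class`, a read of `config.json` would be the expected fallback under the assumption above rather than a wrong file being opened.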

AttributeError: 'RobertaTokenizerFast' object has no attribute 'max_len'

https://github.com/huggingface/transformers/issues/8739#issuecomment-732412022 Can you update the argument names to match the newer transformers library? Thank you for reading this issue :)
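For context, newer transformers releases expose the tokenizer length limit as `model_max_length`, while older code reads `max_len`. A minimal compatibility sketch for code that must run against either version is shown below; the helper name `tokenizer_max_length` and the fallback value of 512 are assumptions made for illustration, not part of the library.

```python
def tokenizer_max_length(tokenizer, default=512):
    """Return the tokenizer's maximum length across library versions.

    Prefers the newer `model_max_length` attribute, falls back to the
    older `max_len`, and finally to `default` (a hypothetical value).
    """
    for attr in ("model_max_length", "max_len"):
        value = getattr(tokenizer, attr, None)
        if value is not None:
            return value
    return default
```

Replacing direct `tokenizer.max_len` reads with `tokenizer_max_length(tokenizer)` avoids the `AttributeError`. Once older versions no longer need support, reading `tokenizer.model_max_length` directly is simpler.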

add write utils

When I use deepeval with massive data and without the web application, it is difficult to track every result from the printed logs. To address this, I hope to add file writing...

add dataset-wikiofgraph

I am adding my resource paper on knowledge graph-text pairs, ```Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model``` (https://arxiv.org/abs/2409.07088), which introduces the largest high-quality knowledge graph-to-text pair dataset...