dingdongwang
dingdongwang
Hi, I have encountered the error when I run the [stage1_proj_cla.sh](https://github.com/YuanGongND/ltu/blob/main/src/ltu_as/train_scripts/stage1_proj_cla.sh), both the `base_model` and `data_path` are keep the same, and I also change the script to finetune_low_resource.py with smaller...
Hi, may I ask what the maximum allowable length is for audio input? Would a 1-minute WAV file be within the acceptable range? Thank you!
Hi, I have another question about the model related configuration settings during batch inference after model fine tuning. In the inference_batch.py script for LTU-AS provided below: ``` def main( load_8bit:...
Hi, I have a question about the LTU-AS FT. I saw the model used in [finetune.py](https://github.com/YuanGongND/ltu/blob/6869e4780d332b5758662091bad1c69daa572ca9/src/ltu_as/finetune.py) is only trained based on `LlamaForCausalLM`. However, since there has many classification downstream tasks...
Hi, I have a question about LTU-AS multi-GPU training, may I kindly ask if this repo support multiple GPU training? Since I didn't saw related configures (e.g. accelerate, deepspeed). Thank...
Hi, I have a question about the base model for ft and training stage 1. Since I saw the base model for FT is `ltuas_long_noqa_a6.bin`, which is only 187MB, and...
Hello, thank you for providing such a good idea of research on audio question answering. I have some questions about the LTU_AS: 1. For ASR task. During inference period(refer to...
Hello, I would like to ask about the following 2 questions: 1. If there if any shell scipt to run extract_whisper_feature.py? since I don't know what is the parameters of...
It seems missing the tokenize the audio (from 'input_ids') step both in finetune.py/finetune_low_resource.py of the LTU repo. Where is the detailed coding step for audio tokenization? I saw the 'load_audio()'...