ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Hi, I have another question about the model-related configuration settings during batch inference after model fine-tuning. In the inference_batch.py script for LTU-AS provided below: ``` def main( load_8bit:...
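As context for the `load_8bit` flag, here is a minimal, generic sketch of 8-bit model loading with Hugging Face Transformers and bitsandbytes; this is not the repo's inference_batch.py, and the checkpoint path is a placeholder:
```
# Generic sketch of 8-bit loading with bitsandbytes; not the repo's inference_batch.py.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "path/to/llama-checkpoint"  # placeholder path
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint,
    load_in_8bit=True,          # quantize weights to 8-bit (requires bitsandbytes)
    torch_dtype=torch.float16,  # keep non-quantized tensors in fp16
    device_map="auto",          # place layers across available GPUs automatically
)
model.eval()
```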
Hello, thank you so much for sharing the code. Great work on the repo!! I am trying to run the code for LTU OpenAQA, and I've completed the first 3 stages...
Hi, I have a question about LTU-AS fine-tuning. I saw that the model used in [finetune.py](https://github.com/YuanGongND/ltu/blob/6869e4780d332b5758662091bad1c69daa572ca9/src/ltu_as/finetune.py) is trained only with `LlamaForCausalLM`. However, since there are many classification downstream tasks...
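One common workaround, sketched below under my own assumptions (the label set and the `generate_answer` call are hypothetical, not the repo's evaluation code), is to keep the `LlamaForCausalLM` head and map its free-form answer onto a closed label set:
```
# Hypothetical sketch: map a causal LM's free-form answer onto a fixed label set.
# `generate_answer` stands in for the repo's inference call and is assumed here.
from difflib import SequenceMatcher

LABELS = ["dog barking", "speech", "music", "siren"]  # example closed label set

def classify_from_text(answer, labels=LABELS):
    """Pick the label whose text overlaps most with the generated answer."""
    answer = answer.lower()
    return max(labels, key=lambda lab: SequenceMatcher(None, answer, lab).ratio())

# Usage (the answer would come from the fine-tuned LlamaForCausalLM):
# answer = generate_answer(audio_path, prompt="What is the sound?")
print(classify_from_text("The audio contains a dog barking loudly."))  # -> "dog barking"
```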
Hi, I have a question about LTU-AS multi-GPU training: may I kindly ask whether this repo supports multi-GPU training? I didn't see any related configurations (e.g., accelerate, deepspeed). Thank...
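For reference, here is a minimal generic sketch of data-parallel training with PyTorch DistributedDataParallel, typically launched as `torchrun --nproc_per_node=<num_gpus> train_ddp.py`; the model, data, and script name are placeholders, not the repo's finetune.py:
```
# Generic DDP sketch (placeholder model/data, not the repo's finetune.py).
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset, DistributedSampler

def main():
    dist.init_process_group("nccl")                  # one process per GPU via torchrun
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(16, 2).cuda(local_rank)  # placeholder model
    model = DDP(model, device_ids=[local_rank])

    data = TensorDataset(torch.randn(256, 16), torch.randint(0, 2, (256,)))
    sampler = DistributedSampler(data)               # shards data across ranks
    loader = DataLoader(data, batch_size=32, sampler=sampler)

    opt = torch.optim.AdamW(model.parameters(), lr=1e-3)
    loss_fn = torch.nn.CrossEntropyLoss()
    for epoch in range(2):
        sampler.set_epoch(epoch)
        for x, y in loader:
            x, y = x.cuda(local_rank), y.cuda(local_rank)
            opt.zero_grad()
            loss_fn(model(x), y).backward()          # DDP all-reduces gradients
            opt.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```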
Hi, I have a question about the base model for FT and training stage 1. I saw that the base model for FT is `ltuas_long_noqa_a6.bin`, which is only 187 MB, and...
Hello, thank you for providing such a good research idea on audio question answering. I have some questions about LTU-AS: 1. For the ASR task, during inference (refer to...
In the LTU paper you say you will distribute the dataset after the peer review process. I noticed that you have been accepted to ASRU 2023 for your LTU-AS paper...
Hello, I would like to ask the following 2 questions: 1. Is there any shell script to run extract_whisper_feature.py? I don't know what the parameters of...
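As a rough reference only, here is a generic sketch of extracting Whisper encoder features with Hugging Face Transformers; the repo's extract_whisper_feature.py may use the original openai-whisper package and different parameters, and the audio path below is a placeholder:
```
# Generic sketch of Whisper encoder feature extraction; not the repo's script.
import torch
import torchaudio
from transformers import WhisperFeatureExtractor, WhisperModel

feature_extractor = WhisperFeatureExtractor.from_pretrained("openai/whisper-base")
model = WhisperModel.from_pretrained("openai/whisper-base").eval()

wav, sr = torchaudio.load("example.wav")                      # placeholder file
wav = torchaudio.functional.resample(wav, sr, 16000).mean(0)  # mono, 16 kHz

inputs = feature_extractor(wav.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    enc_out = model.encoder(inputs.input_features).last_hidden_state
print(enc_out.shape)  # (1, 1500, hidden_size) for a 30 s padded window
```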
The audio tokenization step (analogous to the text `input_ids`) seems to be missing in both finetune.py and finetune_low_resource.py of the LTU repo. Where is the detailed code for audio tokenization? I saw the 'load_audio()'...
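For context, the audio branch is usually handled as continuous spectrogram features rather than discrete text-style tokens; below is my guess at what a `load_audio()`-style helper computes (a kaldi-style log-mel filterbank), not the repo's exact code:
```
# Sketch of kaldi-style log-mel filterbank extraction; an assumption about what a
# load_audio()-style helper does, not the repo's exact implementation.
import torch
import torchaudio

def load_audio_fbank(path, target_frames=1024):
    wav, sr = torchaudio.load(path)
    wav = torchaudio.functional.resample(wav, sr, 16000)
    fbank = torchaudio.compliance.kaldi.fbank(
        wav, htk_compat=True, sample_frequency=16000, use_energy=False,
        window_type="hanning", num_mel_bins=128, dither=0.0, frame_shift=10)
    # Pad or truncate to a fixed number of frames so batches stack cleanly.
    n = fbank.shape[0]
    if n < target_frames:
        fbank = torch.nn.functional.pad(fbank, (0, 0, 0, target_frames - n))
    else:
        fbank = fbank[:target_frames]
    return fbank  # shape: (target_frames, 128)
```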
Hello, I would like to ask how you test audio in the LibriSpeech dataset that exceeds 10 seconds in duration. I'm encountering an issue while using the LibriSpeech dataset...
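One possible way to handle clips longer than the fixed 10-second input, sketched here as an assumption rather than the authors' evaluation setup, is to split the waveform into 10-second chunks and run each chunk separately:
```
# Hypothetical sketch: split a long LibriSpeech clip into 10-second chunks so each
# chunk fits a fixed-length audio input; not necessarily the authors' setup.
import torch
import torchaudio

def split_into_chunks(path, chunk_sec=10.0, sr_target=16000):
    wav, sr = torchaudio.load(path)
    wav = torchaudio.functional.resample(wav, sr, sr_target).mean(0)
    chunk_len = int(chunk_sec * sr_target)
    chunks = [wav[i:i + chunk_len] for i in range(0, wav.numel(), chunk_len)]
    # Zero-pad the last chunk so every segment has the same length.
    chunks[-1] = torch.nn.functional.pad(chunks[-1], (0, chunk_len - chunks[-1].numel()))
    return chunks

# Each chunk can then be transcribed separately and the ASR outputs concatenated.
```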