ast issues

Ask for help

1

Hello, I have learned from the example of extracting features from speech using the AST model. I mimicked this example to extract features from new speech using my own model,...

Ingram-lin

Inquiry Regarding Audio Spectrogram Transformer

2

I am a graduate student from China, and our team recently had the privilege of studying your article on the 'Audio Spectrogram Transformer'. We were truly impressed by the content...

Ingram-lin

question

self-contained Google Colab script error

2

moon-aver

bug

training MAP

2

Hi, @YuanGongND, I have trained ast with audioset for 36000 Iterations， the validate mAP is 0.029， is this right？Looking forward to your reply.

maxwZJU

reproduction

One question regarding the linear projection of AST.

1

Dear Minister Gong, I wanted to express my gratitude for your work on AST; it has truly been an inspiration to me. I can confidently say that AST has served...

poult-lab

question

AssertionError: choose a window size 400 that is [2, 1]

2

I try to use the feature extractor on my audiofiles. My audio files are all 16000Hz and 5 seconds long. The `waveform.shape[1]` is 80000 ```python input_values = feature_extractor(waveform, sampling_rate=16000, return_tensors="pt").input_values...

GrafKnusprig

bug

Discrepancy in Model Performance Using HuggingFace Pipeline Utility

5

Hi I'm attempting to reproduce the performance metrics of models using HuggingFace's Pipeline utility, but I'm encountering different results. Below is the Python code I used for testing: ```python import...

penguinwang96825

reproduction

some questions when reproducing your results

2

Hello Yuan,I'm delighted to read your paper and reproduce your work.And I encounter some problems. When empolying the audioset_pretrain,why does the stride as same as the patch_size(overlap == 0). In...

ben100118

reproduction

csv error

1

When running, the following error occurs and the number of training sessions is limited to 50. Is it necessary to save csv? I am wondering if there is a way...

mooncv

bug

ast
ast copied to clipboard

Metadata

Ask for help

Inquiry Regarding Audio Spectrogram Transformer

self-contained Google Colab script error

training MAP

One question regarding the linear projection of AST.

AssertionError: choose a window size 400 that is [2, 1]

Discrepancy in Model Performance Using HuggingFace Pipeline Utility

some questions when reproducing your results

csv error

← Metadata

Owner

Metadata

ast ast copied to clipboard

Metadata

← Metadata

Owner

Metadata

ast
ast copied to clipboard