media-annotator

Published 20 hours ago


Web-based annotation tool for media data. The easiest way to create your own media dataset.

Media-annotator

Web-based annotation tool for media data.

Features

Upload selected audio files from a directory. Currently, only .wav and .mp3 files are supported.
Manual and automatic transcription for 20+ languages and dialects via Vosk

ℹ️ Auto annotation uses only the first channel of an .mp3 file.
Export speech regions as a zip archive of .wav files plus regions.csv or regions.json
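
The first-channel note above can be illustrated with a short sketch. This is not the project's own code: it assumes the audio has already been decoded to interleaved signed 16-bit PCM (the standard library cannot decode .mp3), and `first_channel` is a hypothetical helper name used only for illustration.

```python
from array import array


def first_channel(pcm: bytes, n_channels: int) -> bytes:
    """Return channel 0 of interleaved signed 16-bit PCM audio.

    Hypothetical helper: assumes the input is already decoded PCM
    in native byte order, not a raw .mp3 stream.
    """
    if n_channels < 1:
        raise ValueError("n_channels must be at least 1")
    samples = array("h")    # signed 16-bit samples
    samples.frombytes(pcm)  # raises ValueError on a partial sample
    # Interleaved layout is L0 R0 L1 R1 ..., so every n-th sample
    # starting at index 0 belongs to the first channel.
    return samples[::n_channels].tobytes()
```

For a stereo recording this keeps the left channel and discards the right one, so speech that is present only in the second channel would not be transcribed automatically.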

How to use

Run the app with Docker Compose

docker-compose up -d
Open the app page in your browser

How to debug

Install conda, miniconda, or micromamba, plus Node.js and npm

Clone the repository, create the Python environment with conda, and activate it

git clone https://github.com/ruslantau/media-annotator
cd media-annotator
conda env create -f backend/environment.yaml
conda activate annotator

Run the FastAPI backend
```
python backend/main.py
```

Install dependencies, then build and run the Nuxt frontend

cd frontend
npm install
npm run build
npm run start

Open the app page in your browser
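
Before opening the page, it can help to confirm that the server is actually answering. The following is a minimal sketch using only the Python standard library; `wait_until_up` is a hypothetical helper, and the address to poll is an assumption, since this README does not state which host or port the backend or frontend listens on.

```python
import time
import urllib.error
import urllib.request


def wait_until_up(url: str, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Poll `url` until it returns any HTTP response or `timeout` elapses."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=interval + 1):
                return True
        except urllib.error.HTTPError:
            # The server answered, just not with a 2xx status; it is up.
            return True
        except (urllib.error.URLError, OSError):
            # Connection refused or timed out; wait and try again.
            time.sleep(interval)
    return False
```

Call it with the address your setup exposes, for example `wait_until_up("http://localhost:<port>/")`, substituting the port configured in your environment.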

TODO

[x] add docker images and setup CI/CD
[ ] extend the list of supported formats (mp4, flac, avi, etc.)
[ ] running auto annotation on selected region
[ ] adding punctuation
[ ] speaker diarisation

About

Topics: audio, asr, annotation, audio-processing, auto-annotation, media-annotation
