wav2graph
Information Retrieval from Audio via Knowledge Graph
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech
If you find this work helpful, please star ⭐ the repository and/or cite the paper.

This repository contains the codebase for the wav2graph paper:
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech
https://www.arxiv.org/abs/2408.04174
Project Overview
In the wav2graph paper, we introduce the first framework for supervised learning of knowledge graphs from speech data. This repository provides the scripts, configurations, and setup instructions needed to reproduce the experiments reported in the paper.
Setup
To set up the environment and run the experiments, follow the steps below:
1. Create a Virtual Environment
Before you start, create a Python virtual environment and install the required dependencies.
pip install -r requirements.txt
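The command above covers only the install step. A minimal sketch of the full sequence follows, assuming a POSIX shell and `python3` on the PATH; the environment directory name `.venv` is an assumption, not something this repository prescribes.

```shell
# Create an isolated environment in ./.venv (the directory name is an assumption).
python3 -m venv .venv

# Activate it for the current shell session (POSIX shells; Windows uses a different script).
. .venv/bin/activate

# Install the project's dependencies into the environment, if the file is present.
if [ -f requirements.txt ]; then
    python -m pip install -r requirements.txt
fi
```

Run this from the repository root so that `requirements.txt` is found. Activation lasts only for the current shell session, so repeat the `. .venv/bin/activate` line in any new terminal before running the experiments.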
2. Configure Hugging Face Token
You will need a Hugging Face API token to access certain resources used in this project. Insert your Hugging Face token into the relevant YAML configuration files.
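This README does not name the YAML files or the field that holds the token. One way to locate candidates, sketched here under the assumption that the field name contains the word "token", is to search the configuration files from the repository root:

```shell
# Search YAML files under the current directory for lines mentioning "token"
# (case-insensitive). The pattern is a guess at the field name; adjust it if needed.
matches=$(grep -rni 'token' --include='*.yaml' --include='*.yml' . || true)

# Print the matching file:line entries, or a hint if nothing matched.
printf '%s\n' "${matches:-No token fields found; inspect the YAML files manually.}"
```

Each output line has the form `path:line:content`, which points to where the token value may need to be filled in.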
3. Run the Experiments
Once the environment is set up and the configurations are complete, you can run the experiments using the provided script.
sh run.sh
Cite our work
@misc{leduc2024wav2graphframeworksupervisedlearning,
title={wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech},
author={Khai Le-Duc and Quy-Anh Dang and Tan-Hanh Pham and Truong-Son Hy},
year={2024},
eprint={2408.04174},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2408.04174},
}
Contact
Core developers:
Khai Le-Duc
University of Toronto, Canada
Email: [email protected]
GitHub: https://github.com/leduckhai
Quy-Anh Dang
VNU University of Science, Vietnam
GitHub: https://github.com/QuyAnh2005
Facebook: https://www.facebook.com/anh.q.dang.5