DL-Simplified icon indicating copy to clipboard operation
DL-Simplified copied to clipboard

Hindi Voice Assistant

Open Amarta113 opened this issue 1 year ago • 11 comments
trafficstars

Deep Learning Simplified Repository (Proposing new issue)

:red_circle: Project Title : Hindi Voice Assistant :red_circle: Aim : To develop a chatbot that takes input in hindi audio and give output in hindi text and audio :red_circle: Dataset : :red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


📍 Follow the Guidelines to Contribute in the Project :

  • You need to create a separate folder named as the Project Title.
  • Inside that folder, there will be four main components.
    • Images - To store the required images.
    • Dataset - To store the dataset or, information/source about the dataset.
    • Model - To store the machine learning model you've created using the dataset.
    • requirements.txt - This file will contain the required packages/libraries to run the project in other machines.
  • Inside the Model folder, the README.md file must be filled up properly, with proper visualizations and conclusions.

:red_circle::yellow_circle: Points to Note :

  • The issues will be assigned on a first come first serve basis, 1 Issue == 1 PR.
  • "Issue Title" and "PR Title should be the same. Include issue number along with it.
  • Follow Contributing Guidelines & Code of Conduct before start Contributing.

:white_check_mark: To be Mentioned while taking the issue :

  • Full name : Amarta Waghani
  • GitHub Profile Link : https://github.com/Amarta113
  • Email ID :
  • Participant ID (if applicable):
  • Approach for this Project : Speech to text - Search - Text to speech
  • What is your participant role? GSSOC Contributor

Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

Amarta113 avatar Aug 05 '24 13:08 Amarta113

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

github-actions[bot] avatar Aug 05 '24 13:08 github-actions[bot]

Are you using any dataset for this project? If yes, please share it. Also what are the deep learning models you are planning to implement here for this problem statement?

abhisheks008 avatar Aug 06 '24 08:08 abhisheks008

@abhisheks008 I had not used any dataset. I used "whisper-large-v3" model for speech to text and "llama3-8b-8192" model to answer queries with groq API and gtts library for text to speech.

Amarta113 avatar Aug 06 '24 13:08 Amarta113

Any other models which will compete the whisper architecture?

abhisheks008 avatar Aug 07 '24 06:08 abhisheks008

I used API therefore only whisper was available. Here is the documentation link

Amarta113 avatar Aug 07 '24 16:08 Amarta113

It is streamlit application

Amarta113 avatar Aug 07 '24 16:08 Amarta113

Only implementing whisper will not qualify for this repository. You need to come up with at least 3 models for any problem statement. @Amarta113

abhisheks008 avatar Aug 09 '24 06:08 abhisheks008

@abhisheks008 I also used llama3 for q&a and for text to speech I used google tts library which contains two algorithms Tacotron and WaveNet .

Amarta113 avatar Aug 09 '24 12:08 Amarta113

@abhisheks008 I also used llama3 for q&a and for text to speech I used google tts library which contains two algorithms Tacotron and WaveNet .

Cool I am assigning this issue to you. Make sure you complete it by today 6 PM IST.

abhisheks008 avatar Aug 10 '24 03:08 abhisheks008

Can you please assign this issue to me under 𝗚𝗦𝗦𝗼𝗖 '𝟮𝟰 𝗘𝘅𝘁𝗲𝗻𝗱𝗲𝗱, Hacktoberfest-accepted

Abhiiesante avatar Oct 23 '24 04:10 Abhiiesante

Can you please assign this issue to me under 𝗚𝗦𝗦𝗼𝗖 '𝟮𝟰 𝗘𝘅𝘁𝗲𝗻𝗱𝗲𝗱, Hacktoberfest-accepted

Can you please share your approach and thoughts for solving this issue?

abhisheks008 avatar Oct 24 '24 11:10 abhisheks008