Scribe-Data icon indicating copy to clipboard operation
Scribe-Data copied to clipboard

Create Punjabi language queries

Open SethiShreya opened this issue 1 year ago • 9 comments

Terms

Description

This issue would look into expanding the src/scribe_data/language_data_extraction/Punjabi files with as much data as are possible from the current data on Wikidata. We can use code for getting data from other languages, and from there we can check Punjabi data on Wikidata for what conjugations are available. We can then expand the query with optional selections of certain forms as is done in other SPARQL queries. The query can be tried on the Wikidata Query Service UI during development :)

Data types to include:

  • [x] Nouns
  • [x] Verbs
  • [ ] Adjectives
  • [ ] Adverbs
  • [ ] Prepositions
  • [ ] Emoji keywords

Contribution

I will start with creating nouns, gender and verb queries for Punjabi and then will take it further to add conjugations and other stuffs.

SethiShreya avatar Oct 07 '24 18:10 SethiShreya

Thanks for the issue, @SethiShreya! :)

andrewtavis avatar Oct 07 '24 19:10 andrewtavis

Added Sparql query for Punjabi language nouns, so when I was searching I found while extracting the nouns for punjabi language, it was extracting for Gurmukhi (Punjabi language for India) and Shahmukhi (Punjabi language for Muslims), So I added filter only to get Gurmukhi with tag ("pa"). The query also included the Gender, plural and singular nouns for Gurmukhi. I have tested it on website and in the code too.

@andrewtavis Please review and give your feedbacks

SethiShreya avatar Oct 08 '24 15:10 SethiShreya

Hey @SethiShreya 👋 Interesting to learn all of this. Recently we've been discussing moving some languages to a common sub directory so it's a bit more organized, so would you say that we could have a Punjabi directory and have Gurmukhi and Shahmukhi sub directories? The same can be seen now in Norwegian :) Let me know. I'll review the current PR as is :)

andrewtavis avatar Oct 08 '24 23:10 andrewtavis

Hey @SethiShreya 👋 Interesting to learn all of this. Recently we've been discussing moving some languages to a common sub directory so it's a bit more organized, so would you say that we could have a Punjabi directory and have Gurmukhi and Shahmukhi sub directories? The same can be seen now in Norwegian :) Let me know. I'll review the current PR as is :)

That would be good, so directory structure will be

Punjabi -----Gurmukhi -----Shahmukhi

Right? Then i guess we would need to change code too to run these sub directory queries too i guess

SethiShreya avatar Oct 09 '24 04:10 SethiShreya

Yes, exactly. We'll do this maybe in a week as the new directories can't be run anyway :) Feel free to change the structure in your next PR!

andrewtavis avatar Oct 09 '24 07:10 andrewtavis

Just added a list of data types that we want to include to this issue :) We can maybe do all of this for Gurmukhi and Shahmukhi, @SethiShreya 😊

andrewtavis avatar Oct 09 '24 08:10 andrewtavis

Thanks @andrewtavis, sure i will work on those data types and on the directory structure too.

SethiShreya avatar Oct 09 '24 08:10 SethiShreya

Sounds great, @SethiShreya! :)

andrewtavis avatar Oct 09 '24 09:10 andrewtavis

@andrewtavis i have bifurcated directory to Gurmukhi and Shahmukhi added verb query and fixed directory structure to call these too. Raised a PR here, please check once. Checked it on my end.

please review and give your feedback, Thanks :)

SethiShreya avatar Oct 12 '24 11:10 SethiShreya

Sent along the base versions of the rest of the queries in f14a335, @SethiShreya :) Let me know if you'd like to expand them with their forms :)

andrewtavis avatar Oct 24 '24 12:10 andrewtavis

Sure :)

SethiShreya avatar Oct 29 '24 13:10 SethiShreya

Hi, anyone working on this?

GicharuElvis avatar Feb 09 '25 17:02 GicharuElvis

Hey @GicharuElvis 👋 Appreciate your interest in the project :) We'll be using new functionality to auto-generate these queries soon, so it will be closed in the coming days. Please let us know if there's anything else you find interesting to work on!

andrewtavis avatar Feb 09 '25 17:02 andrewtavis

sure, if its possible i'd also like to help in the automation.

GicharuElvis avatar Feb 20 '25 10:02 GicharuElvis

Would be great, @GicharuElvis! Automation will likely be happening over at Scribe-Server, which will be running Scribe-Data :) Do you have skills/interest in Go? We could maybe find an issue over there :)

andrewtavis avatar Feb 20 '25 17:02 andrewtavis

Closing this as work here will continue in #513 where we'll be autogenerating the queries :) Thanks all for the work here, and hope all are well! 😊

andrewtavis avatar Mar 09 '25 16:03 andrewtavis