Scribe-Data icon indicating copy to clipboard operation
Scribe-Data copied to clipboard

Expand Tajik data queries

Open andrewtavis opened this issue 1 year ago • 4 comments

Terms

Description

This issue would look into expanding the src/scribe_data/language_data_extraction/Tajik files with as much data as are possible from the current data on Wikidata. We can use code for getting data from other languages, and from there we can check Tajik data on Wikidata for what conjugations are available. We can then expand the query with optional selections of certain forms as is done in other SPARQL queries. The query can be tried on the Wikidata Query Service UI during development :)

Data types to include:

  • [x] Nouns
  • [ ] Verbs
  • [ ] Adjectives
  • [ ] Adverbs
  • [ ] Prepositions
  • [ ] Emoji keywords

Contribution

Happy to review a PR when one's open and support with answers to any questions! 😊

andrewtavis avatar Oct 03 '24 22:10 andrewtavis

Hello sir @andrewtavis, i would like to work on this issue please.

marcndo avatar Oct 04 '24 14:10 marcndo

Thanks for your interest, @marcndo! Let us know if we can help :)

andrewtavis avatar Oct 04 '24 17:10 andrewtavis

Sure, i'm currently exploiting resources to understand the technology. If need arise i would definitely reachout. Thanks for the willingness to help me.

marcndo avatar Oct 04 '24 18:10 marcndo

Just added a list of data types that we want to include to this issue :) Have marked those that are already done or have PRs open, and we can work on the others 😊 If the data type can't work, then we can move to the others and open up specific issues later :)

andrewtavis avatar Oct 09 '24 08:10 andrewtavis

hey @andrewtavis can I also work on this? @marcndo Can we work on this together? I'd like to help:)

VNW22 avatar Oct 14 '24 11:10 VNW22

Sounds good, @VNW22 :) @marcndo, what queries would you like to open a PR for here? @VNW22, as with the other one maybe start with prepositions and emoji keywords for now :)

andrewtavis avatar Oct 14 '24 11:10 andrewtavis

Okay, working on it :)

VNW22 avatar Oct 14 '24 11:10 VNW22

@andrewtavis the preposition query is returning nothing, do i just continue to post as it is or what is it that I am missing?

VNW22 avatar Oct 14 '24 14:10 VNW22

I was looking into the Russian preposition query for insight. It seems like the method used a python script that reads the queried data from a file, maps prepositions to their grammatical cases and handles duplicates by concatenating cases. Do you think I should follow a similar approach for my queries?

VNW22 avatar Oct 14 '24 15:10 VNW22

Don't look at the Python formatting at this point, @VNW22 :) It's all going to change soon. If the query isn't returning anything, then at this point just send it in a PR and we can see. Maybe there's just no data right now.

andrewtavis avatar Oct 14 '24 16:10 andrewtavis

Got it, thanks! 😊 I’ll go ahead and submit the PR.

VNW22 avatar Oct 14 '24 16:10 VNW22

@marcndo 👋 Let us know if there's anything we can do to support you with the remainder of the queries :) If you'd prefer @VNW22 to finish them, then that's also ok 😊

andrewtavis avatar Oct 14 '24 17:10 andrewtavis

This is all closed up :) Thanks all for the hard work here!

andrewtavis avatar Oct 22 '24 23:10 andrewtavis