Scribe-Data
Scribe-Data copied to clipboard
Expand Tajik data queries
Terms
- [X] I have searched open and closed feature requests
- [X] I agree to follow Scribe-Data's Code of Conduct
Description
This issue would look into expanding the src/scribe_data/language_data_extraction/Tajik files with as much data as are possible from the current data on Wikidata. We can use code for getting data from other languages, and from there we can check Tajik data on Wikidata for what conjugations are available. We can then expand the query with optional selections of certain forms as is done in other SPARQL queries. The query can be tried on the Wikidata Query Service UI during development :)
Data types to include:
- [x] Nouns
- [ ] Verbs
- [ ] Adjectives
- [ ] Adverbs
- [ ] Prepositions
- [ ] Emoji keywords
Contribution
Happy to review a PR when one's open and support with answers to any questions! 😊
Hello sir @andrewtavis, i would like to work on this issue please.
Thanks for your interest, @marcndo! Let us know if we can help :)
Sure, i'm currently exploiting resources to understand the technology. If need arise i would definitely reachout. Thanks for the willingness to help me.
Just added a list of data types that we want to include to this issue :) Have marked those that are already done or have PRs open, and we can work on the others 😊 If the data type can't work, then we can move to the others and open up specific issues later :)
hey @andrewtavis can I also work on this? @marcndo Can we work on this together? I'd like to help:)
Sounds good, @VNW22 :) @marcndo, what queries would you like to open a PR for here? @VNW22, as with the other one maybe start with prepositions and emoji keywords for now :)
Okay, working on it :)
@andrewtavis the preposition query is returning nothing, do i just continue to post as it is or what is it that I am missing?
I was looking into the Russian preposition query for insight. It seems like the method used a python script that reads the queried data from a file, maps prepositions to their grammatical cases and handles duplicates by concatenating cases. Do you think I should follow a similar approach for my queries?
Don't look at the Python formatting at this point, @VNW22 :) It's all going to change soon. If the query isn't returning anything, then at this point just send it in a PR and we can see. Maybe there's just no data right now.
Got it, thanks! 😊 I’ll go ahead and submit the PR.
@marcndo 👋 Let us know if there's anything we can do to support you with the remainder of the queries :) If you'd prefer @VNW22 to finish them, then that's also ok 😊
This is all closed up :) Thanks all for the hard work here!