Dan Ofer comments

Results: 62 comments of Dan Ofer

Issue while generating pre training data

Would it be possible to provide an example of how to run the vocab/tokenizing in advance on this data, including the expected output sentencepiece vocab?

trouble getting started

The problem is that no features are extracted (not sure why). Have you tried extracting features using the "file" vs. "dir" option? I'll be uploading an update in the next...

trouble getting started

What OS are you using? Try using the absolute file path. The update has been implemented.

trouble getting started

Also, the tail command outputs only the last lines of a file; it could have messed up the FASTA-formatted files. https://en.wikipedia.org/wiki/Tail_(Unix) On Wed, Sep 9, 2015 at 4:08 PM, cnjr2 [email protected] wrote: >...
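A quick way to see whether a file was cut mid-record (for example by tail) is to check that it starts with a ">" header and that every header is followed by sequence data. The sketch below is only an illustration: `check_fasta` is a hypothetical helper, not part of the tool discussed in this thread, and it assumes plain single- or multi-line FASTA text.

```python
def check_fasta(text):
    """Return a list of problems found in FASTA-formatted text (empty if none).

    Hypothetical helper for illustration; not part of the project's code.
    """
    # Ignore blank lines and surrounding whitespace.
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        return ["file is empty"]

    problems = []
    # A file truncated with tail often starts in the middle of a record.
    if not lines[0].startswith(">"):
        problems.append("first line is not a '>' header")

    # Every header should be followed by at least one sequence line.
    for i, ln in enumerate(lines):
        if ln.startswith(">"):
            nxt = lines[i + 1] if i + 1 < len(lines) else None
            if nxt is None or nxt.startswith(">"):
                problems.append(f"header without sequence: {ln}")
    return problems
```

To check a file on disk, read it first, e.g. `check_fasta(open(path).read())`, where `path` is whatever file you passed to the program.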

trouble getting started

There might be an issue with the "dir" option. (We didn't use it while writing the articles). The program is complaining that it's not getting labels/class for the sequences. Possibly...

With --classType file, your "classes" should be in 2 separate multifasta files (each containing all the sequences belonging to a class, without "overlapping"/duplicates). e.g. (In case you have 2...
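The "no overlapping/duplicates" requirement can be checked before running the program. The following is a minimal sketch under the assumption that each class is a plain multifasta text; `parse_fasta` and `overlapping_sequences` are hypothetical names used for illustration and are not functions from the project.

```python
def parse_fasta(text):
    """Parse multifasta text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Close the previous record before starting a new one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def overlapping_sequences(text_a, text_b):
    """Return the set of sequences that appear in both class files."""
    seqs_a = {seq for _, seq in parse_fasta(text_a)}
    seqs_b = {seq for _, seq in parse_fasta(text_b)}
    return seqs_a & seqs_b
```

An empty result means the two class files share no identical sequences. The comparison is by exact sequence string only; headers are ignored.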

trouble getting started

Hi, I'm afraid that I'll be unable to debug the issue, as I'll be unavailable for the next month. I suggest forking from the earliest commit in the meantime....

trouble getting started

Worst case, just use the features generation methods/featureGen.py. Good luck. On Oct 1, 2015 7:27 PM, "cnjr2" [email protected] wrote: > Thanks for the info. I will give it a shot!...

removed click, added parameter check

Yay. Thank you very much :-) In the future, would you still be interested in, or able to, keep participating a bit more on this? (On relatively simple things: documentation (/readme), checking for exceptions, etc.) I'm not near...

removed click, added parameter check

Ok. On May 27, 2015 5:52 PM, "Michael Doron" [email protected] wrote: > It doesn't include saving the model or the normalization, > and I think it would be too much of a load for me, with the research at...