venzen comments

Results: 26 comments of venzen

Persist bookmarks when deleting and re-opening buffer

Perhaps persistence across sessions could be implemented with nvim.sqlite, as is done in nvim-neoclip.lua: https://github.com/AckslD/nvim-neoclip.lua

Custom words dictionary

@titipata Thank you for your response. I started implementing word similarity checking before I saw your reply. I can use `difflib.SequenceMatcher()` with a text file of Thai words and find correct...
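The similarity check mentioned in this comment could look roughly like the sketch below, which uses the standard-library `difflib.SequenceMatcher`. The helper name `closest_words`, the 0.6 threshold, and the in-memory word list are assumptions for illustration; in practice the words would be read from a text file, one per line.

```python
from difflib import SequenceMatcher


def closest_words(candidate, vocabulary, threshold=0.6):
    """Return (word, ratio) pairs from vocabulary similar to candidate, best first."""
    scored = []
    for word in vocabulary:
        # ratio() is a similarity score between 0.0 and 1.0
        ratio = SequenceMatcher(None, candidate, word).ratio()
        if ratio >= threshold:
            scored.append((word, ratio))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical word list; a real one would be loaded from a file.
vocabulary = ["หรือ", "อิริยาบถ", "ภาษาไทย"]

# The candidate differs from a vocabulary word in its final character only.
matches = closest_words("อิริยาบท", vocabulary)
```

The threshold is a tuning knob: a higher value returns fewer, closer suggestions.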

Custom words dictionary

Unexpected behavior from `deepcut`: I am passing a custom dictionary that contains both the words 'หรือ' and 'อิริยาบถ'. Each word is a separate entry on its own line and without...

Custom words dictionary

The issue was that the custom dictionary contained duplicate words (words also present in the deepcut dictionary). When I started from a new, blank custom dictionary, deepcut worked as expected.
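One way to guard against this is to filter the custom word list before handing it to deepcut. The sketch below rests on assumptions: `base_words` stands in for the vocabulary deepcut already knows (how to obtain it is not covered here), and `remove_known_words` is a hypothetical helper, not part of deepcut.

```python
def remove_known_words(custom_words, base_words):
    """Keep custom entries that are non-empty, unique, and absent from base_words."""
    base = set(base_words)
    seen = set()
    kept = []
    for raw in custom_words:
        word = raw.strip()  # drop surrounding whitespace and newlines
        if word and word not in base and word not in seen:
            seen.add(word)
            kept.append(word)
    return kept


# Hypothetical data: "หรือ" is assumed to already be in the base dictionary.
base_words = ["หรือ"]
custom_words = ["หรือ", "อิริยาบถ", "อิริยาบถ", ""]
cleaned = remove_known_words(custom_words, base_words)
```

Writing `cleaned` back out, one word per line, gives a custom dictionary with no overlap.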

Training on "Shakespeare" dataset is faster by using MacBook Air (M2)

@nexthybrid You can pass `--device="mps"` on the command line or specify `device="mps"` in the _finetune_shakespeare.py_ script.

how can I use this project to train a model for Chinese ?

@codetiger When you say "tune the encoder/decoder a little bit", can you give a brief explanation or example?

Encountered problems with the new dataset.

Your train.py script looks OK. What is the output of the prepare.py script with your new data? When you run train.py, do you use CPU or GPU? I ask because you...

Finetuning did not seem to change the generated contents of GPT-2?

Not sure that this is going to solve your issue, but I'd recommend you try the process of finetuning again with the following practice: 1. Execute commands from the project...

running prepare.py on a very large dataset

Tokenizing as you propose (read a line, tokenize it, write it to file, repeat) could work. However, the final step of creating the _train.bin_ and _val.bin_ files of...
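The line-by-line approach could be sketched as follows. This is an illustration under assumptions, not the project's prepare.py: `toy_tokenize` is a placeholder that emits one id per UTF-8 byte in place of a real tokenizer, the file names are made up, and storing ids as unsigned 16-bit integers is assumed. It covers the streaming step only, not the creation of the final train/val files.

```python
import os
import tempfile
from array import array


def toy_tokenize(line):
    """Placeholder tokenizer: one token id per UTF-8 byte (not a real BPE)."""
    return list(line.encode("utf-8"))


def stream_tokenize(text_path, bin_path, tokenize=toy_tokenize):
    """Tokenize text_path one line at a time, appending uint16 ids to bin_path."""
    total = 0
    with open(text_path, encoding="utf-8") as src, open(bin_path, "wb") as dst:
        for line in src:  # only one line is held in memory at a time
            ids = array("H", tokenize(line))  # "H" = unsigned 16-bit integers
            ids.tofile(dst)
            total += len(ids)
    return total


# Small self-contained demo using a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    text_path = os.path.join(tmp, "input.txt")
    bin_path = os.path.join(tmp, "tokens.bin")
    with open(text_path, "w", encoding="utf-8") as f:
        f.write("first line\nsecond line\n")
    n_tokens = stream_tokenize(text_path, bin_path)
    n_bytes = os.path.getsize(bin_path)
```

Memory use stays bounded by the longest line, so the size of the dataset on disk matters much less than with a read-everything approach.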

running prepare.py on a very large dataset

@aartivnkt Unfortunately I cannot advise about GPUs - I don't have one and know nothing about them :) What I can say is that the prepare.py script does not use GPU, so you...