RiccardoRiglietti
RiccardoRiglietti
``` --n_samples N_SAMPLES how many samples to produce for each given prompt. A.k.a batch size ``` This means how many images will be generated. ` --n_iter N_ITER sample this often...
@zirui has done all the hard work of scraping commit messages and has put the finished dataset on HuggingFace at [https://huggingface.co/datasets/zirui3/TSSB-3M-ext](https://huggingface.co/datasets/zirui3/TSSB-3M-ext) So I think this issue is done thanks to...
@zirui @ontocord here is the pull asking to add the notebook: https://github.com/LAION-AI/Open-Assistant/pull/1425 It says something weird about pre-commit, but I cannot run pre-commit locally because of version incompatibility
@ontocord I managed to run precommit by installing it with conda rather than with snap
@zirui Thanks for your effort, if you are querying the GitHub API for commits, it might be worth it to also get more context for the code, i.e. more lines...
Automating this looks hard, as we need a way to pull code from questions, that contain both code and text, also I suggest using beginner questions that contain less and...
Now I have run the pre-commit script and the check passes.
@olliestanley Given that the commits are not in the dataset, but they must be taken from GitHub with the API or web scraping or cloning the repositories with their history,...
@olliestanley I cleaned up the notebook, removed the outdated cell, added the open in colab badge, added section titles and a short description at the start.
@olliestanley I put the link temporarily to the online version of the notebook, good night.