Open-Assistant
Add tlcv2.0_oa
tlcv2.0_oa is a dataset made for the Open Assistant text-only format. It was built from the Thai Literature Corpora (TLC).
Thai Literature Corpora (TLC) is a collection of machine-ingestible Thai classical literature texts, similar to Project Gutenberg. https://attapol.github.io/tlc.html
Hugging Face Datasets: pythainlp/tlcv2.0_oa
Can you strip the outputs from the notebook please? It's a bit large! Also rebase + squash commits if possible.
OK. I have stripped the outputs from the notebook.
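For anyone who needs to do the same, here is a minimal sketch of stripping notebook outputs using only the Python standard library. It assumes the notebook is an nbformat 4 JSON file; the function names `strip_outputs` and `strip_file` are hypothetical and are not taken from this PR.

```python
import json


def strip_outputs(notebook: dict) -> dict:
    """Clear outputs and execution counts from every code cell, in place."""
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []
            cell["execution_count"] = None
    return notebook


def strip_file(path: str) -> None:
    """Read a notebook from `path`, strip its outputs, and write it back."""
    with open(path, encoding="utf-8") as f:
        notebook = json.load(f)
    strip_outputs(notebook)
    with open(path, "w", encoding="utf-8") as f:
        # ensure_ascii=False keeps Thai text readable in the saved file.
        json.dump(notebook, f, indent=1, ensure_ascii=False)
        f.write("\n")
```

Calling `strip_file` on the notebook path before committing keeps the diff small, since only the cell sources remain.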
Rebased on main, fixed merge conflicts, and kept the contributor commits rather than squashing :)
@bitplane I think in general we decided to squash all PRs, to keep history clean. If you use "Squash and merge" in GitHub it will list the author of the commit as the person who made the PR, and anyone else who committed will also receive credit as a co-author :)
Sorry, I merged this before I saw the discussion on Discord. I'll squash in the future :)