Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Add tlcv2.0_oa

Open wannaphong opened this issue 2 years ago • 2 comments

tlcv2.0_oa is a dataset that made for Open Assistant Text-only format. It was build from Thai Literature Corpora (TLC).

Thai Literature Corpora (TLC) is Corpora of machine-ingestible Thai classical literature texts like gutenberg. https://attapol.github.io/tlc.html

Hugging Face Datasets: pythainlp/tlcv2.0_oa

wannaphong avatar Mar 04 '23 19:03 wannaphong

Can you strip the outputs from the notebook please? It's a bit large! Also rebase + squash commits if possible.

bitplane avatar Mar 05 '23 15:03 bitplane

Can you strip the outputs from the notebook please? It's a bit large! Also rebase + squash commits if possible.

OK. I was strip the outputs from the notebook.

wannaphong avatar Mar 05 '23 15:03 wannaphong

Rebased on main, fixed merge conflicts and kept contributor commits rather than squash :)

bitplane avatar Mar 19 '23 20:03 bitplane

@bitplane I think in general we decided to squash all PRs, to keep history clean. If you use "Squash and merge" in GitHub it will list the author of the commit as the person who made the PR, and anyone else who committed will also receive credit as a co-author :)

olliestanley avatar Mar 19 '23 20:03 olliestanley

Sorry, I merged this before I saw the discussion on Discord. I'll squash in future :)

bitplane avatar Mar 20 '23 12:03 bitplane