KcBERT icon indicating copy to clipboard operation
KcBERT copied to clipboard

๐Ÿค— Pretrained BERT model & WordPiece tokenizer trained on Korean Comments ํ•œ๊ตญ์–ด ๋Œ“๊ธ€๋กœ ํ”„๋ฆฌํŠธ๋ ˆ์ด๋‹ํ•œ BERT ๋ชจ๋ธ๊ณผ ๋ฐ์ดํ„ฐ์…‹

Results 2 KcBERT issues
Sort by recently updated
recently updated
newest added

์•ˆ๋…•ํ•˜์„ธ์š”! ์ฝ”ํผ์Šค ๋ฐ ์ฝ”๋“œ๋ฅผ ๊ณต๊ฐœํ•ด์ฃผ์…”์„œ ์ •๋ง ๊ฐ์‚ฌํ•ฉ๋‹ˆ๋‹ค. ๊ณต๊ฐœํ•ด์ฃผ์‹  ์ฝ”ํผ์Šค๋กœ KcBERT๋ฅผ ์ง์ ‘ ํ•œ๋ฒˆ ๋งŒ๋“ค์–ด ๋ณด๋ ค๊ณ  ํ•˜๋Š”๋ฐ์š”. BERT ๊ณต์‹ github(https://github.com/google-research/bert)์˜ pre-training ์„ค๋ช…์— ๋”ฐ๋ฅด๋ฉด | Here's how to run the data generation. The...

์•ˆ๋…•ํ•˜์„ธ์š”! ์ข‹์€ ๋ชจ๋ธ๊ณผ ์ฝ”๋“œ๋ฅผ ์—ด์–ด์ฃผ์…”์„œ ๊ฐ์‚ฌํ•ฉ๋‹ˆ๋‹ค. ๋‹ค๋ฆ„์ด ์•„๋‹ˆ๋ผ ์ œ๊ฐ€ https://beomi.github.io/2021/03/15/KcBERT-MLM-Finetune/ ์ด ์‚ฌ์ดํŠธ์— ๋‚˜์™€์žˆ๋Š”๋ฐ๋กœ ์ถ”๊ฐ€ ํ•™์Šต์„ ํ–ˆ์—ˆ๋Š”๋ฐ ์ œ ๋„๋ฉ”์ธ์— ๋งž๋Š” ๋ฐ์ดํ„ฐ [mask] ์˜ˆ์ธก์„ ์ž˜ ํ•˜์ง€ ๋ชปํ•˜๋Š” ๊ฒƒ ๊ฐ™์•„์„œ, vocab.txt๋ฅผ ์ œ ํ•™์Šต...