Train Data(v1) Release!

@Beomi Beomi released this 08 Sep 11:29
· 23 commits to master since this release
fff9e4f

To make the dataset previously published on Kaggle easier to download, it is released here as a split archive (three parts of 2G/2G/0.6G) :)

(Pretrain dataset on Kaggle: https://www.kaggle.com/junbumlee/kcbert-pretraining-corpus-korean-news-comments )

Download all three parts of kcbert-train.tar.gz below (aa, ab, and ac), then run the following command in the folder that contains them to extract the archive.

cat kcbert-train.tar.gz* | tar -zxvpf -
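
To see why this works, here is a minimal sketch of the same split-and-rejoin round trip on a throwaway archive. The names `demo.tar.gz`, `src`, and `a.txt` are hypothetical and used only for illustration. The 64-byte chunk size is likewise arbitrary, and this is not necessarily how the release files were produced. The shell expands the glob in sorted order (aa, ab, ac, ...), so `cat` reassembles the original byte stream before `tar` reads it from standard input.

```shell
set -eu

# Work in a temporary directory so nothing else is touched.
workdir=$(mktemp -d)
cd "$workdir"

# Build a tiny archive (hypothetical names, for illustration only).
mkdir src
printf 'hello\n' > src/a.txt
tar -czf demo.tar.gz src

# Split it into 64-byte parts named demo.tar.gzaa, demo.tar.gzab, ...
split -b 64 demo.tar.gz demo.tar.gz

# Remove the originals so that only the parts match the glob below.
rm -r src demo.tar.gz

# Same pattern as the release command: concatenate the parts in glob
# order and let tar read the joined stream from stdin ("-f -").
cat demo.tar.gz* | tar -zxvpf -

cat src/a.txt
```

If any part is missing or incomplete, the concatenated stream is truncated and `tar` should report an error, so it is worth confirming that all three downloads finished before extracting.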