From 4218adb1b4233fa6ae618dde92136c6f4925baf5 Mon Sep 17 00:00:00 2001 From: Wonseok Hwang Date: Tue, 8 Nov 2022 07:40:22 +0000 Subject: [PATCH] docs: update README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 95cefaa..597181b 100644 --- a/README.md +++ b/README.md @@ -11,6 +11,7 @@ A multi-task benchmark for Korean legal language understanding and judgement pre - [Minjoon Seo](mailto:minjoon@lbox.kr) # Updates +- Nov 8, 2022: We release [`legal-mt5-small`], a domain adapted mt5-small using `precedent_corpus`. We also release the `legal-mt5-small` fine-tuned on the `summarization` dataset. Both models can be download from [here](https://drive.google.com/file/d/1lZaUtDPCkAOcwaxBzFo-QHecGAQendOd/view?usp=share_link)! To use the models, `cd [project-dir]; tar xvfz legal-mt5-small.tar.gz`. - Oct 18, 2022: We release three new datasets `casename_classification_plus`, `statute_classification_plus`, and `summarization_plus`! - Oct 2, 2022: [`defamation corpus-v0.1`](https://lbox-open.s3.ap-northeast-2.amazonaws.com/precedent_benchmark_dataset/defamation_corpus/defamation_corpus.jsonl) has been added. The corpus consists of 1,536 criminal cases related to "defamation (λͺ…μ˜ˆν›Όμ†)". The corpus will be integrated into `precedent corpus` in the future (at the moment, there can be some overlap between `precedent corpus` and `defamation corpus-v0.1`). See also [this issue](https://github.com/lbox-kr/lbox-open/issues/4#issue-1393652876). - Sep 2022: Our paper is accepted for publication in NeurIPS 2022 Datasets and Benchmarks track! There will be major updates on the paper, the dataets, and the models soon! Meanwile, one can check the most recent version of our paper from [OpenReview](https://openreview.net/forum?id=TaARsI_Iio)