'古籍·酷' Automatic Punctuation tool refers to the Artificial Intelligence Punctuation Engine released by the Beijing Longquan Monastery (Fenghuangling, Haidian District) Tripitaka Office on the website (http://gj.cool), excluding the texts processed by the engine. This tool automatically punctuating the modern Chinese ancient text by machine without human intervention.
The date sets that derived from the published ancient Chinese literatures has been further proofread and processed. It mainly includes the punctuated CBETA text, Confucian literature, Taoist classics, 24 histories, 13 sutras, Tang history and so on. GJAP is an open dataset, which means it will grow over time as data is contributed. Thus in order to enable reproducibility and accurate citation in scientific journals the dataset is versioned.
Autopunc-Datasets-1-14:https://github.com/xianchun?tab=repositories
The datasets and engine are free for learning and research, not allowed any form of commercial exploitation. To use the engine for batch processing of text, please apply to the office for a free application programming interface (API) and indicate the engine URL in the release. It is not responsible for any damage caused by the use of the engine.