Skip to content

GuJiCool/Auto-Punctuation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Automatic-Punctuation (GJAP)

'古籍·酷' Automatic Punctuation tool refers to the Artificial Intelligence Punctuation Engine released by the Beijing Longquan Monastery (Fenghuangling, Haidian District) Tripitaka Office on the website (http://gj.cool), excluding the texts processed by the engine. This tool automatically punctuating the modern Chinese ancient text by machine without human intervention.

Datasets for automatic punctuation

The date sets that derived from the published ancient Chinese literatures has been further proofread and processed. It mainly includes the punctuated CBETA text, Confucian literature, Taoist classics, 24 histories, 13 sutras, Tang history and so on. GJAP is an open dataset, which means it will grow over time as data is contributed. Thus in order to enable reproducibility and accurate citation in scientific journals the dataset is versioned.

Terms of Use

The datasets and engine are free for learning and research, not allowed any form of commercial exploitation. To use the engine for batch processing of text, please apply to the office for a free application programming interface (API) and indicate the engine URL in the release. It is not responsible for any damage caused by the use of the engine.

'古籍·酷' API Application: https://jinshuju.net/f/HjqYl0

About

Automatic punctuation engine training and testing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published