Sentence segmentation accuracy #157
Unanswered
rshahrabani
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
mic.txt
I am using the latest build of Catalyst for sentence segmentation. I have used it on the attached file but the accuracy seems to be off. For example, the following was broken into 2 separate sentences (see General Instruction A.2. below):
June 7, 2021 MACQUARIE INFRASTRUCTURE CORPORATION (Exact name of Registrant as specified in its charter) (212) 231-1000 (Registrant's telephone number, including area code) N.A. (Former name or former address, if changes since last report) Check the appropriate box below if the Form 8-k filing is intended to simultaneously satisfy the filing obligation of the registrant under any of the following provisions (see General Instruction A.2.
below): ¨ Written communications pursuant to Rule 425 under the Securities Act (17 CFR 230.425) x Soliciting material pursuant to Rule 14a-12 under the Exchange Act (17 CFR 240.14a-12) ¨ Pre-commencement communications pursuant to Rule 14d-2(b) under the Exchange Act (17 CFR 240.14d-2(b)) ¨ Pre-commencement communications pursuant to Rule 13e-4(c) under the Exchange Act (17 CFR 240.13e-4(c)) Securities registered pursuant to Section 12(b) of the Act: Indicate by check mark whether the registrant is an emerging growth company as defined in Rule 405 of the Securities Act of 1933 (-230.405 of this chapter) or Rule 12b-2 of the Securities Exchange Act of 1934 (-240.12b-2 of this chapter).
I am using the code:
Is there any way to improve the accuracy of this sentence segmentation?
Beta Was this translation helpful? Give feedback.
All reactions