Releases: Kuuuube/japanese_text_analyzer
0.1.3
0.1.2
Changelog:
- Fix case where the replacement character inserted by lossy string conversion could push a string over the max tokenizer length
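To illustrate why lossy conversion can lengthen a string: in Rust, `String::from_utf8_lossy` replaces each invalid byte sequence with U+FFFD, which takes 3 bytes in UTF-8, so a single bad byte can add 2 bytes to the result. The sketch below is an assumption about how such a case might be guarded against, not the project's actual fix; `MAX_INPUT_BYTES` and `decode_within_limit` are hypothetical names, and the limit is an arbitrary example value.

```rust
/// Hypothetical limit for illustration only; not the tokenizer's real maximum.
const MAX_INPUT_BYTES: usize = 16;

/// Decode bytes lossily, then trim the result back to at most `max_bytes`
/// without cutting a UTF-8 character in half.
fn decode_within_limit(raw: &[u8], max_bytes: usize) -> String {
    // Each invalid sequence becomes U+FFFD (3 bytes), so `text` may be
    // longer than `raw`.
    let mut text = String::from_utf8_lossy(raw).into_owned();
    if text.len() > max_bytes {
        let mut end = max_bytes;
        // Walk back to the nearest character boundary (index 0 always is one).
        while !text.is_char_boundary(end) {
            end -= 1;
        }
        text.truncate(end);
    }
    text
}
```

Calling `decode_within_limit(&raw, MAX_INPUT_BYTES)` on input that sits exactly at the limit but contains one invalid byte returns a string that has been trimmed back under the limit.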
0.1.1
Changelog:
- Added option to filter by extension when using --any
- Added option to parse .mokuro files
- Fixed --any overflowing sudachi's max byte length
- Removed redundant --txt option
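One common way to keep arbitrary text under a tokenizer's byte limit is to split it into chunks at character boundaries before tokenizing. The following is a minimal sketch of that idea only; `split_by_byte_limit` is a hypothetical helper, the limit is passed in by the caller, and nothing here reflects how the project or Sudachi actually handles it.

```rust
/// Split `text` into slices of at most `max_bytes` bytes each, never cutting
/// inside a UTF-8 character. Hypothetical helper for illustration.
fn split_by_byte_limit(text: &str, max_bytes: usize) -> Vec<&str> {
    let mut chunks = Vec::new();
    let mut rest = text;
    while !rest.is_empty() {
        if rest.len() <= max_bytes {
            chunks.push(rest);
            break;
        }
        // Start at the limit and walk back to a character boundary.
        let mut end = max_bytes;
        while end > 0 && !rest.is_char_boundary(end) {
            end -= 1;
        }
        if end == 0 {
            // The limit is smaller than the first character. Take that one
            // character anyway so the loop always makes progress; this chunk
            // will exceed `max_bytes`.
            end = rest.chars().next().map_or(rest.len(), |c| c.len_utf8());
        }
        let (head, tail) = rest.split_at(end);
        chunks.push(head);
        rest = tail;
    }
    chunks
}
```

Splitting purely by byte count can separate a word across two chunks, so a real implementation would likely prefer to break at line or sentence boundaries where possible.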
0.1.0
Changelog:
- Added options for other analysis formats (--any, --txt)
0.0.4
Changelog:
- Added average page length
- Added total page count
- Added average volume length
- Added total volume count
- Added total textbox count
0.0.3
Changelog:
- Add average (and shortest/longest) textbox length to stats
- Improve prints to help see what's going on
- Show time each part took to complete in ms
0.0.2
Changelog:
- Uses Sudachi Mode B to remove edge case of tokenizing ridiculously long compound words that are impossible to search in most dictionaries
- Print usage message and panic when run without args instead of only panicking
- Continue and print error on unreadable files instead of panicking
- Continue and print error on tokenization failures instead of panicking
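The "continue and print error" behavior described above can be sketched in Rust by matching on the `Result` rather than unwrapping it. This is an illustrative assumption, not the project's code; `read_all_readable` is a hypothetical function name.

```rust
use std::fs;
use std::path::PathBuf;

/// Read every file that can be read as UTF-8 text; report the rest on stderr
/// and keep going instead of panicking. Hypothetical helper for illustration.
fn read_all_readable(paths: &[PathBuf]) -> Vec<(PathBuf, String)> {
    let mut loaded = Vec::new();
    for path in paths {
        match fs::read_to_string(path) {
            Ok(text) => loaded.push((path.clone(), text)),
            // Missing files, permission errors, and invalid UTF-8 all land
            // here; the loop moves on to the next path.
            Err(err) => eprintln!("Skipping {}: {}", path.display(), err),
        }
    }
    loaded
}
```

The same pattern applies to a tokenization step: match on the error, print it, and move on to the next input.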
0.0.1
Initial release