Skip to content

Editor and CLI for maintaining a set of multilingual markdown(.md) files

License

Notifications You must be signed in to change notification settings

hrgdavor/multilingual-markdown

Repository files navigation

multilingual-markdown (mulmd)

Editor and CLI(called mulmd) for maintaining a set of multilingual markdown(.md) files

currently testing editor prototype

Started as an idea from a discussion .

TODO for intital release

general

  • define inital format
  • LIB: hashing rules
  • LIB: uuid lib / func
  • LIB: choose hash - MD5(cross platform and cross language) not for security, it is just helping detect changes in text
  • create node project (package.json)
  • CLI: create script with UUID and md5 included cli tutorial used
  • LIB: setup first test (test sample text and md5 of it)
  • LIB: sections parse - read md into obj model
  • LIB: sections stringify - write md from obj model
  • Editor: basic edit base file (drag and drop 1 file) sections outline, click to edit title and text separately one section at the time
  • LIB: initialize sections (add uuids and hashes)
  • Editor: publish as github page (docs folder) test version
  • CLI: initialize base file (add uuids and hashes) mulmd init docs/editor/parameters.md
  • Editor: initialize base file (add uuids and hashes)
  • CLI: generate file for translation from base file mulmd init file.md file.fr.md
  • Editor: generate file for translation from base file
  • LIB: clean metadata, and copy sections (after that serialize produces normal MD file for publishing)
  • LIB: optional section info for code blocks
  • DOC: define/document options that can be used in CLI via package.json or as argument
  • ALL: convert to ESM
  • CLI: read package.json for options (like the one above)
  • Editor: ask option: section info for code blocks
  • CLI: export clean file (MD without medatada)
  • CLI: export clean folder (MD without medatada)
  • Editor: export clean file (MD without medatada)
  • LIB: translation info (details)
    • Case 1 - missing-translation
    • Case 2 - missing-original
    • Case 3 - initialized
    • Case 4 - initialized-dirty
    • Case 5 - partial
    • Case 6 - partial-dirty
    • Case 7 - translated-dirty
    • Case 8 - translated
  • Editor: basic edit translation file (drag and drop 2 files, together or 1 by 1)
  • Editor: edit translation file: with diff of changes in base file (drag and drop 2 files, together or 1 by 1)
  • CLI: sync translation by injecting missing sections
  • Editor: sync translation by injecting missing sections
  • CLI-pre-commit-hook: script spacification so it can be used to block unfinished edits
  • CLI: allow automatic pushing code-block changes across translations (if it was unchanged in the translation)
  • Editor: allow automatic pushing code-block changes across translations (if it was unchanged in the translation)
  • Editor: electron app
  • CLI-info: specification of the script (options and actions and infos that it can give)
  • CLI-info: xxx
  • CLI-info: yyy

package.json

maybe future

  • CLI: lint code sections (or standardx) ... have them included and linted separetely
  • Editor: lint code sections
  • use git to compare current file edited with latest git show HEAD:./filename.txt

format

General format is defined by JSON string injected at the end of each heading, additional brackets are added for less chance of conflict with heading title

  • id - id of the section
  • h - md5 checksum of the text
  • ts - timestamp when hash was generated and the hashed text id bound to the hash
  • trans - for translated section is percentage of translation done
    • -1 - for original file
    • -2 - for copy (the copied section in) this type of section is not to be exported as it is part of the file only for keeping track of the changes.
    • 0-99 - for translation
  • status - empty/ommited when everything is ok, any other value means something more needs to be done (checkGrammar,approve,in progress). This is not part of this tool, but is a proposal for further more complex management of the sections

Other utilities may inject more properties in the JSON, so editor should not clear them.

Base file contains the original text

# Main tile
({"id":"UUID", "h":"md5"})
some text.

## subtitle
({"id":"UUID", "h":"md5"})
subtitle text etc

translations inside code

There is already similar use case for footnotes in general

  • use footnote syntax inside code blocks by placing a footnote reference [^1],[^2],...
  • code block is then same for all languages and can safely be updated in an automated task across translations
  • translation is placed as footnote
  • this is mainly meant for comments, but may be even ok to use inside strings
  • when exporting md files one can move translations back to code block and remove from text body
  • if there is legitimate code inside code-block that would be mistaken for footnote syntax, one could then enforce a prefix for these translation footnotes that would then prevent the conflict with the code

example code block and translation as footnote later

|```js
({"id":"UUID", "h":"md5"})
|  function code(){// [^1]
|```
|
|[^1]: Comment for the function that can be translated without changing the code

UUID and hash is also generated for code sections

|```js
({"id":"UUID", "h":"md5"})
|  function code(){...
|```

when translation file is created, hash:"" to mark not yet translated.

hashing rules

  • section title JSON data removed before hashing
  • code-block JSON data removed before hashing
  • lines: timrEnd (trimRight)
  • emtpy lines removed at the end .... (not sure about this one)

About

Editor and CLI for maintaining a set of multilingual markdown(.md) files

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published