This is a rough first proposal for the Typogrify project. It addresses the following deficiencies of the current project:
- It uses a HTML parser (as oppossed to a regex)
- The regex for the filters are less complex (mostly due to reason number 1)
- The regex for the filters have been cleaned up
- The codebase has been neatened up
- Specifically ignores tags like script (which does not affect Pelican only because documents are typogrified before scripts are added)