Emoji and abbreviations parser #305

MihaZupan · 2019-02-06T16:13:52Z

Fixes #296
Fewer memory allocations:

Building a new pipeline:

| Method | Mean | Gen 0/1k Op | Gen 1/1k Op | Allocated Memory/Op |
|---|---|---|---|---|
| Markdig | 12.05 us | 19.1498 | - | 14.72 KB |
| Markdig_Advanced | 42.11 us | 45.1660 | - | 34.75 KB |
| Markdig_Advanced_Emoji | 1,490.56 us | 285.1563 | 130.8594 | 1502.8 KB |
| Markdig_Advanced_Emoji_Modified | 1,499.65 us | 298.8281 | 126.9531 | 1502.77 KB |

(new)

| Method | Mean | Gen 0/1k Op | Gen 1/1k Op | Allocated Memory/Op |
|---|---|---|---|---|
| Markdig | 12.04 us | 19.1498 | - | 14.72 KB |
| Markdig_Advanced | 41.97 us | 45.1660 | - | 34.75 KB |
| Markdig_Advanced_Emoji | 194.58 us | 76.6602 | 15.3809 | 134.75 KB |
| Markdig_Advanced_Emoji_Modified | 252.09 us | 85.4492 | 18.5547 | 162.05 KB |

Here, `Markdig_Advanced_Emoji_Modified` is the variant that forces the lazy initialization of the dictionary properties.
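For readers unfamiliar with the pattern, here is a rough sketch of lazy initialization, written in Python rather than C# and not taken from Markdig. The class `EmojiTable`, its `mapping` property and the `build_count` counter are hypothetical names used only for illustration. The idea is that the dictionary is built on first access, so a pipeline that never touches the property does not pay for it.

```python
class EmojiTable:
    """Hypothetical holder whose dictionary is only built when first requested."""

    def __init__(self):
        self._mapping = None   # nothing allocated yet
        self.build_count = 0   # counts how often the dictionary was built

    @property
    def mapping(self):
        # Build the dictionary on first access and reuse it afterwards.
        if self._mapping is None:
            self._mapping = {":smile:": "\U0001F604", ":+1:": "\U0001F44D"}
            self.build_count += 1
        return self._mapping


table = EmojiTable()      # cheap: no dictionary exists yet
_ = table.mapping         # first access pays the construction cost
_ = table.mapping         # later accesses reuse the same object
```

A benchmark that reads such a property, as the Modified variant is described as doing, measures the cost including that deferred construction.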

Parsing speed for emojis and abbreviations is about the same as before (~10% faster for emojis),
since the benchmark dataset could be considered the worst case for a prefix tree of this type: every input starts with the same character.
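To illustrate why a shared first character is unfavourable, here is a minimal sketch of a plain character trie with longest-match lookup, in Python. It is not the CompactPrefixTree ported in this PR; `PrefixTree`, `add` and `longest_match` are hypothetical names. When every key begins with `:`, the root has a single child, so the first lookup step never rules out any candidate.

```python
class PrefixTree:
    """Minimal character trie; an illustrative sketch, not Markdig's implementation."""

    def __init__(self):
        # Each node is a dict mapping a character to a child node.
        # The special key None holds the value stored at that node.
        self._root = {}

    def add(self, key, value):
        node = self._root
        for ch in key:
            node = node.setdefault(ch, {})
        node[None] = value

    def longest_match(self, text, start=0):
        """Return (key, value) for the longest key prefixing text[start:], or None."""
        node = self._root
        best = None
        for i in range(start, len(text)):
            node = node.get(text[i])
            if node is None:
                break
            if None in node:
                best = (text[start:i + 1], node[None])
        return best


tree = PrefixTree()
for shortcode, emoji in [(":smile:", "\U0001F604"),
                         (":smiley:", "\U0001F603"),
                         (":+1:", "\U0001F44D")]:
    tree.add(shortcode, emoji)
```

With these keys the root node has exactly one child (`:`), so all discrimination between entries happens from the second character onward.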

xoofx · 2019-02-08T16:38:24Z

Thanks, great perf improvement!

MihaZupan added 8 commits February 6, 2019 15:44

Allow single-char abbreviations

b15b050

Cross target NetCore 2.1

ca38da5

Port CompactPrefixTree to Markdig

d854b0b

Improve EmojiParser memory performance

325495a

Fix Abbreviations parser's one-char handling

ef452c2

Remove TextMatchHelper

18e9486

Add test case for xoofx#296

a11676e

Comment-out TextMatcher test in Benchmarks

b5293b9

xoofx merged commit 1d8266b into xoofx:master Feb 8, 2019

MihaZupan mentioned this pull request Feb 20, 2019

fix emoji performance issue when multiple pipeline needed #308

Merged

leotsarev mentioned this pull request Feb 26, 2019

TextMatchHelper was removed — what's upgrade path? #312

Closed

MihaZupan deleted the emoji-and-abbreviations-parser branch March 11, 2019 17:33
