
Split model training from randomness #71

Closed · Diggsey opened this issue May 29, 2021 · 3 comments · Fixed by #72


Diggsey (Contributor) commented May 29, 2021

Training the model with learn() is relatively slow (much slower than generating new words), so ideally you could train it once and then use the same model to generate several sequences with different seeds.

However, this is not possible with the current API: the seed is supplied when you first construct the MarkovChain and cannot be changed after that point.

AFAICT, the learn() function is completely deterministic (it does not use the RNG at all), so coupling the two is a suboptimal design.

I would suggest removing the rng field from the MarkovChain entirely and instead passing the RNG when you construct the Words iterator, along the lines of the sketch below. That way you can train a model once and then generate several sequences of text with different seeds. This should also make the lipsum_words_from_seed function much faster, even when not using a custom model.
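For illustration, usage of the restructured API could look roughly like this (a sketch only: the iter_with_rng name and the concrete RNG type are placeholders for whatever shape the new API ends up taking, not the crate's current API):

```rust
use rand::SeedableRng;
use rand_chacha::ChaCha20Rng;

fn main() {
    // Train once; learn() is deterministic, so no RNG is involved here.
    let mut chain = lipsum::MarkovChain::new();
    chain.learn("lorem ipsum dolor sit amet consectetur adipiscing elit");

    // Reuse the same trained model for several differently-seeded sequences.
    for seed in 0..3 {
        let rng = ChaCha20Rng::seed_from_u64(seed);
        let words: Vec<&str> = chain.iter_with_rng(rng).take(10).collect();
        println!("{}", words.join(" "));
    }
}
```

The expensive learn() step runs once; only the cheap generation step is repeated per seed.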

mgeisler (Owner) commented:
Hi @Diggsey,

Thanks for reporting this! Your analysis sounds spot on, and I like that lipsum_words_from_seed can reuse the thread-local LOREM_IPSUM_CHAIN variable this way.
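Roughly, I picture something like this (just a sketch of the idea; the cell layout and the iter_with_rng-style method are assumptions about how it could be wired up, not the actual internals):

```rust
use std::cell::RefCell;

use lipsum::{MarkovChain, LOREM_IPSUM};
use rand::SeedableRng;
use rand_chacha::ChaCha20Rng;

thread_local! {
    // The expensive learn() step runs once per thread; the cached chain
    // then serves any number of differently-seeded calls.
    static LOREM_IPSUM_CHAIN: RefCell<Option<MarkovChain<'static>>> =
        RefCell::new(None);
}

fn lipsum_words_from_seed(n: usize, seed: u64) -> String {
    LOREM_IPSUM_CHAIN.with(|cell| {
        let mut cached = cell.borrow_mut();
        let chain = cached.get_or_insert_with(|| {
            let mut chain = MarkovChain::new();
            chain.learn(LOREM_IPSUM);
            chain
        });
        // Only the generation step depends on the seed.
        let rng = ChaCha20Rng::seed_from_u64(seed);
        chain.iter_with_rng(rng).take(n).collect::<Vec<_>>().join(" ")
    })
}
```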

Would you be up for restructuring things as you suggest?

Diggsey (Contributor, Author) commented May 30, 2021

Yeah, it will be a breaking change though.

mgeisler (Owner) commented:

> Yeah, it will be a breaking change though.

That's okay, especially since the lipsum_* functions will stay unchanged, right?

The few crates that depend on lipsum seem to use only those high-level functions, so they should have no problem upgrading.
