Add explicit tests to check Hermes can deal with arbitrarily large description terms #66

wardle · 2024-05-19T12:16:38Z

As per https://confluence.snomedtools.org/mag/community-consultations/snomed-international-proposal-to-increase-description-length-limit, SNOMED International is consulting on whether to increase the description length limit to 4096. In this proposal, users are reminded that:

"The RF2 Specification for SNOMED CT Descriptions states that the overall length limit for a description is 32Kb (understood to be Kilobits), equating to 4096 single byte characters."

As such, Hermes should already check that long descriptions of arbitrary length can be stored, retrieved and searched.

A quick examination of the synthetic unit tests shows that while generative testing is good at exercising these functions, the approach to generative testing starts with small strings and increases with the number of tests. A cursory reporting of string lengths actually tested with reasonable numbers of iterations shows strings are rarely generated over 50 characters. This means the current generative tests are insufficient to prove Hermes is behaving correctly with very large description lengths.

As such, the synthetic test generators should be changed to create very large strings. Even if SNOMED International does not increase the description length in the future, the implementation of store and search within Hermes uses no fixed size buffers. In order to create reasonable synthetic data, it would be reasonable to only synthesise large strings for a small proportion of generated synthetic descriptions. Fortunately, test.check, based on Haskell's QuickCheck makes this quite easy by simply using gen/frequency to change the generator based on specific frequencies.

As such, to resolve this issue, we need to do the following

Alter the generator for RF2 descriptions to potentially generate very large descriptions
Improve the synthetic tests to check that descriptions of any length are correctly stored, retrieved, indexed and found

…lly generated descriptions As per #66

See #66

wardle added a commit that referenced this issue May 19, 2024

Generate very large descriptions for a small proportion of synthetica…

082f13c

…lly generated descriptions As per #66

wardle added a commit that referenced this issue May 19, 2024

Check that we are creating descriptions of sufficient length for testing

8f928c2

See #66

wardle closed this as completed in c278f3b May 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add explicit tests to check Hermes can deal with arbitrarily large description terms #66

Add explicit tests to check Hermes can deal with arbitrarily large description terms #66

wardle commented May 19, 2024

Add explicit tests to check Hermes can deal with arbitrarily large description terms #66

Add explicit tests to check Hermes can deal with arbitrarily large description terms #66

Comments

wardle commented May 19, 2024