ga4gh · nsheff · Aug 25, 2023 · May 11, 2022 · Jun 1, 2022 · Dec 15, 2022
diff --git a/docs/decision_record.md b/docs/decision_record.md
@@ -6,6 +6,24 @@
 
 [TOC]
 
+## 2022-05-11 - Sequence identifier specification
+
+### Decision
+
+The GA4GH identifier will be used as our default sequence identifier instead of MD5. Other identifiers can be provided in a separate array and should not be part of the collection checksum calculation.
+The GA4GH identifier scheme will be used to digest:
+
+- the sequences that are stored in the `sequences` array
+- the canonical representation of arrays of level 2
+- the canonical representation of the sequence collection of level 1
+
+### Rationale
+
+GA4GH identifiers were created as part of the [Variation Representation Specification standard](https://vrs.ga4gh.org/en/stable/impl-guide/computed_identifiers.html), which defines a way of creating identifiers for sequences. For example, the sequence `ACGT` results in the identifier `ga4gh:SQ.aKF498dAxcJAqme6QYQ7EZ07-fiw8Kw2`. The scheme uses the [`sha512t24u` function](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0239883) to create a base64 URL-safe representation of a truncated SHA-512 digest. Adopting GA4GH identifiers keeps sequence collections in line with newer standards within the GA4GH ecosystem.
+
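As an illustration of the scheme described in the rationale, here is a minimal Python sketch of `sha512t24u` and of building a sequence identifier from it. It assumes the usual reading of the function name: a SHA-512 digest truncated to its first 24 bytes and encoded as URL-safe base64. The helper name `ga4gh_sequence_identifier` is hypothetical and is not taken from this decision record or from any library.

```python
import base64
import hashlib


def sha512t24u(data: bytes) -> str:
    """Return the SHA-512 digest of `data`, truncated to 24 bytes, as URL-safe base64."""
    digest = hashlib.sha512(data).digest()
    # 24 bytes encode to exactly 32 base64 characters, so no '=' padding appears.
    return base64.urlsafe_b64encode(digest[:24]).decode("ascii")


def ga4gh_sequence_identifier(sequence: str) -> str:
    """Hypothetical helper: prefix the digest with the `ga4gh:SQ.` namespace and type."""
    return "ga4gh:SQ." + sha512t24u(sequence.encode("ascii"))


if __name__ == "__main__":
    print(ga4gh_sequence_identifier("ACGT"))
```

Running this on `ACGT` should reproduce the identifier quoted in the rationale. The sketch covers only the digest of a single sequence; it does not attempt the canonical representation used for the level 1 and level 2 digests.
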
+### Limitations
+
+GA4GH identifiers are not the default identifier used by standards such as CRAM, which uses MD5. We expect sequence collection providers to offer additional identifier arrays to provide compatibility with these other formats and to declare their sequence identifier support via service-info.
+
+### Linked issues
+
+- [https://github.com/ga4gh/seqcol-spec/issues/30](https://github.com/ga4gh/seqcol-spec/issues/30)
+
 ## 2021-12-01 - Endpoint names and structure
 
 ### Decision