Greatly expand architecture section, including splitting into abstract vs concrete model #6877

Ericson2314 · 2022-08-06T00:46:11Z

I want to complete specify the store layer.

The concrete vs abstract split I would expect to be controversial, so here my rationalization for it:

Explaining everything at once makes for a huge tsunami of information that is highly likely to overwhelm the reader, even if they are "random accessing" the reference rather than reading it in order, end to end. However, if one learns the abstract model first, then they can develop their mental model for the "skeleton" on which the concrete details is the "meat". Rather than appearing as a wave of assorted details, the details make sense as a combination of the requirements of the abstract model with only a few arbitrary choices + historical evolutions mixed. That makes for a "compessed" mental representation (the concrete model encoded against the abstract mode) which is a lot less mentally taxing.

In the language of https://documentation.divio.com/'s documentation quadrant, the abstract model is pure explanation, describing what we are trying to do with out the constraints of how Unix and software for unix work today. The concrete model is mainly reference, with just enough explanation to tie it back to the abstract model.

This order of information I think matches how @edolstra designed Nix in the first place (see patterns in functional programming that can be reused for new purpose, design system accordingly), and also matches how I think most advanced users / developer Nix think about it. I thus think it is both a proper and a "proven" strategy for how "bottom half" of the documentation quadrant can be conveyed.

No TOC TOC tweak More markdown fix Fix markdown TODO for mentioning placeholders Concrete derivations Very nice to be able to leverage the abstract section! Greatly extend abstract drv & derived ref section Expand abstract store intro Fill in details about concrete store paths Small edits Start describing derivations Abstract vs concrete store path Introduce abstract and concrete stores Sketch out new TOC with abstract vs concrete split Merge branch 'reference-section' into architecture Apply suggestions from code review Co-authored-by: Valentin Gagarin <valentin@fricklerhandwerk.de> Improve reference introduction section Move scanning for references down Merge branch 'reference-section' into architecture Merge branch 'doc-what-is-nix' into reference-section Merge branch 'doc-what-is-nix' into architecture Merge branch 'undelete-wip-docs' into architecture Add back deleted WIP stuff for "staging" branch Merge branch 'doc-what-is-nix' into reference-section Fix typo Start to write about object content addressing Expand reference hashing section Move references to new section

…hitecture

Re-enable architecture section, at least for the PR while its in a draft state.

toraritte · 2022-08-06T01:29:05Z

doc/manual/src/architecture/store/abstract/abstract.md

@@ -0,0 +1,17 @@
+# The abstract model
+
+The abstract model is inspired by functional programming in the following key ways:


Does it still hold that Nix is a purely functional model?

toraritte · 2022-08-07T06:27:42Z

doc/manual/src/architecture/store/store.md

+The Nix store layer is the heart of Nix, the cornerstone of its design.
+It comes from two basic insights: a vision for build systems in the abstract based on functional programming, and an application of the vision to conventional software for conventional operating system.
+We could just present the combination of those two in the form of the current design of the Nix store, but we believe there is value introducing them separately.
+This still describes how Nix works, so this section still serves as a spec, but it also demonstrates with Nix's authors believe is a good way to think* about Nix.
+If one tries to learn the concrete specifics before learning the abstract model, the following text might come across as a wall of details without sufficient motivation.
+Conversely, if one learns the abstract model first, many of the concrete specifics will make more sense as miscellaneous details placed in the "slots" where the abstract model expects.
+The hope is that makes the material far less daunting, and helps it make sense in the mind of the reader.


I had a hard time wrapping my head around this section, until I jumped ahead to read both abstract.md and concrete.md. Basically,

the "abstract" section introduces Nix fundamental building blocks, and explains how the build prcoess works at a high level

the "concrete" section takes the introduced terms (derivation, store object, etc.) and functional programming principles, and applies them in the context of file systems

If this is correct, then the phrase "concrete specifics" here means "examples of the application of functional programming principles to build system operations over a file system", right? Also, wouldn't it be clearer to call "abstract model" as "model" and "concrete model" as "application" / "implementation"?

On a related note, I think this approach could be applied to the entirety of this section or to the Nix manual - naybe even splitting into two levels of documents?

the first level of doc(s) expound(s) the model / specification (building blocks, abstract rules, core paradigms, guiding principles, etc.)

the second level are companion guides that drive the level one (i.e., spec) concepts home by showing how they are applied in the implementation, referring back to level one docs when needed.

(Of course, if the spec changes, both levels need to be updated.)

Suggestion:

To ease understanding, we will explain the Nix store in two parts:

Subsection X introduces the theoretical functional model<1>

Subsection Y shows how the functional paradigm and elements of the model are applied when it comes to building software in an actual filesystem<2>

[1]: or notion, concept, etc.

[2]: There could be other examples to show the versatility of Nix when using it to assemble stuff other than software

toraritte · 2022-08-07T23:43:51Z

Both reviews should have probably been submitted to #6420, right?.. (It's highly confusing to have 2 PRs for the same topic - are there more?)

nixos-discourse · 2022-08-08T00:50:44Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/summer-of-nix-documentation-stream/20351/4

Ericson2314 · 2022-08-09T03:51:42Z

Both reviews should have probably been submitted to #6420, right?.. (It's highly confusing to have 2 PRs for the same topic - are there more?)

Well this one is supposed to be pick up against the last one. Even before I did the abstract vs concrete thing, I made the separate branch to have more content at the cost of less polishing.

edolstra · 2022-08-12T08:46:56Z

I'm not convinced that having a big architecture document is the way to go. I think it would be better to have more "literate", cross-referenced comments in the source code in combination with doxygen (or whatever). For example:

reference-scanning.md should be a doxygen comment in references.hh.
The description of the NAR format can go into archive.hh.
Not sure what the best anchor would be for describing the store path hash computation, but probably we should move all makeStorePath() and friends out of store-api.cc into their own source file and then we can keep the docs there.
The description of the build algoritm can go into libstore/build.

I also don't think we should have Haskell-style pseudo-code that reproduces C++ types, like this:

data DerivedPath
  = OpaquePath { path : StorePath }
  | BuiltPath {
      drv    : StorePath,
      output : OutputName,
    }

Given how much churn there has been in these types on the implementation side, having to constantly update an architecture document to reflect implementation changes would be painful. It is hard enough to keep comments in sync with the source, but the architecture document is likely to become outdated immediately. Again I think it would be preferable to have doxygen comments on the C++ types that describe their function.

There is also a lot of terminology here that doesn't appear in the implementation, which is likely to confuse people, e.g. "references are capabilities", FSOs, etc. Also, there are some reflections that would be more appropriate for an academic paper, like monadic vs applicative build systems and comparisons to Von Neumann/Harvard architectures.

edolstra · 2022-08-12T08:47:34Z

doc/manual/src/architecture/store/concrete/reference-scanning.md

+
+## Exceptions
+
+Note that just because Nix *can* scan references doesn't mean that it *must* scan for references.


I have a hard time understanding this section and I don't really know what it's trying to say. Nix absolutely does have to scan for references, there is no option to disable that.

This section is pretty WIP.

There is a subtlety where reference scanning isn't normative in that when one does a --check we don't attempt to rescan, but I can make that more explicit

blaggacao · 2022-08-12T13:48:20Z

As a pure tooling advice, there is handy: https://github.com/JoelCourtney/mdbook-kroki-preprocessor

Ericson2314 · 2022-08-12T14:00:29Z

@edolstra the entire point of having a specification document is that it is separate from the implementation. I want people to know the the Nix store layer works without having to read any C++ or even Doxygen.

Ericson2314 · 2022-08-12T14:11:52Z

Given how much churn there has been in these types on the implementation side, having to constantly update an architecture document to reflect implementation changes would be painful.

A lot less painful that doing it out of tree though!

Also, I think the type changes have been mainly replacing strings and more std::variant and thus are by and large converging on new stable design. I expect new changes to be not from refactoring existing features once they are cleaned up, but adding new features like "deep content-addressing".

It still benefits being in-tree in that case so we can update the spec and implementation together in the new feature case, but it is not so onerous that it is futile to have a spec at all.

nixos-discourse · 2022-08-15T12:31:42Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/summer-of-nix-documentation-stream/20351/2

fricklerhandwerk · 2022-08-17T15:04:04Z

doxygen

@edolstra Do we have infrastructure to make this visible on a par with the manual? As long as we don't, I'm strongly in favor of adding this type of material to the manual. I'm agnostic to how we build this, if the result is easily discoverable and not too expensive in terms of setup and maintenance.

I agree that this kind of thing should be right next to the code, but we can still move it in there later. What's more important is that it's written down at all, and easy to find, read, and modify.

I also don't think we should have Haskell-style pseudo-code that reproduces C++ types

@edolstra Agreed. The architecture spec should show the principles, not the implementation details. Actually we should not use any kind of specific code to represent those principles, but ideally something more universal (and self-explanatory) such as UML diagrams (no specific proposal, just an example) where it's suitable. Even if Haskell or whatever is more precise, we cannot assume people to know any of this. Learning also has dependencies, and we should make them explicit and keep the closure small.

There is also a lot of terminology here that doesn't appear in the implementation

Concerning really abstract stuff that doesn't touch on what's really going on in the implementation I agree, but I suppose some things can have more structured naming. Sure, the term FSO is not used in the code, but why not change the code?

@Ericson2314 I'm not convinced by the abstract/concrete setup. There is overwhelming evidence that people don't learn well from first principles. The manual, even the architecture spec, should describe Nix specifically, and nothing else. As suggested above, we can squint a little bit or add some wishful thinking about code factoring and naming to make things appear less thorny, but we should stay with what there is.

While I believe all of what you wrote is true, it is often hard to read and understand. And because we made the mistake already in #6420, I think we should go through this with multiple PRs section by section (as outlined here), even if it is more work for authors. It's really really hard to review such a large amount of text, and we have to take reviewer's effort into consideration as well.

@edolstra If you could please revert 81e1013 we can do exactly that for #6420 and make a PR for each section topologically sorted, now that we have developed an idea on the general direction we would like to take.

Ericson2314 · 2022-08-17T15:41:02Z

The point of the data types is that it is the interface, not just a mere implementation detail.

I am fine replacing with something that doesn't look like code, but let's be clear what this is achieving. People have a bias "programing language = implementation", so we are simply laundering the exact same information in a format that seems more "platonic".

If we understand that we're just trying to find a presentation that plays nicely with people's biases, great, we're on the same page. If we think there actually is an objective rather than rhetorical problem with data types in the spec, then we're still talking past each other on what a "specification" even means.

fricklerhandwerk · 2022-08-17T17:13:10Z

People have a bias "programing language = implementation", so we are simply laundering the exact same information in a format that seems more "platonic".

@Ericson2314 when we worked on #6420 we were pretty clear in that regard. We used non-code interface descriptions to be more universally approachable and simplified them to a point where the principle became visible. Sure, if we rewrote the Store in Haskell, that would be a moot point. But that's not what Nix is today.

It's not about dumbing down, it's about making it easy to understand with minimal prerequisites.

Ericson2314 · 2022-08-17T21:31:07Z

@fricklerhandwerk but that PR did contain data definitions, like https://github.com/nix-community/nix/blob/39d32ac4c63f4aa3784d114b19c0eca83e306ca9/doc/manual/src/architecture/store/fso.md This PR adds more of them, but it uses them in exactly the same way: to specify Nix, irrespective of how it is implemented.

roberth · 2025-01-19T18:00:33Z

This is written in a somewhat narrative, certainly linear and learning-oriented style. An explanation, whereas the manual is a reference manual.

It starts in a way that is so abstract that I doubt that opening with it is a good idea, considering its central place in the ecosystem and the expectation most users have to learn about the tool, not the architecture. Opening this way signals to them that Nix is a highly abstract and academic piece of software that's not for solving their practical problems. Of course these thoughts are misguided, but that does not make them less real or less prevalent.

The language used is only understandable by functional programmers, which is a mere fraction of this document's audience.

I think we might be able to include documentation like this in the manual, but then I think it should be in a 'second half' after the reference documentation. It would also be wise to only link to the explanations from the places where they are introduced in the reference documentation. The architecture and explanation section of the manual _ must not_ be used as a substitute for reference documentation.

It is not a language topic.

Thanks!! Co-authored-by: Robert Hensing <roberth@users.noreply.github.com>

More can be written, but this will do for now.

It is too much of an explanation. There are other docs we can move here later in the language section already.

Ericson2314 added 11 commits August 4, 2022 09:43

Merge remote-tracking branch 'nix-community/doc-what-is-nix' into arc…

b152b9f

…hitecture

Discuss concrete RFC 92 derived path & fix typos

f03d511

Document abstract building

34b7768

Merge remote-tracking branch 'upstream/master' into architecture

1cf4f1b

Move files around to match the TOC

c65d1c5

Replace ^^^^ with |-----| highlights to match

c0c2a27

WIP moving over store path info to manaul

5cd2284

Move things around some more

16fbcd8

Big progress towards specifying content addressing store objects

b942454

Merge remote-tracking branch 'upstream/master' into architecture

006e8f5

Re-enable architecture section, at least for the PR while its in a draft state.

Ericson2314 changed the title ~~Greatly exapnd architecture section, including splitting into abstract vs concrete model~~ Greatly expand architecture section, including splitting into abstract vs concrete model Aug 6, 2022

toraritte mentioned this pull request Aug 7, 2022

Document what Nix *is* #6420

Merged

toraritte reviewed Aug 7, 2022

View reviewed changes

Finish talking about content addressing store objects

6b39672

Talk about content addressing FSOs w/ text from dissertation

a03e4d4

edolstra reviewed Aug 12, 2022

View reviewed changes

Ericson2314 added 8 commits November 17, 2024 21:51

Small tweak

1926161

WIP moving info from builtins.derivation docs

a8b1e3f

Merge remote-tracking branch 'upstream/master' into architecture

7130df2

Merge branch 'master' into architecture

177d9cc

Merge branch 'master' into document-derivation-and-deriving-path

16bd5a4

Fix formatting issue

97977b7

Merge branch 'document-derivation-and-deriving-path' into architecture

7b920b2

Fix typo

4b29ca8

Ericson2314 added 4 commits January 19, 2025 13:07

Derivation doc improvements

4b9fb54

Merge branch 'document-derivation-and-deriving-path' into architecture

1c0312f

More doc improvements

87c559e

Merge branch 'document-derivation-and-deriving-path' into architecture

cb06347

Ericson2314 mentioned this pull request Jan 19, 2025

Document Store Derivations and Deriving Paths #12290

Open

Ericson2314 added 9 commits January 19, 2025 15:16

Move builder execution to store layer

96a46f9

It is not a language topic.

Formatting example

6246901

Fix redirect

80e6942

Fix redirects more

e77338d

Merge branch 'document-derivation-and-deriving-path' into architecture

2d3bed3

Fix another URL in the docs

fb7fc6a

Change deriving path pseudo-code

0cd1e78

Further reworking

05a38b8

Merge branch 'document-derivation-and-deriving-path' into architecture

c7e538c

github-actions bot added the new-cli Relating to the "nix" command label Jan 20, 2025

Ericson2314 and others added 6 commits January 20, 2025 12:56

Apply suggestions from code review

e409ea7

Thanks!! Co-authored-by: Robert Hensing <roberth@users.noreply.github.com>

Merge branch 'master' into document-derivation-and-deriving-path

3d3a86d

Clean out some TODOs

f3fa9cf

More can be written, but this will do for now.

Remove store-layer __structuredAttrs docs

f2d5be9

It is too much of an explanation. There are other docs we can move here later in the language section already.

Fix build failure

86f23c2

Merge branch 'document-derivation-and-deriving-path' into architecture

080a4ec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Greatly expand architecture section, including splitting into abstract vs concrete model #6877

Greatly expand architecture section, including splitting into abstract vs concrete model #6877

Ericson2314 commented Aug 6, 2022

toraritte Aug 6, 2022

toraritte Aug 7, 2022 •

edited

Loading

toraritte commented Aug 7, 2022 •

edited

Loading

nixos-discourse commented Aug 8, 2022

Ericson2314 commented Aug 9, 2022

edolstra commented Aug 12, 2022

edolstra Aug 12, 2022

Ericson2314 Aug 12, 2022

blaggacao commented Aug 12, 2022

Ericson2314 commented Aug 12, 2022

Ericson2314 commented Aug 12, 2022

nixos-discourse commented Aug 15, 2022

fricklerhandwerk commented Aug 17, 2022 •

edited

Loading

Ericson2314 commented Aug 17, 2022 •

edited

Loading

fricklerhandwerk commented Aug 17, 2022 •

edited

Loading

Ericson2314 commented Aug 17, 2022

roberth commented Jan 19, 2025

		@@ -0,0 +1,17 @@
		# The abstract model

		The abstract model is inspired by functional programming in the following key ways:


		## Exceptions

		Note that just because Nix can scan references doesn't mean that it must scan for references.

Greatly expand architecture section, including splitting into abstract vs concrete model #6877

Are you sure you want to change the base?

Greatly expand architecture section, including splitting into abstract vs concrete model #6877

Conversation

Ericson2314 commented Aug 6, 2022

toraritte Aug 6, 2022

Choose a reason for hiding this comment

toraritte Aug 7, 2022 • edited Loading

Choose a reason for hiding this comment

toraritte commented Aug 7, 2022 • edited Loading

nixos-discourse commented Aug 8, 2022

Ericson2314 commented Aug 9, 2022

edolstra commented Aug 12, 2022

edolstra Aug 12, 2022

Choose a reason for hiding this comment

Ericson2314 Aug 12, 2022

Choose a reason for hiding this comment

blaggacao commented Aug 12, 2022

Ericson2314 commented Aug 12, 2022

Ericson2314 commented Aug 12, 2022

nixos-discourse commented Aug 15, 2022

fricklerhandwerk commented Aug 17, 2022 • edited Loading

Ericson2314 commented Aug 17, 2022 • edited Loading

fricklerhandwerk commented Aug 17, 2022 • edited Loading

Ericson2314 commented Aug 17, 2022

roberth commented Jan 19, 2025

toraritte Aug 7, 2022 •

edited

Loading

toraritte commented Aug 7, 2022 •

edited

Loading

fricklerhandwerk commented Aug 17, 2022 •

edited

Loading

Ericson2314 commented Aug 17, 2022 •

edited

Loading

fricklerhandwerk commented Aug 17, 2022 •

edited

Loading