You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
In mkdocstrings, to format code, highlight it, and cross-ref names, we have to dance a bit:
a Jinja template renders everything unformatted, replacing names with unique ids, and stashing their corresponding cross-ref in a dict
we then format the code with black
we then highlight the code with pymdownx.highlight (pygments) (and by the way since builtin names like str are hidden behind unique ids, we lose some of the lexer capabilities)
we then replace the unique ids with the stashed cross-refs
It feels super convoluted (back and forth between Python code and Jinja templating), and inefficient (Griffe parsed code with ast, then Black parses it again, then Pygments lexes it again...).
Furthermore, pygments is "only" a lexer, so that means it cannot distinguish between the name of a parameter and a name used as value, and potentially other similar cases. That means we don't get classes distinct enough to allow users to style their code with enough flexibility, based on semantics rather than tokenization.
Describe the solution you'd like
I wonder if we couldn't pack all this (formatting + highlighting) directly in our expressions.
The highlighting part is easy: Griffe already parsed code as an expression, so we have all the semantics associated with each part of the expression. For example, in a function call, we know that the ExprName used for the names of keyword parameters are parameter names, and not just names, so we could easily wrap them in <span class="pn">...</span> or whatever.
The formatting part however is probably much more complex. I don't have the pretension to think I'm capable of writing something as qualitative and efficient as Black or Ruff. But maybe there's a way? For now, Griffe expressions only handle single statements (type annotations, signatures, assignments). What when we start supporting arbitrary code in expressions?
Describe alternatives you've considered
We could consider never formatting code ourselves, but use a CST instead of an AST, to preserve already existing formatting. Users would format the code themselves (there are plenty of tools to do that), and we would just keep the same formatting. However with Griffe's ability to transform expressions (for example expression modernization, and Annotated unwrapping), we will have to update the spacing anyway.
Maybe a combination of both would be enough? We preserve original formatting, therefore the formatting complexity is drastically reduced so we can naively format the transformed parts ourselves?
Benefit of retaining original formatting: we respect the users choice (comments enabling/disabling formatting for example).
Additional context
Boost priority
Boost priority in our backlog through Polar.sh. Higher pledge, higher priority.
Minimum pledge by user/organization is $5, minimum amount for boost is $30.
Is your feature request related to a problem? Please describe.
In mkdocstrings, to format code, highlight it, and cross-ref names, we have to dance a bit:
str
are hidden behind unique ids, we lose some of the lexer capabilities)It feels super convoluted (back and forth between Python code and Jinja templating), and inefficient (Griffe parsed code with
ast
, then Black parses it again, then Pygments lexes it again...).Furthermore, pygments is "only" a lexer, so that means it cannot distinguish between the name of a parameter and a name used as value, and potentially other similar cases. That means we don't get classes distinct enough to allow users to style their code with enough flexibility, based on semantics rather than tokenization.
Describe the solution you'd like
I wonder if we couldn't pack all this (formatting + highlighting) directly in our expressions.
The highlighting part is easy: Griffe already parsed code as an expression, so we have all the semantics associated with each part of the expression. For example, in a function call, we know that the
ExprName
used for the names of keyword parameters are parameter names, and not just names, so we could easily wrap them in<span class="pn">...</span>
or whatever.The formatting part however is probably much more complex. I don't have the pretension to think I'm capable of writing something as qualitative and efficient as Black or Ruff. But maybe there's a way? For now, Griffe expressions only handle single statements (type annotations, signatures, assignments). What when we start supporting arbitrary code in expressions?
Describe alternatives you've considered
We could consider never formatting code ourselves, but use a CST instead of an AST, to preserve already existing formatting. Users would format the code themselves (there are plenty of tools to do that), and we would just keep the same formatting. However with Griffe's ability to transform expressions (for example expression modernization, and
Annotated
unwrapping), we will have to update the spacing anyway.Maybe a combination of both would be enough? We preserve original formatting, therefore the formatting complexity is drastically reduced so we can naively format the transformed parts ourselves?
Benefit of retaining original formatting: we respect the users choice (comments enabling/disabling formatting for example).
Additional context
Boost priority
The text was updated successfully, but these errors were encountered: