☂️ Formatter: Support formatting of embedded code #8237

MichaReiser · 2023-10-26T02:56:06Z

Support formatting Python code embedded in other languages like:

Markdown
reStructuredText
HTML
...

The goal of this issue is not that we implement support for all these languages but to build up the infrastructure to run ruff (at least the formatter) on files that contain embedded python code and format it. Ideally, the infrastructure would, in the future, allow us to support arbitrary nesting:

Format SQL in Python
Format Markdown in Python
...

Prettier and JetBrains code formatter do an excellent job at this.

dhruvmanila · 2023-10-26T03:10:47Z

The goal of this issue is not that we implement support for all these languages but to build up the infrastructure to run ruff (at least the formatter) on files that contain embedded python code and format it.

I think we should keep the linter in mind while designing the infrastructure as, and I'm not 100% sure but it's mainly my intuition from working on the Notebook support, it's more than likely than we get the formatter support for free. Free in the sense that it'll require considerably less effort on the formatter side once the infrastructure is in place.

MichaReiser · 2023-10-26T03:14:15Z

I agree, but I wanted to keep this issue scoped. The linter and formatter likely have similar requirements and need similar infrastructure, but with slight nuances.

ddelange · 2023-10-26T06:41:44Z

potential duplicate of #3792

henryiii · 2023-10-26T16:25:28Z

https://github.com/adamchainz/blacken-docs does this with black and markdown, ReST, and LaTeX. You can see the block types supported there. (python, pycon, etc). Even better, maybe the block types could be configurable, say setting md-blocks = ["python", "ipython", "{code-cell} ipython3"] would allow you to run on python, ipython, and executable Python (see https://jupyterbook.org/en/stable/file-types/myst-notebooks.html).

KelSolaar · 2024-01-29T04:44:27Z

Hello,

I would be keen to see doctests formatting, e.g.:

def least_square_mapping_MoorePenrose(
    y: ArrayLike, x: ArrayLike
) -> NDArrayFloat:
    """
    Compute the *least-squares* mapping from dependent variable :math:`y` to
    independent variable :math:`x` using *Moore-Penrose* inverse.

    Parameters
    ----------
    y
        Dependent and already known :math:`y` variable.
    x
        Independent :math:`x` variable(s) values corresponding with :math:`y`
        variable.

    Returns
    -------
    :class:`numpy.ndarray`
        *Least-squares* mapping.

    References
    ----------
    :cite:`Finlayson2015`

    Examples
    --------
    >>> prng = np.random.RandomState(2)
    >>> y = prng.random_sample((24, 3))
    >>> x = y + (    prng.random_sample(    (24, 3)) - 0.5) * 0.5
    >>> least_square_mapping_MoorePenrose(y, x)  # doctest: +ELLIPSIS
    array([[ 1.0526376...,  0.1378078..., -0.2276339...],
           [ 0.0739584...,  1.0293994..., -0.1060115...],
           [ 0.0572550..., -0.2052633...,  1.1015194...]])
    """

    y = np.atleast_2d(y)
    x = np.atleast_2d(x)

    return np.dot(np.transpose(x), np.linalg.pinv(np.transpose(y)))

Ruff currently does not format anything inside >>> and ... in docstrings. I managed to drop Black and replace it with Ruff but I still depend on adamchainz/blacken-docs.

Cheers,

Thomas

MichaReiser · 2024-01-29T07:25:46Z

Hey @KelSolaar

Docstring code block formatting is supported but off by default. You can enable it in your settings using format.docstring-code-format = true

KelSolaar · 2024-01-29T07:42:43Z

Hi @MichaReiser,

I have it enabled but it does not seem to format the above.

Cheers,

Thomas

MichaReiser · 2024-01-29T08:22:07Z

@KelSolaar

The issue is that

array([[ 1.0526376...,  0.1378078..., -0.2276339...],
           [ 0.0739584...,  1.0293994..., -0.1060115...],
           [ 0.0572550..., -0.2052633...,  1.1015194...]])

is not valid python syntax (because of the ...). The formatter can only format examples that are valid python.

KelSolaar · 2024-01-29T08:30:32Z

Right I see! So this particular code output is used as a doctest where any whitespace counts so it should NOT be formatted, ever. Ruff formatter should ignore those outputs fully which is what adamchainz/blacken-docs does. Hope it does make sense!

KelSolaar · 2024-01-29T18:36:08Z

For that specific case, Ruff should only consider docstring lines starting with either >>> or .... There might be some subtleties and I would check the doctests parser to confirm but it is the main idea.

henryiii · 2024-01-29T19:46:35Z

That example should use the pycon lexer (similar to how "console" is used for console input with $/#). Anything in the python language should be valid Python and formatted, so I think Ruff is doing the right thing trying to format it (and failing). Supporting pycon and formatting the "python" parts would be nice, though! (Note that pycon is supported elsewhere, including in markdown like here on GitHub)

If no language is given (such code with a 4-space indent), I think it should be treated as whatever the default is (which IIRC might be Python).

ddelange · 2024-01-29T20:06:11Z

corresponding google-style doctest:

def least_square_mapping_MoorePenrose(
    y: ArrayLike, x: ArrayLike
) -> NDArrayFloat:
    """
    Compute the *least-squares* mapping from dependent variable :math:`y` to
    independent variable :math:`x` using *Moore-Penrose* inverse.

    Examples:
        >>> prng = np.random.RandomState(2)
        >>> y = prng.random_sample((24, 3))
        >>> x = y + (    prng.random_sample(    (24, 3)) - 0.5) * 0.5
        >>> least_square_mapping_MoorePenrose(y, x)  # doctest: +ELLIPSIS
        array([[ 1.0526376...,  0.1378078..., -0.2276339...],
               [ 0.0739584...,  1.0293994..., -0.1060115...],
               [ 0.0572550..., -0.2052633...,  1.1015194...]])
    """

    y = np.atleast_2d(y)
    x = np.atleast_2d(x)

    return np.dot(np.transpose(x), np.linalg.pinv(np.transpose(y)))

aneeshusa · 2024-03-04T23:10:33Z

+1 to this - beyond blacken-docs, shed which I'm currently switching from has this feature too!

MichaReiser added formatter Related to the formatter wish Not on the current roadmap; maybe in the future labels Oct 26, 2023

MichaReiser mentioned this issue Oct 26, 2023

Format Python code in documentation files astral-sh/ruff-pre-commit#55

Closed

MichaReiser changed the title ~~☂️ Format Python code embedded in other languages~~ ☂️ Formatter: Support formatting of embedded code Oct 26, 2023

henryiii mentioned this issue Nov 17, 2023

Start using Scientific Python's repo-review astropy/astropy#15367

Merged

1 task

dhruvmanila mentioned this issue Nov 21, 2023

Support text Jupyter notebooks created with Jupytext #8800

Open

JacobCoffee mentioned this issue Dec 13, 2023

format doctests in docstrings #8811

Merged

3 tasks

Saransh-cpp mentioned this issue Dec 23, 2023

Migrate to ruff format pybamm-team/PyBaMM#3656

Merged

8 tasks

charliermarsh mentioned this issue Jan 2, 2024

Extend docstring-code-format to python snippets in MarkDown, RST etc #9326

Closed

zanieb mentioned this issue Jan 23, 2024

Using ruff to format doctests in .rst (and other format) files #9620

Closed

jagerber48 mentioned this issue Jan 27, 2024

FormattedNumber and post Latex and HTML conversion jagerber48/sciform#134

Merged

bluetech mentioned this issue Feb 1, 2024

Migrate from autoflake, black, isort, pyupgrade, flake8 and pydocstyle, to ruff pytest-dev/pytest#11901

Merged

JacobCoffee mentioned this issue Mar 16, 2024

Docs: Code block line length litestar-org/litestar#3211

Open

dhruvmanila mentioned this issue Mar 20, 2024

Add flag for syntax checking of doctests as in flake8 --doctests #3542

Open

cphyc mentioned this issue Apr 19, 2024

STY: migrate formatting from black to ruff-format yt-project/yt#4868

Merged

nstarman mentioned this issue Apr 27, 2024

Add support for Ruff's docstring-code-format scientific-python/cookie#343

Open

nstarman mentioned this issue Jun 3, 2024

DOC: cleanup Python 2 idioms astropy/astropy#16523

Merged

1 task

lafrech mentioned this issue Jul 2, 2024

[pre-commit.ci] pre-commit autoupdate marshmallow-code/apispec-webframeworks#151

Merged

seisman mentioned this issue Aug 22, 2024

DOC: Generate the charset tables dynamically from codes GenericMappingTools/pygmt#3409

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

☂️ Formatter: Support formatting of embedded code #8237

☂️ Formatter: Support formatting of embedded code #8237

MichaReiser commented Oct 26, 2023 •

edited

Loading

dhruvmanila commented Oct 26, 2023

MichaReiser commented Oct 26, 2023

ddelange commented Oct 26, 2023

henryiii commented Oct 26, 2023

KelSolaar commented Jan 29, 2024

MichaReiser commented Jan 29, 2024 •

edited

Loading

KelSolaar commented Jan 29, 2024

MichaReiser commented Jan 29, 2024

KelSolaar commented Jan 29, 2024 •

edited

Loading

KelSolaar commented Jan 29, 2024

henryiii commented Jan 29, 2024 •

edited

Loading

ddelange commented Jan 29, 2024

aneeshusa commented Mar 4, 2024

☂️ Formatter: Support formatting of embedded code #8237

☂️ Formatter: Support formatting of embedded code #8237

Comments

MichaReiser commented Oct 26, 2023 • edited Loading

dhruvmanila commented Oct 26, 2023

MichaReiser commented Oct 26, 2023

ddelange commented Oct 26, 2023

henryiii commented Oct 26, 2023

KelSolaar commented Jan 29, 2024

MichaReiser commented Jan 29, 2024 • edited Loading

KelSolaar commented Jan 29, 2024

MichaReiser commented Jan 29, 2024

KelSolaar commented Jan 29, 2024 • edited Loading

KelSolaar commented Jan 29, 2024

henryiii commented Jan 29, 2024 • edited Loading

ddelange commented Jan 29, 2024

aneeshusa commented Mar 4, 2024

MichaReiser commented Oct 26, 2023 •

edited

Loading

MichaReiser commented Jan 29, 2024 •

edited

Loading

KelSolaar commented Jan 29, 2024 •

edited

Loading

henryiii commented Jan 29, 2024 •

edited

Loading