Preserve quotes in generated f-strings #15794

ntBre · 2025-01-28T21:11:11Z

Summary

This is another follow-up to #15726 and #15778, extending the quote-preserving behavior to f-strings and deleting the now-unused Generator::quote field.

Details

I also made one unrelated change to rules/flynt/helpers.rs to remove a to_string call for making a Box<str> and tweaked some arguments to some of the Generator::unparse_f_string methods to make the code easier to follow, in my opinion. Happy to revert especially the latter of these if needed.

Unfortunately this still does not fix the issue in #9660, which appears to be more of an escaping issue than a quote-preservation issue. After #15726, the result is now a = f'# {"".join([])}' if 1 else "" instead of a = f"# {''.join([])}" if 1 else "" (single quotes on the outside now), but we still don't have the desired behavior of double quotes everywhere on Python 3.12+. I added a test for this but split it off into another branch since it ended up being unaddressed here, but my dbg! statements showed the correct preferred quotes going into UnicodeEscape::with_preferred_quote.

Test Plan

Existing rule and Generator tests.

and document its use in unparse_f_string

in RUF030

github-actions · 2025-01-28T21:20:18Z

`ruff-ecosystem` results

Linter (stable)

ℹ️ ecosystem check detected linter changes. (+5 -5 violations, +0 -0 fixes in 2 projects; 53 projects unchanged)

bokeh/bokeh (+4 -4 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --no-fix --output-format concise --no-preview --select ALL

+ examples/server/api/flask_embed.py:26:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/flask_embed.py:26:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block
+ examples/server/api/flask_gunicorn_embed.py:41:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/flask_gunicorn_embed.py:41:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block
+ examples/server/api/standalone_embed.py:18:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/standalone_embed.py:18:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block
+ examples/server/api/tornado_embed.py:29:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/tornado_embed.py:29:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block

zulip/zulip (+1 -1 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --no-fix --output-format concise --no-preview --select ALL

+ scripts/lib/sharding.py:65:21: SIM108 Use ternary operator `host = shard if "." in shard else f"{shard}.{external_host}"` instead of `if`-`else`-block
- scripts/lib/sharding.py:65:21: SIM108 Use ternary operator `host = shard if "." in shard else f'{shard}.{external_host}'` instead of `if`-`else`-block

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
SIM108	10	5	5	0	0

Linter (preview)

ℹ️ ecosystem check detected linter changes. (+5 -5 violations, +0 -0 fixes in 2 projects; 53 projects unchanged)

bokeh/bokeh (+4 -4 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --no-fix --output-format concise --preview --select ALL

+ examples/server/api/flask_embed.py:26:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/flask_embed.py:26:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block
+ examples/server/api/flask_gunicorn_embed.py:41:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/flask_gunicorn_embed.py:41:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block
+ examples/server/api/standalone_embed.py:18:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/standalone_embed.py:18:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block
+ examples/server/api/tornado_embed.py:29:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f"{new}D").mean()` instead of `if`-`else`-block
- examples/server/api/tornado_embed.py:29:9: SIM108 Use ternary operator `data = df if new == 0 else df.rolling(f'{new}D').mean()` instead of `if`-`else`-block

zulip/zulip (+1 -1 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --no-fix --output-format concise --preview --select ALL

+ scripts/lib/sharding.py:65:21: SIM108 Use ternary operator `host = shard if "." in shard else f"{shard}.{external_host}"` instead of `if`-`else`-block
- scripts/lib/sharding.py:65:21: SIM108 Use ternary operator `host = shard if "." in shard else f'{shard}.{external_host}'` instead of `if`-`else`-block

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
SIM108	10	5	5	0	0

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

AlexWaygood

Nice! Would it be possible to add some new lint rule test fixtures that show a behaviour change from this PR? Maybe a new snippet or two for one of the flynt rules?

crates/ruff_linter/src/rules/flynt/helpers.rs

crates/ruff_python_codegen/src/generator.rs

Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>

ntBre · 2025-01-28T22:32:58Z

Maybe a new snippet or two for one of the flynt rules?

This is trickier than I expected. For both of these rules, I just used Checker::default_fstring_flags instead of passing them along from an existing string. For the flynt rule that's reasonable since it's creating an f-string where one didn't exist before, but the RUF030 rule should actually preserve them. I think I need to restructure the rule code a bit to make this possible, though.

dhruvmanila · 2025-01-29T03:45:59Z

but we still don't have the desired behavior of double quotes everywhere on Python 3.12+. I added a test for this but split it off into another branch since it ended up being unaddressed here, but my dbg! statements showed the correct preferred quotes going into UnicodeEscape::with_preferred_quote.

We should make sure that this doesn't introduce any incompatibility with the f-string style in the formatter (I think it shouldn't) as the formatter prefers alternating quote style instead of using the same quotes in Python 3.12+ (https://docs.astral.sh/ruff/formatter/#f-string-formatting).

dhruvmanila · 2025-01-29T03:52:31Z

Unfortunately this still does not fix the issue in #9660, which appears to be more of an escaping issue than a quote-preservation issue.

Sorry, I think I missed this issue before but I was wondering whether that should actually be considered a bug or not because the formatter prefers using alternating quote style. I think this should not be a bug because if the issue author is running the formatter after the linter, then the quotes will be changed back which might seem confusing.

AlexWaygood · 2025-01-29T10:59:15Z

Hmm, @dhruvmanila I still would consider #9660 a bug I think. I agree that if a lint autofix is generating a new f-string completely from scratch, it should prefer to use alternating quotes like the formatter does. But that's not what is happening in #9660 -- in #9660, it's replacing an existing f-string with a new f-string, but the new f-string has a different quoting style to the old f-string. That seems like an unnecessary stylistic change for the autofix to make, considering that the user might not even have any autoformatter enabled on their code. If they're using the Ruff formatter, then the issue doesn't arise in the first place, because their existing f-string would probably already have alternating quotes. (Or, if it's an f-string that they've just added and haven't yet formatted, they can run the formatter immediately after they run the linter in order to fix it up.)

Basically, my opinion is that where possible lint autofixes should aim to have no opinion at all on quoting styles, and should leave that to the formatter. Instead, they should as much as possible try to preserve quoting styles when doing autofixes.

AlexWaygood · 2025-01-29T11:02:08Z

This is trickier than I expected. For both of these rules, I just used Checker::default_fstring_flags instead of passing them along from an existing string. For the flynt rule that's reasonable since it's creating an f-string where one didn't exist before, but the RUF030 rule should actually preserve them. I think I need to restructure the rule code a bit to make this possible, though.

From the ecosystem report in #15794 (comment), it looks like there's some changes to SIM108 suggested autofixes -- you could maybe add some f-string snippets to the SIM108 fixtures, if the flynt rules are hard to add tests for?

ntBre · 2025-01-29T15:00:51Z

@AlexWaygood Thanks for the SIM108 suggestion, I added two cases for that: one with double quotes and one with single quotes. I was looking for rules that explicitly interacted with f-strings, but I see now that just about any rule that uses the Generator and can apply to an expression with f-strings would have worked, which is exciting on its own!

@dhruvmanila What do you think about Alex's comments? I tend to agree with preserving whatever the user had and then letting the formatter clean it up afterward. We still have some time to decide either way, though. I was going to work on preserving prefixes and triple quotes before taking a stab at that issue anyway.

AlexWaygood

🎉

ntBre added 10 commits January 28, 2025 15:48

remove effectively unused is_spec argument to unparse_f_string_value

8488a4e

and document its use in unparse_f_string

separate parse_f_string and parse_f_string_specifier and pass quote

23ed586

avoid to_string

ed46f7c

use Checker's default quote style for new f-string in FLY002

aff34a5

update Generator quote tests again

8a6208e

pass along the quote from Checker::default_string_flags for f-string

25deb4f

in RUF030

delete now-unused Generator::quote field and related uses

47d9ff2

macro is down to one case

1442618

FStringFlags docs, no default, add Checker::default_fstring_flags

9dade43

tidy up

6bfebce

ntBre added bug Something isn't working fixes Related to suggested fixes for violations labels Jan 28, 2025

ntBre requested a review from MichaReiser as a code owner January 28, 2025 21:11

AlexWaygood approved these changes Jan 28, 2025

View reviewed changes

Box::from over into

cb474df

Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>

ntBre added 2 commits January 29, 2025 09:36

combine deduplicated set_quote tests with quote tests

d9ee56c

add SIM108 test for preserving quotes

ca8f5f0

AlexWaygood approved these changes Jan 29, 2025

View reviewed changes

ntBre merged commit 23c9884 into main Jan 29, 2025
21 checks passed

ntBre deleted the brent/f-string-quotes branch January 29, 2025 18:28

ntBre mentioned this pull request Jan 29, 2025

Preserve triple quotes and prefixes for strings #15818

Open

BrewTestBot mentioned this pull request Jan 30, 2025

ruff 0.9.4 Homebrew/homebrew-core#206058

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserve quotes in generated f-strings #15794

Preserve quotes in generated f-strings #15794

ntBre commented Jan 28, 2025

github-actions bot commented Jan 28, 2025 •

edited

Loading

AlexWaygood left a comment

ntBre commented Jan 28, 2025

dhruvmanila commented Jan 29, 2025

dhruvmanila commented Jan 29, 2025

AlexWaygood commented Jan 29, 2025

AlexWaygood commented Jan 29, 2025

ntBre commented Jan 29, 2025

AlexWaygood left a comment

Preserve quotes in generated f-strings #15794

Preserve quotes in generated f-strings #15794

Conversation

ntBre commented Jan 28, 2025

Summary

Details

Test Plan

github-actions bot commented Jan 28, 2025 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

Formatter (stable)

Formatter (preview)

AlexWaygood left a comment

Choose a reason for hiding this comment

ntBre commented Jan 28, 2025

dhruvmanila commented Jan 29, 2025

dhruvmanila commented Jan 29, 2025

AlexWaygood commented Jan 29, 2025

AlexWaygood commented Jan 29, 2025

ntBre commented Jan 29, 2025

AlexWaygood left a comment

Choose a reason for hiding this comment

github-actions bot commented Jan 28, 2025 •

edited

Loading

`ruff-ecosystem` results