reduce `single_char_pattern` to only lint on ascii chars #11852

llogiq · 2023-11-21T22:15:40Z

This should mostly fix the single_char_pattern lint, because with a single byte, the optimizer will usually see through the char-to-string-expansion and single loop iteration. This fixes #11675 and #8111.

Update: As per the meeting on November 28th, 2023, we voted to also downgrade the lint to pedantic.

changelog: downgrade [single_char_pattern] to pedantic

rustbot · 2023-11-21T22:15:46Z

r? @Alexendoo

(rustbot has picked a reviewer for you, use r? to override)

Alexendoo · 2023-11-21T22:59:49Z

Maybe it's time to retire this lint? Which is faster seems unpredictable and subject to change across versions, it seems like a better avenue would be opening issues in rustc for any case where one is slower than the other

david-monroe · 2023-11-22T10:04:55Z

clippy_lints/src/methods/mod.rs

+    /// Performing these methods using a `char` can be faster than
+    /// using a `str` because it needs one less indirection.


This is not true in the current implementation. If you want to be implementation agnostic, you might as well write the opposite:

Suggested change

/// Performing these methods using a `char` can be faster than

/// using a `str` because it needs one less indirection.

/// Performing these methods using a `str` can be faster than

/// using a `char` because it needs one less conversion.

It apparently depends on a lot of factors. The char based version stores the UTF8 data inline, whereas the &str one stores a ref to the caller-supplied string slice instead, so if the Searcher instantiation is const, it gets by with one less pointer.

Of course e.g. the fastest starts_with for an ascii character is haystack.as_bytes().get(0) == Some(&(c as u8)), dropping below a nanosecond in my benchmarks.

flip1995 · 2023-11-22T10:13:40Z

Maybe it's time to retire this lint?

At least I think we should move it to style, as it no longer affects performance IIUC.

bors · 2024-01-03T21:54:18Z

☔ The latest upstream changes (presumably #12030) made this pull request unmergeable. Please resolve the merge conflicts.

llogiq · 2024-01-06T12:17:51Z

@Alexendoo should I rebase or close?

Alexendoo · 2024-01-07T14:03:22Z

Sorry for the delay, given the decision to move it to pedantic do we still want to make the multibyte change? If we're dropping the perf aspect of the lint to be more a stylistic one I think it would make sense to lint multibyte characters still & modify the lint description to be less perf focused

llogiq · 2024-01-07T20:45:35Z

The problem I see with that is that for multibyte inputs using a char may actually hurt performance. So yes, I believe we should reduce the lint scope even as we move it to pedantic.

xFrednet · 2024-04-01T09:48:23Z

Hey @llogiq, this is a ping from triage, since there hasn't been any activity in some time. Are you still planning to continue this implementation?

If you have any questions, you're always welcome to ask them in this PR or on Zulip.

@rustbot author

llogiq · 2024-04-01T10:46:39Z

I'll take another look during the next week.

Alexendoo · 2024-04-02T14:29:40Z

Ignoring multibyte chars as a known issue sounds reasonable to avoid a perf regression if it's still there, ideally we'd have a rustc issue to link to

I still think the description should be changed though, the perf angle doesn't hold up to me

llogiq · 2024-04-11T20:56:57Z

@Alexendoo / @xFrednet I rebased the implementation and updated the docs. r?

xFrednet · 2024-04-22T13:28:28Z

Looks good to me.

Thank you for the update :)

@bors r+

bors · 2024-04-22T13:28:31Z

📌 Commit 54de78a has been approved by xFrednet

It is now in the queue for this repository.

bors · 2024-04-22T13:29:38Z

⌛ Testing commit 54de78a with merge fc6dfeb...

bors · 2024-04-22T13:37:39Z

☀️ Test successful - checks-action_dev_test, checks-action_remark_test, checks-action_test
Approved by: xFrednet
Pushing fc6dfeb to master...

rustbot assigned Alexendoo Nov 21, 2023

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties label Nov 21, 2023

david-monroe reviewed Nov 22, 2023

View reviewed changes

Alexendoo added the I-nominated Issue: Nominated to be discussed at the next Clippy meeting label Nov 27, 2023

llogiq force-pushed the single-char-pattern-ascii-only branch from d69a594 to 3d98b67 Compare November 29, 2023 07:11

flip1995 removed the I-nominated Issue: Nominated to be discussed at the next Clippy meeting label Dec 11, 2023

rustbot added S-waiting-on-author Status: This is awaiting some action from the author. (Use `@rustbot ready` to update this status) and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties labels Apr 1, 2024

reduce single_char_pattern to only lint on ascii chars

58c53e8

llogiq force-pushed the single-char-pattern-ascii-only branch from 3d98b67 to 1a69f84 Compare April 11, 2024 18:32

downgrade to pedantic

54de78a

llogiq force-pushed the single-char-pattern-ascii-only branch from 1a69f84 to 54de78a Compare April 11, 2024 20:24

rust-lang deleted a comment from rustbot Apr 13, 2024

xFrednet approved these changes Apr 22, 2024

View reviewed changes

xFrednet assigned xFrednet and unassigned Alexendoo Apr 22, 2024

bors merged commit fc6dfeb into master Apr 22, 2024
6 checks passed

llogiq deleted the single-char-pattern-ascii-only branch April 22, 2024 16:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reduce `single_char_pattern` to only lint on ascii chars #11852

reduce `single_char_pattern` to only lint on ascii chars #11852

llogiq commented Nov 21, 2023 •

edited

Loading

rustbot commented Nov 21, 2023

Alexendoo commented Nov 21, 2023

david-monroe Nov 22, 2023

llogiq Nov 22, 2023

flip1995 commented Nov 22, 2023

bors commented Jan 3, 2024

llogiq commented Jan 6, 2024

Alexendoo commented Jan 7, 2024

llogiq commented Jan 7, 2024

xFrednet commented Apr 1, 2024

llogiq commented Apr 1, 2024

Alexendoo commented Apr 2, 2024

llogiq commented Apr 11, 2024

xFrednet commented Apr 22, 2024

bors commented Apr 22, 2024

bors commented Apr 22, 2024

bors commented Apr 22, 2024

		/// Performing these methods using a `char` can be faster than
		/// using a `str` because it needs one less indirection.

reduce single_char_pattern to only lint on ascii chars #11852

reduce single_char_pattern to only lint on ascii chars #11852

Conversation

llogiq commented Nov 21, 2023 • edited Loading

rustbot commented Nov 21, 2023

Alexendoo commented Nov 21, 2023

david-monroe Nov 22, 2023

Choose a reason for hiding this comment

llogiq Nov 22, 2023

Choose a reason for hiding this comment

flip1995 commented Nov 22, 2023

bors commented Jan 3, 2024

llogiq commented Jan 6, 2024

Alexendoo commented Jan 7, 2024

llogiq commented Jan 7, 2024

xFrednet commented Apr 1, 2024

llogiq commented Apr 1, 2024

Alexendoo commented Apr 2, 2024

llogiq commented Apr 11, 2024

xFrednet commented Apr 22, 2024

bors commented Apr 22, 2024

bors commented Apr 22, 2024

bors commented Apr 22, 2024

reduce `single_char_pattern` to only lint on ascii chars #11852

reduce `single_char_pattern` to only lint on ascii chars #11852

llogiq commented Nov 21, 2023 •

edited

Loading