improve case conversion happy path #97046

conradludgate · 2022-05-14T17:52:13Z

Someone shared the source code for Go's string case conversion.

It features a hot path for ascii-only strings (although I assume for reasons specific to go, they've opted for a read safe hot loop).

I've borrowed these ideas and also kept our existing code to provide a fast path + seamless utf-8 correct path fallback.

(Naive) Benchmarks can be found here https://github.com/conradludgate/case-conv

For the cases where non-ascii is found near the start, the performance of this algorithm does fall back to original speeds and has not had any measurable speed loss

rust-highfive · 2022-05-14T17:52:16Z

Hey! It looks like you've submitted a new PR for the library teams!

If this PR contains changes to any rust-lang/rust public library APIs then please comment with r? rust-lang/libs-api @rustbot label +T-libs-api -T-libs to request review from a libs-api team reviewer. If you're unsure where your change falls no worries, just leave it as is and the reviewer will take a look and make a decision to forward on if necessary.

Examples of T-libs-api changes:

Stabilizing library features
Introducing insta-stable changes such as new implementations of existing stable traits on existing stable types
Introducing new or changing existing unstable library APIs (excluding permanently unstable features / features without a tracking issue)
Changing public documentation in ways that create new stability guarantees
Changing observable runtime behavior of library APIs

rust-highfive · 2022-05-14T17:52:17Z

r? @Mark-Simulacrum

(rust-highfive has picked a reviewer for you, use r? to override)

library/alloc/src/str.rs

conradludgate · 2022-05-16T09:00:02Z

I have benchmarked this on unicode heavy texts too (a 3MB Russian text file) and have found marginally better performance in this change:

lowercase/russian       time:   [27.309 ms 27.339 ms 27.370 ms] *
lowercase/russian_std   time:   [28.642 ms 28.672 ms 28.703 ms]
uppercase/russian       time:   [40.350 ms 40.389 ms 40.429 ms] *
uppercase/russian_std   time:   [40.931 ms 41.055 ms 41.178 ms]

So I am not worried about the initial ascii checks being negative to performance in strictly non-ascii contexts

mohe2015 · 2022-05-19T06:58:29Z

Could you maybe try German or something like that. And put umlauts somewhere at the end? (Or just use random garbled text where the start is only ASCII and the end contains unicode

conradludgate · 2022-05-21T07:46:07Z

And put umlauts somewhere at the end? (Or just use random garbled text where the start is only ASCII and the end contains unicode

This is basically when my original benchmarks do. Features 2 copies of Macbeth, one untouched only ascii, the other features 16 bytes of non-ascii at the end. This was to test how well it handled pure ascii with the seamless break out:

lowercase/ascii         time:   [18.715 us 18.755 us 18.814 us] *
lowercase/unicode       time:   [18.735 us 18.756 us 18.784 us] *

lowercase/ascii_std     time:   [283.71 us 284.09 us 284.53 us]
lowercase/unicode_std   time:   [285.11 us 285.61 us 286.21 us]

Performance it expected to be somewhere in the middle if the text contains a unicode character half way through.

Which is exactly what we see in the following results:

lowercase/ascii         time:   [19.084 us 19.341 us 19.700 us] *
lowercase/unicode       time:   [122.33 us 122.40 us 122.49 us] *

lowercase/ascii_std     time:   [285.41 us 288.32 us 292.74 us]
lowercase/unicode_std   time:   [284.82 us 285.10 us 285.52 us]

library/alloc/src/str.rs

conradludgate · 2022-05-22T14:23:30Z

r? @thomcc

klensy · 2022-05-22T14:53:16Z

Nice benches, but what about short strings?
I've expecting that this will be run on short string more times, that on strings over few kilobytes.

conradludgate · 2022-05-22T18:05:32Z

Nice benches, but what about short strings? I've expecting that this will be run on short string more times, that on strings over few kilobytes.

Fair point. While still maybe unnatural, I ran the following benchmark

ascii.split('.').map(str::to_lowercase).collect::<Vec<_>>()

over my large text files, this way it's tested more against shorter random length strings. I got the following results:

current_std: average 591.62ms
this change: average 269.67ms

This most likely would improve by reducing the magic number

Reducing the magic number from 16 to 8 I got minimal improvements (-25%) over the small string bench, but significant regression in the large string bench (+90%)

thomcc

This mostly looks okay to me, and I expect it to be an improvement, but I'm not sure that uppercasing strings over 100 bytes long is actually the common case.

library/alloc/src/str.rs

thomcc · 2022-05-26T07:14:54Z

Do you have benchmarks of the current version?

conradludgate · 2022-05-26T07:25:26Z

Do you have benchmarks of the current version?

Long ascii string

new  time:   [35.890 us 35.972 us 36.063 us]
old  time:   [522.71 us 523.65 us 524.69 us]

Long string with unicode half way through

new  time:   [228.51 us 229.41 us 230.73 us]
old  time:   [546.95 us 547.98 us 549.07 us]

Short ascii strings

new  time:   [144.01 us 144.34 us 144.73 us]
old  time:   [595.05 us 596.77 us 598.67 us]

thomcc · 2022-05-26T07:27:40Z

This looks good to me, then. No rollup because of possible perf impact if things change cases.

@bors r+ rollup=never

klensy · 2022-05-26T10:47:16Z

Wait, maybe squash 10 commits?

klensy · 2022-05-26T11:00:42Z

This looks good to me, then. No rollup because of possible perf impact if things change cases.

@bors r+ rollup=never

bors sleep.

library/alloc/src/str.rs

thomcc · 2022-05-26T14:42:02Z

@bors r+ rollup=never

bors · 2022-05-26T14:42:04Z

📌 Commit d0f9930 has been approved by thomcc

bors · 2022-05-26T15:29:10Z

⌛ Testing commit d0f9930 with merge 1851f08...

bors · 2022-05-26T17:58:03Z

☀️ Test successful - checks-actions
Approved by: thomcc
Pushing 1851f08 to master...

rust-timer · 2022-05-26T19:59:58Z

Finished benchmarking commit (1851f08): comparison url.

Instruction count

Primary benchmarks: no relevant changes found
Secondary benchmarks: mixed results

	mean¹	max	count²
Regressions 😿 (primary)	N/A	N/A	0
Regressions 😿 (secondary)	0.1%	0.1%	1
Improvements 🎉 (primary)	N/A	N/A	0
Improvements 🎉 (secondary)	-1.0%	-1.0%	3
All 😿🎉 (primary)	N/A	N/A	0

Max RSS (memory usage)

Results

Primary benchmarks: no relevant changes found
Secondary benchmarks: mixed results

	mean¹	max	count²
Regressions 😿 (primary)	N/A	N/A	0
Regressions 😿 (secondary)	2.3%	2.3%	1
Improvements 🎉 (primary)	N/A	N/A	0
Improvements 🎉 (secondary)	-1.3%	-1.4%	5
All 😿🎉 (primary)	N/A	N/A	0

Cycles

This benchmark run did not return any relevant results for this metric.

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

@rustbot label: -perf-regression

the arithmetic mean of the percent change ↩ ↩²
number of relevant changes ↩ ↩²

rust-highfive assigned Mark-Simulacrum May 14, 2022

rustbot added the T-libs Relevant to the library team, which will review and decide on the PR/issue. label May 14, 2022

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label May 14, 2022

This comment has been minimized.

Sign in to view

conradludgate force-pushed the faster-ascii-case-conv-path branch from 6e85b6a to 49657b1 Compare May 15, 2022 08:48

SabrinaJewson reviewed May 15, 2022

View reviewed changes

library/alloc/src/str.rs Outdated Show resolved Hide resolved

library/alloc/src/str.rs Outdated Show resolved Hide resolved

klensy reviewed May 22, 2022

View reviewed changes

library/alloc/src/str.rs Outdated Show resolved Hide resolved

rust-highfive assigned thomcc and unassigned Mark-Simulacrum May 22, 2022

thomcc reviewed May 24, 2022

View reviewed changes

library/alloc/src/str.rs Outdated Show resolved Hide resolved

library/alloc/src/str.rs Outdated Show resolved Hide resolved

klensy reviewed May 26, 2022

View reviewed changes

library/alloc/src/str.rs Outdated Show resolved Hide resolved

improve case conversion happy path

d0f9930

conradludgate force-pushed the faster-ascii-case-conv-path branch from 43d1d20 to d0f9930 Compare May 26, 2022 12:20

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 26, 2022

bors added the merged-by-bors This PR was explicitly merged by bors. label May 26, 2022

bors merged commit 1851f08 into rust-lang:master May 26, 2022

rustbot added this to the 1.63.0 milestone May 26, 2022

Marcondiro mentioned this pull request May 4, 2024

Sometimes 'Σ' is not handled correctly in str.to_lowercase() #124714

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve case conversion happy path #97046

improve case conversion happy path #97046

conradludgate commented May 14, 2022 •

edited

Loading

rust-highfive commented May 14, 2022

rust-highfive commented May 14, 2022

This comment has been minimized.

conradludgate commented May 16, 2022

mohe2015 commented May 19, 2022

conradludgate commented May 21, 2022 •

edited

Loading

conradludgate commented May 22, 2022

klensy commented May 22, 2022 •

edited

Loading

conradludgate commented May 22, 2022 •

edited

Loading

thomcc left a comment

thomcc commented May 26, 2022

conradludgate commented May 26, 2022

thomcc commented May 26, 2022 •

edited

Loading

klensy commented May 26, 2022 •

edited

Loading

klensy commented May 26, 2022

thomcc commented May 26, 2022

bors commented May 26, 2022

bors commented May 26, 2022

bors commented May 26, 2022

rust-timer commented May 26, 2022

improve case conversion happy path #97046

improve case conversion happy path #97046

Conversation

conradludgate commented May 14, 2022 • edited Loading

rust-highfive commented May 14, 2022

rust-highfive commented May 14, 2022

This comment has been minimized.

conradludgate commented May 16, 2022

mohe2015 commented May 19, 2022

conradludgate commented May 21, 2022 • edited Loading

conradludgate commented May 22, 2022

klensy commented May 22, 2022 • edited Loading

conradludgate commented May 22, 2022 • edited Loading

thomcc left a comment

Choose a reason for hiding this comment

thomcc commented May 26, 2022

conradludgate commented May 26, 2022

Long ascii string

Long string with unicode half way through

Short ascii strings

thomcc commented May 26, 2022 • edited Loading

klensy commented May 26, 2022 • edited Loading

klensy commented May 26, 2022

thomcc commented May 26, 2022

bors commented May 26, 2022

bors commented May 26, 2022

bors commented May 26, 2022

rust-timer commented May 26, 2022

Instruction count

Max RSS (memory usage)

Cycles

Footnotes

conradludgate commented May 14, 2022 •

edited

Loading

conradludgate commented May 21, 2022 •

edited

Loading

klensy commented May 22, 2022 •

edited

Loading

conradludgate commented May 22, 2022 •

edited

Loading

thomcc commented May 26, 2022 •

edited

Loading

klensy commented May 26, 2022 •

edited

Loading