Provides implementation of `Vec::extend_from_slice` optimized for `T: Copy` #236

zslayton · 2024-02-18T11:57:20Z

Addresses #235.

This PR is meant to be a jumping-off point for discussion. It adds a new inherent impl method to Vec:

impl<'bump, T: 'bump + Copy> Vec<'bump, T> {
    /// ...
    pub fn extend_from_slice_copy(&mut self, other: &[T]) {
    // ...
    }
}

It moves the logic added in #229 from String to Vec and then re-implements push_str by calling the new extend_from_slice_copy.

I've duplicated the push_str benchmarks but for Vec<'_, u8> and the performance improves as expected:

zslayton · 2024-02-19T15:58:39Z

@overlookmotel expressed a concern that unconditionally using copy_nonoverlapping might result in small performance regressions in the case of a small slices/strings.

I extended the benchmarks in this PR to test a variety of sizes between 4 bytes and 16KB. The results--in this microbenchmark, running on my machine--indicated that copy_nonoverlapping is a win for slices of any size.

overlookmotel · 2024-02-19T22:14:45Z

That's interesting! I wonder why we saw a slow-down in OXC (though an extremely small one) with the other change.

The other explanations I can think of are:

criterion::black_box is not doing its job properly in your benchmarks, which would allow the compiler to convert the copy_nonoverlapping calls to inlined copies.
It performs better with slice lengths which are multiples of 8 / 16 etc (seems plausible). Maybe try a "weird" length like 5 or 7 and see how it does with that?

Sorry, not trying to pick holes in your work, just trying to understand.

fitzgen

Thanks! Looks good to me modulo nitpick below.

Regarding short strings and possible perf regressions: nothing I've read seems serious enough to warrant reverting that optimization. I'm happy to receive follow up PRs that further improve things here, if there is anything further to improve.

fitzgen · 2024-02-20T20:25:48Z

.gitignore

+
+# JetBrains IDE files (e.g. RustRover)
+.idea


This seems like something that is more appropriate for your home directory's .gitignore than every project you contribute to.

🤦 TIL that you can put a .gitignore in your home directory. I'll remove this, thanks.

I also noticed that Vec's impl of std::io::Write can be updated to use this method; I'll push another commit with that change tomorrow.

overlookmotel · 2024-02-20T23:15:13Z

Regarding short strings and possible perf regressions: nothing I've read seems serious enough to warrant reverting that optimization. I'm happy to receive follow up PRs that further improve things here, if there is anything further to improve.

Yes absolutely, the performance "regression" we saw was absolutely tiny, and for general purpose use 80x faster for long strings outweighs 0.1% slower for very short strings. I just wanted to put it on the record that there was some downside to the PR I made, rather than leave it unsaid, in case it affects anyone else.

zslayton · 2024-02-21T13:52:01Z

It performs better with slice lengths which are multiples of 8 / 16 etc (seems plausible). Maybe try a "weird" length like 5 or 7 and see how it does with that?

This seemed plausible to me too! I added a few non-power-of-2 cases to the benchmark to check it out, but it looks like the improvement holds for those as well.

fitzgen

Thanks!

overlookmotel · 2024-02-21T17:34:49Z

@zslayton Thanks for tolerating my nitty comments, and for benching shorter byte ranges. I remain curious about why we saw the slight slow-down we did on similar PR for strings, but probably I should stop worrying!

zslayton · 2024-02-21T18:41:53Z

@zslayton Thanks for tolerating my nitty comments, and for benching shorter byte ranges.

Sure thing! I appreciate having another set of eyes on this, I'd rather spend some time double-checking things than accidentally cause a regression for all of these folks 🙃😬.

I remain curious about why we saw the slight slow-down we did on similar PR for strings, but probably I should stop worrying!

I'm curious too!

Adds extend_from_slice_copy inherent impl

4a777f7

zslayton mentioned this pull request Feb 19, 2024

Improve performance of extend_from_slice where T: Copy #235

Closed

Updated comments

6b0630f

zslayton added 3 commits February 19, 2024 10:59

Extends benchmarks to test a variety of slice lengths

f3b0616

Updates doc comment for extend_from_slice_copy

612b210

Removes comment with open question about alignment

b338613

fitzgen reviewed Feb 20, 2024

View reviewed changes

zslayton added 3 commits February 21, 2024 08:26

Changes Vec's impl of io::Write to use extend_from_slice_copy

41da5ba

Removes changes to .gitignore

3e204e3

adds non-power-of-2 sizes to the benchmark

a84b390

zslayton requested a review from fitzgen February 21, 2024 13:52

fitzgen approved these changes Feb 21, 2024

View reviewed changes

fitzgen merged commit 54c88f0 into fitzgen:main Feb 21, 2024
8 checks passed

zslayton deleted the slice-copy branch February 21, 2024 18:41

This was referenced Mar 1, 2024

Adds Vec::extend_from_slices_copy that accepts multiple slices #240

Merged

Implements writing e-expressions in binary 1.1 amazon-ion/ion-rust#722

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provides implementation of `Vec::extend_from_slice` optimized for `T: Copy` #236

Provides implementation of `Vec::extend_from_slice` optimized for `T: Copy` #236

zslayton commented Feb 18, 2024 •

edited

Loading

zslayton commented Feb 19, 2024

overlookmotel commented Feb 19, 2024

fitzgen left a comment

fitzgen Feb 20, 2024

zslayton Feb 20, 2024

overlookmotel commented Feb 20, 2024 •

edited

Loading

zslayton commented Feb 21, 2024

fitzgen left a comment

overlookmotel commented Feb 21, 2024

zslayton commented Feb 21, 2024

Provides implementation of Vec::extend_from_slice optimized for T: Copy #236

Provides implementation of Vec::extend_from_slice optimized for T: Copy #236

Conversation

zslayton commented Feb 18, 2024 • edited Loading

zslayton commented Feb 19, 2024

overlookmotel commented Feb 19, 2024

fitzgen left a comment

Choose a reason for hiding this comment

fitzgen Feb 20, 2024

Choose a reason for hiding this comment

zslayton Feb 20, 2024

Choose a reason for hiding this comment

overlookmotel commented Feb 20, 2024 • edited Loading

zslayton commented Feb 21, 2024

fitzgen left a comment

Choose a reason for hiding this comment

overlookmotel commented Feb 21, 2024

zslayton commented Feb 21, 2024

Provides implementation of `Vec::extend_from_slice` optimized for `T: Copy` #236

Provides implementation of `Vec::extend_from_slice` optimized for `T: Copy` #236

zslayton commented Feb 18, 2024 •

edited

Loading

overlookmotel commented Feb 20, 2024 •

edited

Loading