`Iterator::cloned`: document side effect behavior #92543

llogiq · 2022-01-03T22:57:38Z

During discussing work on clippy's upcoming iter_overeager_cloned lint, I found that the fact that the cloned implementation never optimizes any clone call away because the clone implementation could contain an observable side effect is underdocumented in the Iterator::cloned method.

This PR adds a note to rectify this omission.

rust-highfive · 2022-01-03T22:57:41Z

r? @joshtriplett

(rust-highfive has picked a reviewer for you, use r? to override)

library/core/src/iter/traits/iterator.rs

camelid · 2022-01-04T02:56:10Z

library/core/src/iter/traits/iterator.rs

+    /// on `iter.cloned().last()`, all elements will be cloned even if only the
+    /// last one is actually returned.


Is it really worth making this guarantee? I could imagine a future compiler optimization (or perhaps specialization) that would eliminate clone calls for non–side effecting Clone implementors, such as Vec.

We will soon have a clippy lint for this, so no, I don't think we should sacrifice backwards compatibility for a possible perf gain.

Maybe the clippy lint is the issue then?

Also, there is no backwards compatibility issue since the API contract currently does not promise this. It only becomes an issue once the guarantee is added.

There is no "promise", but it has been observable behavior since 1.0.0. So if we were to change that, we better have a good idea if any code depends on it.

Also why would the clippy lint be an issue?

I assume the clippy lint is implying something about the side-effects since it was brought up here and since the lint is motivating this change? The lint itself wasn't linked, so I am just guessing.

There is no "promise", but it has been observable behavior since 1.0.0.

observable behavior != stability guarantee

This PR turns what someone might hypothetically rely on into a guarantee, that's a significant change that paints us into a corner.

I'm not wedded to this PR – however I'd like to have the (non-)guarantees documented. I'd swap this out with "you should not rely on the items' Clone impl being called for all items observed by the iterator" in a heartbeat if that's what we decide.

Leaving the observable behavior undocumented makes it harder to say "you shouldn't have relied on it" later.

the8472 · 2022-01-04T03:11:28Z

#90209 intends to optimize away some clones here.

the8472 · 2022-01-04T06:45:50Z

Additionally this PR (saying that cloning always is a side-effect) is inconsistent with RFC 1521

llogiq · 2022-01-04T07:34:38Z

The PR says "cloning is regarded as a side effect, becaue Clone implementations may contain observable behavior". This doesn't mean "cloning is always regarded as a side effect". So how is this inconsistent with RFC 1521? The RFC just specifies that cloning is side effect free for Copy types.

llogiq · 2022-01-04T07:35:47Z

OK, so apparently @m-ou-se thinks that cloning should not matter here. I'll close this if #90209 lands.

the8472 · 2022-01-04T07:42:37Z

So how is this inconsistent with RFC 1521?

It allows us to turn .clone() calls, even when they contain side-effects, into copies via specialization. The doc comment says

For example iter.cloned().last() will clone all elements even if only the last is actually returned.

The will clone part wouldn't be true in that case.

llogiq · 2022-01-04T17:16:30Z

Let's take a step back and see what the problem is and what options we have to deal with it:

We see a performance footgun, in that elements may be needlessly cloned when e.g. doing iter.cloned().last(). However, the clone calls could include observable behavior that in theory users may rely on.

So we can:

Use a clippy lint to steer users to cloning as late as possible. Document the perf footgun & reasoning (i.e. this PR). The perf footgun is still there, but users will mostly know about and avoid it.
Optimize out the clones in general. This may break some software that depends on it, but they shouldn't have so all right I guess? Given that iterator specialization is a tricky business, I'm also afraid that we might get edge cases where the optimization fails to apply. We should then also document the lack of clones under certain conditions.
Do nothing.

Is my analysis here correct?

the8472 · 2022-01-04T22:31:54Z

You have to consider code that clippy doesn't see, e.g. iterators returned by a function and then consumed by another function, possibly in another crate. Or generic code which takes T: Clone but some T may be Copy too.
On the other hand it is correct that specialization can't solve everything because the possible combinations of iterator adapters can be difficult to cover with specialization traits and of course there are 3rd-party adapters which we can't specialize.
The approaches can be complementary.

And neither requires a documentation change. Just because a clippy lint is added we can still reserve the possibility of performing additional optimizations. And just because an optimization is added we do not necessarily need to document it. We have many optimizations in the standard library which are best-effort and not guaranteed. Some of them are observable if you look hard enough.

There's also the second-order problem that if we add explicit non-guarantees to Cloned, then does this mean the absence of non-guarantees on other adapters hints that they will never do any optimization? Obviously that's nonsense, but every time we make something explicit it shifts expectations a little. So it might be better to have such documentation about side-effects in a more central place, e.g. on the core::iter module saying that some adapters may perform side-effect eliding optimizations based on the overall iterator-shape alone, which is more aggressive than what the compiler is allowed to do.

in that elements may be needlessly cloned when e.g. doing iter.cloned().last()

At least the more general case cloned().foo().last() can optimize out the unnecessary clones without specialization if Foo requires a DoubleEndedIterator because then it could unconditionally implement last() via next_back(). Or some adapter could require ExactSizeIterator and then implement count() as len() and not clone any items at all. So even today we can't say too much about which clones are guaranteed to happen since adapters can't be reasoned about in isolation.

llogiq · 2022-01-04T23:02:47Z

That last point is a good one. I agree that the documentation should be centralized, but am not sure whether the iter module doc is the right place for it. We'll mention it in the clippy lint docs anyway.

So I'm closing this for now. Hope that #90209 is successful. 👍

rust-highfive assigned joshtriplett Jan 3, 2022

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jan 3, 2022

llogiq force-pushed the cloned-side-effect-doc branch from 436147f to 703a639 Compare January 3, 2022 22:58

camelid requested changes Jan 4, 2022

View reviewed changes

camelid added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label Jan 4, 2022

Iterator::cloned: document side effect behavior

49632b9

llogiq force-pushed the cloned-side-effect-doc branch from 703a639 to 49632b9 Compare January 4, 2022 07:40

llogiq closed this Jan 4, 2022

the8472 mentioned this pull request Jan 5, 2022

Elide clone operation when calling iter::Cloned::advance_{back}_by() #90209

Closed

llogiq deleted the cloned-side-effect-doc branch January 16, 2022 00:34

		/// on `iter.cloned().last()`, all elements will be cloned even if only the
		/// last one is actually returned.

Iterator::cloned: document side effect behavior #92543

Iterator::cloned: document side effect behavior #92543

Uh oh!

Conversation

llogiq commented Jan 3, 2022

Uh oh!

rust-highfive commented Jan 3, 2022

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

the8472 commented Jan 4, 2022

Uh oh!

the8472 commented Jan 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llogiq commented Jan 4, 2022

Uh oh!

llogiq commented Jan 4, 2022

Uh oh!

the8472 commented Jan 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llogiq commented Jan 4, 2022

Uh oh!

the8472 commented Jan 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llogiq commented Jan 4, 2022

Uh oh!

Uh oh!

`Iterator::cloned`: document side effect behavior #92543

`Iterator::cloned`: document side effect behavior #92543

the8472 commented Jan 4, 2022 •

edited

Loading

the8472 commented Jan 4, 2022 •

edited

Loading

the8472 commented Jan 4, 2022 •

edited

Loading