Optimize sum of Durations by using custom function #51598

Pazzaz · 2018-06-16T19:05:00Z

The current impl Sum for Duration uses fold to perform several adds (or really checked_adds) of durations. In doing so, it has to guarantee the number of nanoseconds is valid after every addition. If you squeese the current implementation into a single function it looks kind of like this:

fn sum<I: Iterator<Item = Duration>>(iter: I) -> Duration {
    let mut sum = Duration::new(0, 0);
    for rhs in iter {
        if let Some(mut secs) = sum.secs.checked_add(rhs.secs) {
            let mut nanos = sum.nanos + rhs.nanos;
            if nanos >= NANOS_PER_SEC {
                nanos -= NANOS_PER_SEC;
                if let Some(new_secs) = secs.checked_add(1) {
                    secs = new_secs;
                } else {
                    panic!("overflow when adding durations");
                }
            }
            sum = Duration { secs, nanos }
        } else {
            panic!("overflow when adding durations");
        }
    }
    sum
}

We only need to check if nanos is in the correct range when giving our final answer so we can have a more optimized version like so:

fn sum<I: Iterator<Item = Duration>>(iter: I) -> Duration {
    let mut total_secs: u64 = 0;
    let mut total_nanos: u64 = 0;

    for entry in iter {
        total_secs = total_secs
            .checked_add(entry.secs)
            .expect("overflow in iter::sum over durations");
        total_nanos = match total_nanos.checked_add(entry.nanos as u64) {
            Some(n) => n,
            None => {
                total_secs = total_secs
                    .checked_add(total_nanos / NANOS_PER_SEC as u64)
                    .expect("overflow in iter::sum over durations");
                (total_nanos % NANOS_PER_SEC as u64) + entry.nanos as u64
            }
        };
    }
    total_secs = total_secs
        .checked_add(total_nanos / NANOS_PER_SEC as u64)
        .expect("overflow in iter::sum over durations");
    total_nanos = total_nanos % NANOS_PER_SEC as u64;
    Duration {
        secs: total_secs,
        nanos: total_nanos as u32,
    }
}

We now only convert total_nanos to total_secs (1) if total_nanos overflows and (2) at the end of the function when we have to output a valid Duration. This gave a 5-22% performance improvement when I benchmarked it, depending on how big the nano value of the Durations in iter were.

pietroalbini · 2018-06-16T19:08:57Z

r? @sfackler (someone from the libs team)

pietroalbini · 2018-06-25T11:13:46Z

Ping from triage @sfackler! This PR needs your review.

sfackler · 2018-06-27T01:55:51Z

@bors r+

Sorry for the delay!

bors · 2018-06-27T01:55:52Z

📌 Commit d22ad76 has been approved by sfackler

bors · 2018-06-27T04:02:12Z

⌛ Testing commit d22ad76 with merge 612c280...

Optimize sum of Durations by using custom function The current `impl Sum for Duration` uses `fold` to perform several `add`s (or really `checked_add`s) of durations. In doing so, it has to guarantee the number of nanoseconds is valid after every addition. If you squeese the current implementation into a single function it looks kind of like this: ````rust fn sum<I: Iterator<Item = Duration>>(iter: I) -> Duration { let mut sum = Duration::new(0, 0); for rhs in iter { if let Some(mut secs) = sum.secs.checked_add(rhs.secs) { let mut nanos = sum.nanos + rhs.nanos; if nanos >= NANOS_PER_SEC { nanos -= NANOS_PER_SEC; if let Some(new_secs) = secs.checked_add(1) { secs = new_secs; } else { panic!("overflow when adding durations"); } } sum = Duration { secs, nanos } } else { panic!("overflow when adding durations"); } } sum } ```` We only need to check if `nanos` is in the correct range when giving our final answer so we can have a more optimized version like so: ````rust fn sum<I: Iterator<Item = Duration>>(iter: I) -> Duration { let mut total_secs: u64 = 0; let mut total_nanos: u64 = 0; for entry in iter { total_secs = total_secs .checked_add(entry.secs) .expect("overflow in iter::sum over durations"); total_nanos = match total_nanos.checked_add(entry.nanos as u64) { Some(n) => n, None => { total_secs = total_secs .checked_add(total_nanos / NANOS_PER_SEC as u64) .expect("overflow in iter::sum over durations"); (total_nanos % NANOS_PER_SEC as u64) + entry.nanos as u64 } }; } total_secs = total_secs .checked_add(total_nanos / NANOS_PER_SEC as u64) .expect("overflow in iter::sum over durations"); total_nanos = total_nanos % NANOS_PER_SEC as u64; Duration { secs: total_secs, nanos: total_nanos as u32, } } ```` We now only convert `total_nanos` to `total_secs` (1) if `total_nanos` overflows and (2) at the end of the function when we have to output a valid `Duration`. This gave a 5-22% performance improvement when I benchmarked it, depending on how big the `nano` value of the `Duration`s in `iter` were.

bors · 2018-06-27T06:04:44Z

☀️ Test successful - status-appveyor, status-travis
Approved by: sfackler
Pushing 612c280 to master...

Optimize sum of Durations by using custom function

d22ad76

rust-highfive assigned sfackler Jun 16, 2018

pietroalbini added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jun 16, 2018

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jun 27, 2018

bors merged commit d22ad76 into rust-lang:master Jun 27, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize sum of Durations by using custom function #51598

Optimize sum of Durations by using custom function #51598

Pazzaz commented Jun 16, 2018

Uh oh!

pietroalbini commented Jun 16, 2018

Uh oh!

pietroalbini commented Jun 25, 2018

Uh oh!

sfackler commented Jun 27, 2018

Uh oh!

bors commented Jun 27, 2018

Uh oh!

bors commented Jun 27, 2018

Uh oh!

bors commented Jun 27, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Optimize sum of Durations by using custom function #51598

Optimize sum of Durations by using custom function #51598

Conversation

Pazzaz commented Jun 16, 2018

Uh oh!

pietroalbini commented Jun 16, 2018

Uh oh!

pietroalbini commented Jun 25, 2018

Uh oh!

sfackler commented Jun 27, 2018

Uh oh!

bors commented Jun 27, 2018

Uh oh!

bors commented Jun 27, 2018

Uh oh!

bors commented Jun 27, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants