cmd/compile: automatically stack-allocate small non-escaping slices of dynamic size #27625

rasky · 2018-09-11T17:43:59Z

This commit:
95a11c7

shows a real-world performance gain triggered by moving a small non-escaping slice to the stack. It is my understanding that the Go compiler always allocated the slice in the heap because the length was not known at compile time.

Would it make sense to attempt a similar code transformation for many/all non escaping slices? What would be the cons? Any suggestion on how to identify which slices could benefit from this transformation and which would possibly just create overhead?

The text was updated successfully, but these errors were encountered:

randall77 · 2018-09-11T18:29:56Z

I think it's almost always a win to start on the stack, if we can.

This example is tricky, and probably very common:

var b []byte
for ... {
    b = append(b, ...)
}

How do we preallocate some space for b? We'd want to do:

var bStore [32]byte  // on stack
b := bStore[:0]

But that isn't right if for loop runs for 0 iterations. The result must be nil (and have 0 capacity).

mvdan · 2018-09-11T18:31:01Z

I wonder if always applying this transformation, even if it never hurt performance, would make binaries noticeably bigger.

Would this be done for all allocations of small non-escaping slices? Or only for those where the capacity is known at compile time to be small?

randall77 · 2018-09-11T19:25:43Z

We already allocate small non-escaping slices on the stack if their capacity is known at compile time.
This issue would be about when the size is not known at compile time.

ALTree · 2018-09-11T19:31:25Z

Is this #20533?

randall77 · 2018-09-11T20:26:27Z

It's definitely similar. #20533 is going more down the road of really allocating n bytes when you do a := make([]byte, n) (alloca-style, or on the heap with explicit free). This one is about allocating a constant size buffer and only using it when n is small enough.

rasky · 2018-09-11T22:22:04Z

Transforming a := make([]byte, n) into:

var a []byte
if n < 64 {
    a = make([]byte, n, 64)   // stack allocation
} else {
    a = make([]byte, n)       // heap allocation
}

can surely have some code size impact. In some cases, maybe prove is able to remove one of the two branches but I'm not holding my breath on that. I wonder if it's still worth, performance wise.

We should also explore doing this for slices of different types (while keeping total stack allocation within a certain limit).

navytux · 2020-04-30T06:49:50Z

Transforming a := make([]byte, n) into ...

Recent example where such transformation was done by hand for performance: 17d5cef (CL 230657).

/cc @martisch

rasky added the Performance label Sep 11, 2018

rasky changed the title ~~cmd/compile: automatically stack-allocate small non-escaping slices~~ cmd/compile: automatically stack-allocate small non-escaping slices of dynamic size Sep 11, 2018

gopherbot added the compiler/runtime Issues related to the Go compiler and/or runtime. label Jul 13, 2022

mknyszek added this to Go Compiler / Runtime Jul 13, 2022

mknyszek moved this to Triage Backlog in Go Compiler / Runtime Jul 15, 2022

flyingmutant mentioned this issue Jul 28, 2022

Find a good design for Sample flyingmutant/rand#2

Open

seankhliao added this to the Unplanned milestone Aug 20, 2022

seankhliao added the NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. label Aug 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

cmd/compile: automatically stack-allocate small non-escaping slices of dynamic size #27625

cmd/compile: automatically stack-allocate small non-escaping slices of dynamic size #27625

rasky commented Sep 11, 2018

randall77 commented Sep 11, 2018

Uh oh!

mvdan commented Sep 11, 2018

Uh oh!

randall77 commented Sep 11, 2018

Uh oh!

ALTree commented Sep 11, 2018

Uh oh!

randall77 commented Sep 11, 2018

Uh oh!

rasky commented Sep 11, 2018

Uh oh!

navytux commented Apr 30, 2020

Uh oh!

cmd/compile: automatically stack-allocate small non-escaping slices of dynamic size #27625

cmd/compile: automatically stack-allocate small non-escaping slices of dynamic size #27625

Comments

rasky commented Sep 11, 2018

randall77 commented Sep 11, 2018

Uh oh!

mvdan commented Sep 11, 2018

Uh oh!

randall77 commented Sep 11, 2018

Uh oh!

ALTree commented Sep 11, 2018

Uh oh!

randall77 commented Sep 11, 2018

Uh oh!

rasky commented Sep 11, 2018

Uh oh!

navytux commented Apr 30, 2020

Uh oh!