gh-91247: improve performance of list and tuple repeat (with specialization for n=1) #91482

eendebakpt · 2022-04-12T19:14:09Z

Compared to current main branch there are 3 improvements

For the list inplace repeat there is an optimized copy method
This PR solves the inefficient pointer chasing for small (size 2 to 7) lists and tuples
It shares code between the implementations of list repeat, tuple repeat and list inplace repeat

This PR keeps the specializations for repeats of lists and tuples with length 1. An alternative PR without specialization for n=1 (#32045) was rejected because the specializations might yield a speedup.

For performance comparisons to main and the version without specializations and more detailed discussion see issue #91247.

The inefficient pointer chasing can be seen in the following performance benchmark:

(results obtained with runner.timeit(name=f"list({n}) repeat {r}", stmt=f"x=a*{r}", setup=f'a=list(range({n}))'))

Issue: Improve performance of list and tuple repeat methods #91247

eendebakpt · 2022-04-21T21:18:43Z

@sweeneyde Could you have a look at this ticket? The approach is similar to #31856 and #31999 which you reviewed.

Include/object.h

Include/internal/pycore_object.h

Objects/listobject.c

Objects/tupleobject.c

Include/internal/pycore_object.h

sweeneyde · 2022-05-24T12:30:01Z

@eendebakpt Sorry this slipped under my radar

@vstinner, would you mind reviewing? I think the change is good in principle but I want another reviewer and to make sure the header file changes are appropriate.

Include/internal/pycore_object.h

Include/internal/pycore_list.h

Include/internal/pycore_object.h

kumaraditya303

Minor nits

Objects/listobject.c

Co-authored-by: Kumar Aditya <[email protected]>

Misc/NEWS.d/next/Core and Builtins/2022-03-22-13-12-27.bpo-47091.tJcy-P.rst

…91.tJcy-P.rst Co-authored-by: Kumar Aditya <[email protected]>

Include/internal/pycore_object.h

Co-authored-by: Kumar Aditya <[email protected]>

sweeneyde

Some minor nits, but this is looking good. Thanks for your patience and persistence on this.

Include/internal/pycore_list.h

Objects/listobject.c

Include/internal/pycore_object.h

Objects/tupleobject.c

Include/internal/pycore_object.h

Co-authored-by: Dennis Sweeney <[email protected]>

sweeneyde · 2022-07-25T17:10:54Z

I verified a benchmark on a Windows machine since this might be a bit compiler-specific.

pyperf timeit -s "a = [0,1]" "a * 10_000"
Mean +- std dev: [before] 32.9 us +- 0.3 us -> [after] 21.8 us +- 0.3 us: 1.51x faster

Looks great!

bedevere-bot · 2022-07-25T17:12:59Z

🤖 New build scheduled with the buildbot fleet by @sweeneyde for commit 516f091 🤖

If you want to schedule another build, you need to add the ":hammer: test-with-buildbots" label again.

sweeneyde · 2022-07-26T02:00:55Z

Objects/listobject.c

            *dest++ = *src++;
        }
+
+        _Py_memory_repeat((char *)np->ob_item, sizeof(PyObject *)*output_size,


Note to self: this cannot overflow because list_new_prealloc uses PyMem_New, and so this multiplication already succeeded at some point.

bedevere-bot added the awaiting review label Apr 12, 2022

eendebakpt marked this pull request as draft April 12, 2022 19:14

eendebakpt mentioned this pull request Apr 12, 2022

gh-91247: improve performance of list and tuple repeat #32045

Closed

eendebakpt marked this pull request as ready for review April 12, 2022 21:26

eendebakpt and others added 8 commits April 21, 2022 20:20

use memcpy in list repat and list inplace repeat

48d758c

use memcpy in tuple repeat

e32f554

fix debug build

3408397

remove double ref counting

2ddedf9

📜🤖 Added by blurb_it.

74e7cc0

make implementations of list repeat and tuple repeat similar

493bb9d

eliminate duplicated code

4f292a4

fix ci

5445cfb

eendebakpt force-pushed the performance/list_repeat_v2_specialized branch from 05e2b70 to 5445cfb Compare April 21, 2022 18:20

sweeneyde reviewed Apr 21, 2022

View reviewed changes

eendebakpt added 2 commits April 22, 2022 10:13

remove Py_INCREF_n from public api

3d287c7

take multiplication out of the loop

640643e

eendebakpt mentioned this pull request Apr 22, 2022

Improve performance of list and tuple repeat methods #91247

Closed

eendebakpt requested a review from sweeneyde April 26, 2022 15:41

eendebakpt mentioned this pull request May 4, 2022

gh-91247: Performance improvement in list repeating #92286

Closed

eendebakpt added 2 commits May 11, 2022 10:24

Merge branch 'main' into performance/list_repeat_v2_specialized

7b5e5b2

fix formatting of news item

66cc8ae

eendebakpt force-pushed the performance/list_repeat_v2_specialized branch from 1491705 to 66cc8ae Compare May 11, 2022 08:39

eendebakpt added 2 commits May 17, 2022 10:25

Merge branch 'main' into performance/list_repeat_v2_specialized

30baa6f

Merge branch 'main' into performance/list_repeat_v2_specialized

820add4

sweeneyde reviewed May 24, 2022

View reviewed changes

Include/internal/pycore_object.h Outdated Show resolved Hide resolved

sweeneyde requested review from sweeneyde and vstinner May 24, 2022 12:28

vstinner reviewed May 24, 2022

View reviewed changes

Include/internal/pycore_object.h Outdated Show resolved Hide resolved

Include/internal/pycore_object.h Outdated Show resolved Hide resolved

kumaraditya303 reviewed Jul 12, 2022

View reviewed changes

Include/internal/pycore_list.h Outdated Show resolved Hide resolved

kumaraditya303 reviewed Jul 12, 2022

View reviewed changes

Include/internal/pycore_object.h Outdated Show resolved Hide resolved

kumaraditya303 reviewed Jul 12, 2022

View reviewed changes

Objects/listobject.c Outdated Show resolved Hide resolved

kumaraditya303 reviewed Jul 12, 2022

View reviewed changes

Objects/listobject.c Outdated Show resolved Hide resolved

eendebakpt and others added 2 commits July 12, 2022 11:00

Apply suggestions from code review

d47f6d7

Co-authored-by: Kumar Aditya <[email protected]>

Merge branch 'main' into performance/list_repeat_v2_specialized

4584415

kumaraditya303 reviewed Jul 12, 2022

View reviewed changes

Misc/NEWS.d/next/Core and Builtins/2022-03-22-13-12-27.bpo-47091.tJcy-P.rst Outdated Show resolved Hide resolved

Update Misc/NEWS.d/next/Core and Builtins/2022-03-22-13-12-27.bpo-470…

f6e6bdf

…91.tJcy-P.rst Co-authored-by: Kumar Aditya <[email protected]>

eendebakpt requested a review from kumaraditya303 July 16, 2022 19:10

kumaraditya303 reviewed Jul 22, 2022

View reviewed changes

Include/internal/pycore_object.h Outdated Show resolved Hide resolved

kumaraditya303 reviewed Jul 22, 2022

View reviewed changes

Include/internal/pycore_object.h Outdated Show resolved Hide resolved

kumaraditya303 approved these changes Jul 22, 2022

View reviewed changes

bedevere-bot added awaiting core review and removed awaiting review labels Jul 22, 2022

eendebakpt and others added 2 commits July 22, 2022 22:16

Apply suggestions from code review

a8c32ac

Co-authored-by: Kumar Aditya <[email protected]>

Merge branch 'main' into performance/list_repeat_v2_specialized

8f152f2

sweeneyde reviewed Jul 24, 2022

View reviewed changes

eendebakpt and others added 3 commits July 24, 2022 14:51

Apply suggestions from code review

4211b00

Co-authored-by: Dennis Sweeney <[email protected]>

pep7

7056599

move assert to _Py_memory_repeat

53fa880

Merge branch 'main' into performance/list_repeat_v2_specialized

516f091

sweeneyde added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Jul 25, 2022

bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Jul 25, 2022

sweeneyde reviewed Jul 26, 2022

View reviewed changes

sweeneyde merged commit 2ef73be into python:main Jul 26, 2022

bedevere-bot removed the awaiting core review label Jul 26, 2022

eendebakpt deleted the performance/list_repeat_v2_specialized branch July 26, 2022 08:02

Uh oh!

gh-91247: improve performance of list and tuple repeat (with specialization for n=1) #91482

gh-91247: improve performance of list and tuple repeat (with specialization for n=1) #91482

Uh oh!

Conversation

eendebakpt commented Apr 12, 2022 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eendebakpt commented Apr 21, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sweeneyde commented May 24, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kumaraditya303 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sweeneyde left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sweeneyde commented Jul 25, 2022

Uh oh!

bedevere-bot commented Jul 25, 2022

Uh oh!

sweeneyde Jul 26, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

eendebakpt commented Apr 12, 2022 •

edited by bedevere-bot

Loading