#127648 introduced a ~12% performance regression in `python_startup_no_site` benchmark #132952

mdboom · 2025-04-25T15:30:44Z

Bug report

Bug description:

Plotting the Faster CPython team's weekly benchmarks, there is an obvious discontinuity in the python_startup_no_site benchmark:

This is between these two commits:

2025-03-03 0.98 b3c18bf
2025-03-08 0.85 a3990df

Bisecting over this range, it's reproducible that the first bad commit is c6dd2348ca.

CPython versions tested on:

CPython main branch

Operating systems tested on:

Linux

Linked PRs

The text was updated successfully, but these errors were encountered:

JelleZijlstra · 2025-04-25T15:32:36Z

cc @srittau. I suppose this is because io is imported at startup no matter what, and importing io is now slightly slower.

mdboom · 2025-04-25T15:48:42Z

cc @srittau. I suppose this is because io is imported at startup no matter what, and importing io is now slightly slower.

Yes, that's the gist of it. I'm surprised by the size of the regression -- I think that's a testament to how close to the lower limit Python startup already is.

JelleZijlstra · 2025-04-25T15:59:41Z

It looks like we need io at startup (in pylifecycle.c) only in order to access io.open and io.TextIOWrapper, both of which come from the private _io module, so we could speed up startup by importing from _io instead: #132957. Not completely sure it's worth it, but it's a very simple change.

srittau · 2025-04-25T16:09:41Z

That's a huge difference in startup time.

It looks like we need io at startup (in pylifecycle.c) only in order to access io.open and io.TextIOWrapper, both of which come from the private _io module, so we could speed up startup by importing from _io instead: #132957. Not completely sure it's worth it, but it's a very simple change.

This sounds worthwhile to me, independent from this regression. Importing a Python-only module will most likely always incur a fairly large performance penalty.

We could also implement the pseudo-protocols in C if that helps with performance. Finally, we could go back to the original plan to move the protocols to typing, although they are better placed in io.

srittau · 2025-04-28T13:30:41Z

@mdboom For my understanding: This is only for the python_startup_no_site benchmark? Is there a significant slowdown for "normal" Python startup benchmark?

mdboom · 2025-04-28T13:34:33Z

Yes, there is also a slowdown for python_startup, but not so cleanly for a single commit like this, it's more death by a thousand papercuts:

JelleZijlstra · 2025-04-28T13:35:55Z

I think this regresssion is primarily from adding the _collections_abc import, and that module gets imported anyway when site.py is used (because os.py imports it). _collections_abc is relatively slow. The startup with site would also get slightly slower from adding two more classes defined at startup, but that's fast enough that it's probably not easily detectable.

JelleZijlstra · 2025-04-28T13:50:52Z

If we want to speed this up further, I think this is the most promising parts are these:

import time:       601 |        601 |   time
import time:       209 |        810 | zipimport
import time:        49 |         49 |     _codecs
import time:       535 |        583 |   codecs
import time:       400 |        400 |   encodings.aliases
import time:       608 |       1590 | encodings
import time:       172 |        172 | encodings.utf_8

zipimport seems like it should only be needed if we actually use ZIP imports, which many programs won't. Maybe we can defer importing it until we need it?

For encodings, many programs will only ever need UTF-8, not the whole registry system. It looks like this gets triggered by _PyUnicode_InitEncodings, which first initiates the codec registry (triggering import encodings) and then sets up the default encoding (usually UTF-8). Perhaps there can be a fast path where we only register UTF-8 and defer registering the others until we need them.

This is not really part of this issue since it's not a regression, but leaving this here in case there's interest in speeding up startup further.

mdboom added the type-bug An unexpected behavior, bug, or error label Apr 25, 2025

mdboom self-assigned this Apr 25, 2025

mdboom added the performance Performance or resource usage label Apr 25, 2025

bedevere-app bot mentioned this issue Apr 25, 2025

gh-132952: Improve Python startup time by ~12% #132956

Closed

mdboom mentioned this issue Apr 25, 2025

Investigate regression in python_startup_no_site faster-cpython/ideas#727

Closed

JelleZijlstra added a commit to JelleZijlstra/cpython that referenced this issue Apr 25, 2025

pythongh-132952: Import _io instead of io at startup

dac50cc

bedevere-app bot mentioned this issue Apr 25, 2025

gh-132952: Import _io instead of io at startup #132957

Merged

picnixz added the interpreter-core (Objects, Python, Grammar, and Parser dirs) label Apr 25, 2025

JelleZijlstra added a commit that referenced this issue Apr 28, 2025

gh-132952: Speed up startup by importing _io instead of io (#132957)

58567cc

sourcery-ai bot mentioned this issue Apr 28, 2025

[pull] main from python:main webfutureiorepo/cpython#268

Merged

AA-Turner closed this as completed Apr 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#127648 introduced a ~12% performance regression in `python_startup_no_site` benchmark #132952

#127648 introduced a ~12% performance regression in `python_startup_no_site` benchmark #132952

mdboom commented Apr 25, 2025 •

edited by bedevere-app bot

Loading

JelleZijlstra commented Apr 25, 2025

mdboom commented Apr 25, 2025

JelleZijlstra commented Apr 25, 2025

srittau commented Apr 25, 2025

srittau commented Apr 28, 2025

mdboom commented Apr 28, 2025

JelleZijlstra commented Apr 28, 2025

JelleZijlstra commented Apr 28, 2025

#127648 introduced a ~12% performance regression in python_startup_no_site benchmark #132952

#127648 introduced a ~12% performance regression in python_startup_no_site benchmark #132952

Comments

mdboom commented Apr 25, 2025 • edited by bedevere-app bot Loading

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

JelleZijlstra commented Apr 25, 2025

mdboom commented Apr 25, 2025

JelleZijlstra commented Apr 25, 2025

srittau commented Apr 25, 2025

srittau commented Apr 28, 2025

mdboom commented Apr 28, 2025

JelleZijlstra commented Apr 28, 2025

JelleZijlstra commented Apr 28, 2025

#127648 introduced a ~12% performance regression in `python_startup_no_site` benchmark #132952

#127648 introduced a ~12% performance regression in `python_startup_no_site` benchmark #132952

mdboom commented Apr 25, 2025 •

edited by bedevere-app bot

Loading