We already copy-paste too much of code when we define the tp_vectorcall. We need a way to reduce it. ref: ~https://bugs.python.org/issue43447~ https://github.com/python/cpython/issues/87613