DOC: Improve reshape\concat #47061

anetakahle · 2022-05-19T15:09:14Z

closes DOC: write guide for how to replace append #46825
All code checks passed.

pep8speaks · 2022-05-19T15:09:17Z

Hello @anetakahle! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2022-05-27 10:26:04 UTC

MarcoGorelli

Hey @anetakahle , thanks for taking this on

Instead of this example, could you add an example of how to append a single row (e.g. here), which seems to be the most common source of confusion?

pandas/core/reshape/concat.py

anetakahle · 2022-05-20T07:11:19Z

@MarcoGorelli thanks for a quick review :)
I've updated the example, is the current version better?

MarcoGorelli · 2022-05-20T10:02:06Z

pandas/core/reshape/concat.py

+    >>> b = pd.DataFrame({"A": 3}, index=[0])
+    >>> b
+        A
+    0   3
+    >>> for rowIndex, row in b.iterrows():
+    >>>     print(pd.concat([a, row.to_frame().T], ignore_index=True))


does it work to make this

new_row = pd.Series([3])

and then just do

pd.concat([a, new_row.to_frame().T], ignore_index=True)

?
Iterating over rows is what we want to avoid

Also, let's rename a to df7

Changed as requested in 90bafc6

Is it ok that the data from the new_row have appeared in a new row and not in the first one as in the previous solution?

I've also read the discussion about the deprecated append and maybe we should add some note into the documentation as well, something like

It is not recomended to build DataFrames by adding single rows in a
not loop. Build a list of rows and make a DataFrame in a single concat.

Co-Authored-By: Matěj Štágl <[email protected]>

This reverts commit 824b9bd, reversing changes made to 982a839.

pandas/core/reshape/concat.py

MarcoGorelli · 2022-05-20T17:07:47Z

I've also read the discussion about the deprecated append and maybe we should add some note into the documentation as well, something like

Yes, good suggestion, thanks!

Co-Authored-By: Matěj Štágl <[email protected]>

MarcoGorelli

getting there 💪

pandas/core/reshape/concat.py

Co-Authored-By: Matěj Štágl <[email protected]>

MarcoGorelli

Lots of unrelated changes, this is showing 15 files changed - could you rebase onto main and only modify what's necessary?

Also, please run pre-commit on the file(s) you've changed (see https://pandas.pydata.org/docs/development/contributing_codebase.html#pre-commit)

pandas/core/reshape/concat.py

Co-authored-by: Marco Edward Gorelli <[email protected]>

This reverts commit f4e394d.

…o doc-concat

anetakahle · 2022-05-20T17:48:16Z

@MarcoGorelli fixed, shows only 1 file changed now as intended

anetakahle · 2022-05-21T00:55:27Z

@MarcoGorelli I also ran the pre-commit locally and it didn't fail.

But there are still some other fails here on GitHub, but some of them are on the document as a whole, not just my changes (for example in the case of Code Checks / Docstring and typing validation (pull_request) the test shows to some errors which are not in the code I posted)

MarcoGorelli · 2022-05-21T11:11:40Z

are you sure you committed your latest changes? can you show the output of:

pre-commit run --files pandas/core/reshape/concat.py
git remote -v
git fetch origin
git diff origin/doc-concat

please?

anetakahle · 2022-05-21T12:00:40Z

@MarcoGorelli
Sure!

Here is the output:
pre-commit run --files pandas/core/reshape/concat.py
git remote -v
git fetch origin - no output
git diff origin/doc-concat

MarcoGorelli · 2022-05-21T14:42:07Z

OK looks like you have lots of local unstaged changes (perhaps from a merge gone not quite right?)

I think the simplest way out would be:

git checkout -- .  # discard local changes
git reset --hard origin/doc-concat
pre-commit run --files pandas/core/reshape/concat.py
git commit -m 'lint file'
git push origin doc-concat

MarcoGorelli

😄 no worries - looks like you've added an extra level on identation to the docstring though, could you restore the original level of indentation please?

anetakahle · 2022-05-21T18:25:04Z

@MarcoGorelli
The extra level on indentation was added by the pre-commit.
But I think I fixed it. 😄

pandas/core/reshape/concat.py

anetakahle · 2022-05-21T19:44:05Z

@MarcoGorelli
Resolved. :)

pandas/core/reshape/concat.py

anetakahle · 2022-05-21T21:12:14Z

@MarcoGorelli
OMG, the pre-commit has done so much mess in there :D
I fixed it, I hope it's all ok and I can squash now? :)

MarcoGorelli

Nice, thanks!

MarcoGorelli

Changes look good, but there's still a doctest failure:

_________________ [doctest] pandas.core.reshape.concat.concat __________________
350     ValueError: Indexes have overlapping values: ['a']
351 
352     Append a single row to the end of a ``DataFrame`` object.
353 
354     >>> df7 = pd.DataFrame({'a': 1, 'b': 2}, index=[0])
355     >>> df7
356         a   b
357     0   1   2
358     >>> new_row = pd.Series({'a': 3, 'b': 4})
359     >>> new_row
Expected:
        a   3
        b   4
Got:
    a    3
    b    4
    dtype: int64

(and no need to squash the commits)

anetakahle · 2022-05-22T13:05:53Z

@MarcoGorelli
Ok, I think I fixed it,
but the Code Checks / Docstring and typing validation test is still failing and I don't know why :/ :D
Maybe because of some code I didn't add?

FAILED pandas/core/tools/datetimes.py::pandas.core.tools.datetimes.to_datetime
...
=================================== FAILURES ===================================
______________ [doctest] pandas.core.tools.datetimes.to_datetime _______________
1013 
1014     >>> pd.to_datetime(['2018-10-26 12:00 -0530', '2018-10-26 12:00 -0500'],
1015     ...                utc=True)
1016     DatetimeIndex(['2018-10-26 17:30:00+00:00', '2018-10-26 17:00:00+00:00'],
1017                   dtype='datetime64[ns, UTC]', freq=None)
1018 
1019     - Inputs can contain both naive and aware, string or datetime, the above
1020       rules still apply
1021 
1022     >>> pd.to_datetime(['2018-10-26 12:00', '2018-10-26 12:00 -0530',
UNEXPECTED EXCEPTION: NameError("name 'timezone' is not defined")
Traceback (most recent call last):
  File "/usr/share/miniconda/envs/pandas-dev/lib/python3.8/doctest.py", line [133](https://github.com/pandas-dev/pandas/runs/6543790014?check_suite_focus=true#step:10:134)6, in __run
    exec(compile(example.source, filename, "single",
  File "<doctest pandas.core.tools.datetimes.to_datetime[18]>", line 4, in <module>
NameError: name 'timezone' is not defined
/home/runner/work/pandas/pandas/pandas/core/tools/datetimes.py:1022: UnexpectedException

MarcoGorelli

Awesome, thanks!

The other failure looks unrelated, probably needs to be fixed on main first - I'll rebase later to make sure

anetakahle · 2022-05-22T19:10:17Z

@MarcoGorelli
Thank you😁💜☺️

michalkahle · 2022-05-23T12:20:47Z

Congratulations! 😉

MarcoGorelli

Apologies, looks like there's still a related failure:

Error: /home/runner/work/pandas/pandas/pandas/core/reshape/concat.py:146:GL03:pandas.concat:Double line break found; please use only one blank line to separate sections or paragraphs, and do not leave blank lines at the end of docstrings

anetakahle · 2022-05-27T10:30:11Z

@MarcoGorelli
Oh, I didn't notice.
I changed it.

Thank you :)

jreback · 2022-05-27T12:28:46Z

thanks @anetakahle

anetakahle mentioned this pull request May 19, 2022

DOC: write guide for how to replace append #46825

Closed

1 task

MarcoGorelli suggested changes May 19, 2022

View reviewed changes

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

anetakahle force-pushed the doc-concat branch 2 times, most recently from 758a4aa to 9dfef3a Compare May 20, 2022 07:09

MarcoGorelli suggested changes May 20, 2022

View reviewed changes

anetakahle force-pushed the doc-concat branch from 8488233 to 90bafc6 Compare May 20, 2022 16:50

DOC: Improve reshape\concat

982a839

Co-Authored-By: Matěj Štágl <[email protected]>

anetakahle force-pushed the doc-concat branch from f70786a to 982a839 Compare May 20, 2022 16:59

anetakahle and others added 3 commits May 20, 2022 18:59

Merge branch 'pandas-dev:main' into doc-concat

824b9bd

Update concat.py

108d96e

Co-Authored-By: Matěj Štágl <[email protected]>

Revert "Merge branch 'pandas-dev:main' into doc-concat"

f4e394d

This reverts commit 824b9bd, reversing changes made to 982a839.

MarcoGorelli suggested changes May 20, 2022

View reviewed changes

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

anetakahle and others added 2 commits May 20, 2022 19:14

Update concat.py

7d4a81f

Co-Authored-By: Matěj Štágl <[email protected]>

Update concat.py

873a59f

Co-Authored-By: Matěj Štágl <[email protected]>

MarcoGorelli suggested changes May 20, 2022

View reviewed changes

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

Update concat.py

9513912

Co-Authored-By: Matěj Štágl <[email protected]>

MarcoGorelli suggested changes May 20, 2022

View reviewed changes

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

anetakahle and others added 3 commits May 20, 2022 19:44

Update pandas/core/reshape/concat.py

0b29265

Co-authored-by: Marco Edward Gorelli <[email protected]>

Revert "Revert "Merge branch 'pandas-dev:main' into doc-concat""

eff3bad

This reverts commit f4e394d.

Merge branch 'doc-concat' of https://github.com/anetakahle/pandas int…

eddce8e

…o doc-concat

MarcoGorelli self-requested a review May 20, 2022 20:45

lint file

6b55391

MarcoGorelli suggested changes May 21, 2022

View reviewed changes

indentation fix

28c9ede

MarcoGorelli suggested changes May 21, 2022

View reviewed changes

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

spaces fix

721f63d

jreback added the Docs label May 21, 2022

small fix

83ed246

MarcoGorelli suggested changes May 21, 2022

View reviewed changes

pandas/core/reshape/concat.py Outdated Show resolved Hide resolved

anetakahle added 2 commits May 21, 2022 23:07

removed unrelated white spaces

6a1f171

Update concat.py

16c4cfd

MarcoGorelli approved these changes May 22, 2022

View reviewed changes

MarcoGorelli suggested changes May 22, 2022

View reviewed changes

anetakahle added 2 commits May 22, 2022 14:25

Update concat.py

d0c8af3

Update concat.py

9aa4b28

MarcoGorelli approved these changes May 22, 2022

View reviewed changes

MarcoGorelli added this to the 1.5 milestone May 22, 2022

anetakahle requested a review from MarcoGorelli May 24, 2022 14:15

Merge branch 'main' into doc-concat

b95eced

MarcoGorelli suggested changes May 24, 2022

View reviewed changes

Update concat.py

81b9809

jreback approved these changes May 27, 2022

View reviewed changes

jreback merged commit 17c6e2b into pandas-dev:main May 27, 2022

yehoshuadimarsky pushed a commit to yehoshuadimarsky/pandas that referenced this pull request Jul 13, 2022

DOC: Improve reshape\concat (pandas-dev#47061)

ce9f1eb

Uh oh!

DOC: Improve reshape\concat #47061

DOC: Improve reshape\concat #47061

Uh oh!

Conversation

anetakahle commented May 19, 2022

Uh oh!

pep8speaks commented May 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Comment last updated at 2022-05-27 10:26:04 UTC

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

anetakahle commented May 20, 2022

Uh oh!

MarcoGorelli May 20, 2022

Choose a reason for hiding this comment

Uh oh!

anetakahle May 20, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MarcoGorelli commented May 20, 2022

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

anetakahle commented May 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anetakahle commented May 21, 2022

Uh oh!

MarcoGorelli commented May 21, 2022

Uh oh!

anetakahle commented May 21, 2022

Uh oh!

MarcoGorelli commented May 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

anetakahle commented May 21, 2022

Uh oh!

Uh oh!

Uh oh!

anetakahle commented May 21, 2022

Uh oh!

Uh oh!

anetakahle commented May 21, 2022

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

anetakahle commented May 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

anetakahle commented May 22, 2022

Uh oh!

michalkahle commented May 23, 2022

Uh oh!

MarcoGorelli left a comment

Choose a reason for hiding this comment

Uh oh!

anetakahle commented May 27, 2022

Uh oh!

pep8speaks commented May 19, 2022 •

edited

Loading

anetakahle commented May 20, 2022 •

edited

Loading

MarcoGorelli commented May 21, 2022 •

edited

Loading

anetakahle commented May 22, 2022 •

edited

Loading