DOC: Restructure and expand UDF page #61470

datapythonista · 2025-05-21T11:34:50Z

I changed the order in which the methods are presented,both in the table and in the sections, to be:

map
apply
pipe
filter
agg
transform

I find it easier to explain them in this order.

And I expanded the method sections with examples and a bit more of information.

I removed the most complex example in the intro, as I think the examples in the sections will make a better job now at explaining the most complex cases.

@arthurlw @rhshadrach do you mind having a look?

arthurlw · 2025-05-22T15:47:35Z

doc/source/user_guide/user_defined_functions.rst

@@ -118,101 +104,229 @@ decisions, ensuring more efficient and maintainable code.
    and :ref:`ewm()<window>` for details.


-:meth:`DataFrame.apply`
-~~~~~~~~~~~~~~~~~~~~~~~
+.. _udf.map:


If we plan to use udf as the reference, then we should rename the reference on the top of the file from:

.. _user_defined_functions:

to:

.. _udf:

arthurlw · 2025-05-22T15:54:29Z

doc/source/user_guide/user_defined_functions.rst


-    df_filtered = df.filter(items=[col for col in df.columns if is_long_name(col)])
-    print(df_filtered)
+    temperature.apply(highest_jump)


Suggested change

temperature.apply(highest_jump)

temperature.agg(highest_jump)

arthurlw · 2025-05-22T16:03:07Z

Looks good to me! I think the example under vectorized operations should be changed to fit with the Fahrenheit example, but that can be added in a follow-up PR.

datapythonista · 2025-05-22T16:18:30Z

Thanks @arthurlw, great feedback. I'll leave the example on the vectorized section for now, as it may make sense to also expand that section as we make progress with the IT engines. Feel free to update it now if you want, but I'm unsure at this point how to add the JIT engines to that section, and how to better present all the performance related topics. Maybe we can just add a section for it, but maybe we can find a way to present it so one topic expands on the previous, as I tried to do with the different methods.

datapythonista · 2025-05-27T15:14:55Z

Merging here, as I want to add few more things to this page. Please let me know if any comment, happy to address feedback in a follow up PR.

DOC: Restructure and expand UDF page

2e1b427

datapythonista added Docs Apply Apply, Aggregate, Transform, Map labels May 21, 2025

datapythonista added 2 commits May 21, 2025 18:46

Adding examples to all methods

e148685

Fix table

32cd67d

arthurlw reviewed May 22, 2025

View reviewed changes

Update label and change typo in example

c13b819

datapythonista merged commit f8d93d8 into pandas-dev:main May 27, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

DOC: Restructure and expand UDF page #61470

DOC: Restructure and expand UDF page #61470

Uh oh!

datapythonista commented May 21, 2025 •

edited

Loading

Uh oh!

arthurlw May 22, 2025

Uh oh!

arthurlw May 22, 2025

Uh oh!

arthurlw commented May 22, 2025

Uh oh!

datapythonista commented May 22, 2025

Uh oh!

datapythonista commented May 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

	temperature.apply(highest_jump)
	temperature.agg(highest_jump)

Uh oh!

DOC: Restructure and expand UDF page #61470

DOC: Restructure and expand UDF page #61470

Uh oh!

Conversation

datapythonista commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arthurlw May 22, 2025

Choose a reason for hiding this comment

Uh oh!

arthurlw May 22, 2025

Choose a reason for hiding this comment

Uh oh!

arthurlw commented May 22, 2025

Uh oh!

datapythonista commented May 22, 2025

Uh oh!

datapythonista commented May 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

datapythonista commented May 21, 2025 •

edited

Loading

datapythonista commented May 27, 2025 •

edited

Loading