REF: share __getitem__ for Categorical/PandasArray/DTA/TDA/PA #36391

jbrockmendel · 2020-09-15T21:54:39Z

No description provided.

jreback

just a question really, nice de-dupes

pandas/core/arrays/categorical.py

simonjayhawkins · 2020-09-16T13:44:08Z

pandas/core/arrays/categorical.py

-        result = self._codes[key]
-        if result.ndim > 1:
+        result = super().__getitem__(key)
+        if np.ndim(result) > 1:


Is np.ndim(result) > 1 generally slower than something like not lib.is_scalar(result) and result.ndim > 1?

result = 5
%timeit np.ndim(result) > 1
2.77 µs ± 48.3 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

%timeit not lib.is_scalar(result) and result.ndim > 1
154 ns ± 1.2 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

or is this not important?

wow that's neat

not lib.is_scalar(result) and result.ndim > 1?

The trouble here is if we have a tuple result. I'll take a look at finding a more performant solution.

I did wonder about that, but pandas\tests\arrays\categorical and pandas\tests\extension\test_categorical.py didn't fail with this change. Do we have tests for this?

also is it possible to get a nested tuple result? np.ndim would give 2 in this case?

Looks like getattr(result, "ndim", 1) > 1 outperforms both the np.ndim and lib.is_scalar checks, and we don't need to worry about tuples. Will update.

also is it possible to get a nested tuple result? np.ndim would give 2 in this case?

Yes. Supporting list-like object-dtype is a massive PITA.
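
For readers following the thread, here is a minimal sketch of how the checks discussed above behave on scalars, tuples, and arrays. It uses only NumPy; the helper name `has_extra_dims` and the sample values are hypothetical, chosen for illustration, and this is not the code that was committed.

```python
import numpy as np


def has_extra_dims(result):
    """Return True if `result` carries its own `ndim` and it exceeds 1.

    Objects without an `ndim` attribute (Python scalars, tuples, nested
    tuples) fall back to the default of 1, so they are never reported
    as multi-dimensional.
    """
    return getattr(result, "ndim", 1) > 1


scalar = 5
flat_tuple = (1, 2)
nested_tuple = ((1, 2), (3, 4))
arr_2d = np.zeros((2, 3))
samples = (scalar, flat_tuple, nested_tuple, arr_2d)

# np.ndim coerces its argument to an array first, so a nested tuple
# is reported as two-dimensional.
ndim_via_numpy = [np.ndim(x) for x in samples]

# The getattr-based check only trusts objects that expose `ndim` themselves.
ndim_via_getattr = [has_extra_dims(x) for x in samples]

# A plain tuple is not a scalar and has no `ndim`, which is why a check of
# the form "not is_scalar(result) and result.ndim > 1" breaks on tuples.
try:
    flat_tuple.ndim
except AttributeError:
    tuple_has_ndim = False
else:
    tuple_has_ndim = True
```

The sketch does not measure performance; the timings quoted in the thread are the only benchmark figures available here.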

simonjayhawkins · 2020-09-16T13:54:28Z

pandas/core/arrays/_mixins.py

+        result = self._from_backing_data(result)
+        return result
+
+    def _validate_getitem_key(self, key):


this duplicates _validate_setitem_key. is this needed?

in what cases could one of a getitem and a setitem key be valid and other invalid?

DatetimeLikeArrayMixin has special logic here. #36210 is about deprecating that
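
As a rough illustration of the pattern this PR shares across array classes, the sketch below shows a mixin whose `__getitem__` validates the key, indexes a backing NumPy array, boxes scalars, and re-wraps array results. The method names `_from_backing_data`, `_box_func`, and `_validate_getitem_key` come from the diff; the class names `BackedArrayMixin` and `CodesArray`, the `_labels` attribute, and the integer-key scalar test are assumptions made for this example and do not reproduce the pandas implementation.

```python
import numpy as np


class BackedArrayMixin:
    """Hypothetical stand-in for a mixin that wraps a NumPy backing array."""

    _ndarray: np.ndarray

    def _from_backing_data(self, arr):
        # Subclasses wrap a new backing array in their own container type.
        raise NotImplementedError

    def _box_func(self, x):
        """Convert a scalar from the backing array to its user-facing form."""
        return x

    def _validate_getitem_key(self, key):
        # Hook for subclasses that need to normalize or reject keys.
        return key

    def __getitem__(self, key):
        key = self._validate_getitem_key(key)
        result = self._ndarray[key]
        # Simplified scalar test (an assumption): an integer key yields a scalar.
        if isinstance(key, (int, np.integer)):
            return self._box_func(result)
        return self._from_backing_data(result)


class CodesArray(BackedArrayMixin):
    """Toy categorical-like container: integer codes plus a list of labels."""

    def __init__(self, codes, labels):
        self._ndarray = np.asarray(codes)
        self._labels = list(labels)

    def _from_backing_data(self, arr):
        return type(self)(arr, self._labels)

    def _box_func(self, x):
        # Map an integer code back to its label.
        return self._labels[x]


arr = CodesArray([0, 2, 1, 0], ["a", "b", "c"])
first = arr[1]   # scalar lookup is boxed to a label
tail = arr[1:]   # a slice stays a CodesArray
```

With this layout each subclass only supplies the wrapping and boxing hooks, which is the kind of de-duplication the review comments refer to.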

jreback · 2020-09-16T14:54:57Z

@jbrockmendel merge away if good here (likely address @simonjayhawkins points as followups?)

jorisvandenbossche · 2020-09-16T15:28:40Z

pandas/core/arrays/_mixins.py

@@ -30,6 +31,9 @@ def _from_backing_data(self: _T, arr: np.ndarray) -> _T:
        """
        raise AbstractMethodError(self)

+    def _box_func(self, x):


Can you add a docstring for this function?

REF: share __getitem__ for Categorical/PandasArrat/DTA/TDA/PA

74931a2

jreback requested changes Sep 15, 2020

jreback added the Indexing (Related to indexing on series/frames, not to indexes themselves) and Clean labels Sep 15, 2020

jreback added this to the 1.2 milestone Sep 15, 2020

jreback approved these changes Sep 15, 2020

simonjayhawkins reviewed Sep 16, 2020

jbrockmendel added 2 commits September 16, 2020 07:56

Merge branch 'master' of https://github.com/pandas-dev/pandas into ndarray-share-7

783a5b8

PERF: better ndim check

685433a

jorisvandenbossche reviewed Sep 16, 2020

docstring

4403ca6

jreback merged commit 28068da into pandas-dev:master Sep 17, 2020

jbrockmendel deleted the ndarray-share-7 branch September 17, 2020 16:51

rhshadrach pushed a commit to rhshadrach/pandas that referenced this pull request Sep 17, 2020

REF: share __getitem__ for Categorical/PandasArray/DTA/TDA/PA (pandas-dev#36391)

c65dbee

kesmit13 pushed a commit to kesmit13/pandas that referenced this pull request Nov 2, 2020

REF: share __getitem__ for Categorical/PandasArray/DTA/TDA/PA (pandas-dev#36391)

a535f79
