-
Notifications
You must be signed in to change notification settings - Fork 15
Closed
Labels
Description
At the moment in order to apply a Colwise
transformation to a selected column(s) one has to use Select
or is there another way?
I am finding that Select
has some drawbacks when used in this way since it results in all the non selected columns being put into the cache but for e.g. the following pipeline this would not be necessary
(Select(:a) → MinMax()) ⊔ (Select(:b) → ZScore())
If I have a large pipeline/table, getting a large cache could be annoying . There is also an issue with revert
#80.
So would it make sense to introduce a wrapper transform (or something else), where we give it a subset of columns, that only applies the (Colwise
) transform to the subset of columns? For the pipeline above this could for example look like the following
Restrict(:a)(MinMax()) → Restrict(:b)(ZScore())