Add an 'axis' parameter to concat_images, plus two tests. #298

bcipolli · 2015-02-24T09:38:14Z

Trying to solve issue #207. There are now three cases:

axis is not None: concat on the specified dimension
axis is None and all images have a final dimension of 1: concat on the final dimension (don't add a new one)
axis is None and any image has a final dimension != 1: concat on a new dimension.

Existing tests covered the 3rd case. New tests were added for the first two cases.
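The three cases listed above can be sketched on plain NumPy arrays. This is only an illustration of the dispatch logic as described in this comment, not the code in the pull request: `concat_arrays` is a hypothetical helper, and placing the new dimension last in the third case is an assumption.

```python
import numpy as np


def concat_arrays(arrays, axis=None):
    """Hypothetical helper mirroring the three cases described above."""
    if axis is not None:
        # Case 1: join along the dimension the caller asked for.
        return np.concatenate(arrays, axis=axis)
    if all(a.shape[-1] == 1 for a in arrays):
        # Case 2: every final dimension has length 1, so reuse it.
        return np.concatenate(arrays, axis=-1)
    # Case 3: stack along a new dimension (assumed here to be the last one).
    return np.stack(arrays, axis=-1)


vols = [np.zeros((2, 3, 4)) for _ in range(3)]
thin = [np.zeros((2, 3, 1)) for _ in range(3)]

case1 = concat_arrays(vols, axis=0)
case2 = concat_arrays(thin)
case3 = concat_arrays(vols)
```

With three inputs, the first case grows an existing axis, the second turns the length-1 final axis into length 3, and the third adds a fourth axis of length 3.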

coveralls · 2015-02-24T09:44:45Z

Coverage increased (+0.01%) to 94.33% when pulling c6c6a1d on bcipolli:issue-207 into 96d474c on nipy:master.

matthew-brett · 2015-02-24T22:32:49Z

nibabel/funcs.py

+        out_data[i] = img.get_data().copy()
+    if axis is not None:
+        out_data = np.concatenate(out_data, axis=axis)
+    elif np.all([d.shape[-1] == 1 for d in out_data]):


I would prefer not to have a special case for ones on the last axis - the user can always do (when fixed) axis=-1 for that case.

Fair enough. :)

bcipolli · 2015-02-24T23:44:52Z

@matthew-brett Please take a look. I know np.asarray copies from lists; I believe np.concatenate must copy as well. I beefed up the test coverage as well.
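The copying behavior mentioned here can be checked directly. A minimal sketch, assuming small in-memory arrays stand in for image data:

```python
import numpy as np

a = np.zeros((2, 2))
b = np.ones((2, 2))

# np.concatenate allocates a new array and copies the inputs into it.
joined = np.concatenate([a, b], axis=0)
joined[0, 0] = 99.0

# np.asarray on a list of arrays also builds a new array.
stacked = np.asarray([a, b])

concat_shares = np.shares_memory(joined, a)
asarray_shares = np.shares_memory(stacked, a)
```

Neither result shares memory with the inputs, so writing into `joined` leaves `a` untouched.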

coveralls · 2015-02-24T23:57:18Z

Coverage increased (+0.01%) to 94.33% when pulling c39caac on bcipolli:issue-207 into 96d474c on nipy:master.

matthew-brett · 2015-02-25T01:01:21Z

nibabel/funcs.py

@@ -115,15 +117,19 @@ def concat_images(images, check_affines=True):
    affine = img0.affine
    header = img0.header
    out_shape = (n_imgs, ) + i0shape


This loses the memory efficiency of the original. How about:

    if axis is None:
        # collect images in output array for efficiency
        out_shape = (n_imgs, ) + i0shape
        out_data = np.empty(out_shape)
    else:
        # collect images in list for use with np.concatenate
        out_data = [None] * n_imgs

Then you can use out_data[i] = img.get_data() in the loop.

Why is this less memory efficient? It doesn't copy anywhere. I assume that, under the covers, numpy preallocates a large array and does a memcopy. So I believe this should be very efficient.

Sorry - I should have explained - it's memory efficient because Python can delete the memory for the individual images as it goes through the loop. Having said that, in order for that to happen, we need more logic:

    if is_filename:
        del img
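The memory argument can be illustrated with a small sketch. It assumes a hypothetical `load_volume` function standing in for reading one image from disk; neither function below is taken from nibabel.

```python
import numpy as np


def load_volume(index, shape=(4, 4, 4)):
    """Hypothetical loader standing in for reading one image from disk."""
    return np.full(shape, float(index))


def concat_preallocated(n_imgs, shape=(4, 4, 4)):
    """Fill a preallocated output, holding one input at a time."""
    out = np.empty((n_imgs,) + shape)
    for i in range(n_imgs):
        data = load_volume(i, shape)
        out[i] = data  # copy into the output array
        del data       # drop the only reference so the buffer can be freed
    return out


def concat_from_list(n_imgs, shape=(4, 4, 4)):
    """Collect every input first; all of them stay alive until the end."""
    parts = [load_volume(i, shape) for i in range(n_imgs)]
    return np.concatenate([p[np.newaxis] for p in parts], axis=0)


pre = concat_preallocated(3)
lst = concat_from_list(3)
```

Both functions return the same array. The difference is that the list version keeps all inputs in memory alongside the concatenated result, while the preallocated version needs only the output plus one input at any moment.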

Got it. I have an idea how to do all the cases efficiently; I'll try it out sometime today. I'll also look into improving the test coverage and checks.

matthew-brett · 2015-02-28T19:46:33Z

Anything I can do to help?

bcipolli · 2015-02-28T19:48:03Z

Sorry, had a conference this weekend and have been doing some research. I can get back to this on Monday and try to wrap things up!

matthew-brett · 2015-02-28T19:49:01Z

No problem - and thanks for your work on this.

bcipolli · 2015-03-11T16:14:22Z

Hope to get to this today or tomorrow.

matthew-brett · 2015-03-11T17:16:53Z

Great - thanks.

bcipolli · 2015-03-11T18:04:01Z

@matthew-brett I've got the basic code working, but had a question while testing.

How flexible do you want concat_images to be? Specifically, if a mix of 3D and 4D images (with a singleton 4th dimension, for example) is passed, it is possible to concat them--but it'd take an extra check to make it work.

Would you prefer simply to let that error, or to do the check to make sure those work together? From the testing standpoint, it's no problem to test that case as I've written things now.
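A minimal sketch of the mixed 3D/4D situation, using plain arrays in place of images. The promotion step is one possible form of the "extra check" and is an assumption, not the code from this branch.

```python
import numpy as np

vol3d = np.zeros((2, 3, 4))
vol4d = np.zeros((2, 3, 4, 1))

# Without any extra handling, NumPy refuses to join arrays whose
# number of dimensions differs.
try:
    np.concatenate([vol3d, vol4d], axis=-1)
    mixed_ok = True
except ValueError:
    mixed_ok = False

# Possible extra check: give each 3D array a trailing axis of length 1.
promoted = [v[..., np.newaxis] if v.ndim == 3 else v for v in (vol3d, vol4d)]
joined = np.concatenate(promoted, axis=-1)
```

Letting the call fail keeps the function simpler; promoting the 3D inputs makes the mixed case work at the cost of one more branch.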

bcipolli · 2015-03-11T21:02:07Z

Also, mixed image/filename is not allowed. Easy to extend for that as well.
Finally, I think del img is fine without the if check; it just removes the reference (doesn't force garbage collection); see https://docs.python.org/2/reference/simple_stmts.html#the-del-statement

matthew-brett · 2015-03-11T21:05:11Z

My instinct is to go for simpler code at the expense of not allowing obscurities like mixing 3D and 4D, or image objects and filenames. We can always add those later if it looks like a significant use-case.

bcipolli · 2015-03-11T21:22:06Z

@matthew-brett It made the code simpler to allow mixed than to disallow it. However, the 3D/4D possibilities did make the code more complex.

The code could lose about 5-6 lines of complexity if all images must be 3D or 4D. The testing code is going to be complex no matter what, due to the testing of all axes, None, and no argument specified.

Let me know if you'd like to trim some of the cases, but things are working as-is.

coveralls · 2015-03-11T21:42:26Z

Coverage increased (+0.02%) to 94.34% when pulling 47fc8f0 on bcipolli:issue-207 into 96d474c on nipy:master.

matthew-brett · 2015-03-11T21:47:39Z

nibabel/funcs.py

-    header = img0.header
-    out_shape = (n_imgs, ) + i0shape
-    out_data = np.empty(out_shape)
+    if n_imgs == 0:


Drop this check? I guess if they pass in an empty list they can expect an empty list back?

In the past, and currently, this throws an error. I added the check because the error did not indicate the issue clearly.

Fair enough.

coveralls · 2015-03-12T23:08:39Z

Coverage increased (+0.01%) to 94.34% when pulling 0567ac3 on bcipolli:issue-207 into 96d474c on nipy:master.

coveralls · 2015-03-12T23:13:59Z

Coverage increased (+0.01%) to 94.34% when pulling 0567ac3 on bcipolli:issue-207 into 96d474c on nipy:master.

bcipolli · 2015-03-12T23:16:47Z

@matthew-brett I cut out the 3D/4D special cases and added testing for 2D and 5D.

I did have to add some complexity. I found a lot of edge cases where numpy's broadcasting, or details of np.concatenate, led to unexpected results. These are unlikely edge cases, but to do things correctly (and test generically), I have to explicitly test the shape.

The upside is that the error message will be extremely clear. The downside is that the code to check the shape properly in both axis cases was ~7 lines.

Let me know what you think! I think we're pretty close now.
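The thread does not list the specific edge cases, so the following is only one illustration of how NumPy can accept mismatched shapes without complaint, together with a hypothetical explicit check (`check_same_shape` is not a nibabel function).

```python
import numpy as np

# Assigning into a preallocated slot broadcasts silently: a (1, 4) array
# fills a (3, 4) slot with no error.
out = np.empty((2, 3, 4))
out[0] = np.ones((1, 4))
silently_broadcast = bool(np.all(out[0] == 1))

# np.concatenate only requires matching sizes off the concatenation axis,
# so inputs of different length along that axis are accepted.
uneven = np.concatenate([np.zeros((3, 4)), np.zeros((5, 4))], axis=0)


def check_same_shape(arrays):
    """Hypothetical strict check: every array must match the first one."""
    expected = arrays[0].shape
    for i, arr in enumerate(arrays):
        if arr.shape != expected:
            raise ValueError(
                f"array {i} has shape {arr.shape}, expected {expected}")
```

An explicit check like this costs a few lines but produces an error that names the offending input and its shape.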

coveralls · 2015-03-12T23:25:47Z

Coverage increased (+0.01%) to 94.34% when pulling e9298cf on bcipolli:issue-207 into 96d474c on nipy:master.

matthew-brett · 2015-03-25T23:19:49Z

nibabel/funcs.py

    check_affines : {True, False}, optional
       If True, then check that all the affines for `images` are nearly
       the same, raising a ``ValueError`` otherwise.  Default is True
-
+    axis : None or int, optional
+        If None, concatenates on a new dimension.  This rrequires all images


typo rrequires

coveralls · 2015-03-27T05:02:30Z

Coverage increased (+0.01%) to 94.34% when pulling 84c0fd2 on bcipolli:issue-207 into cf4f946 on nipy:master.

bcipolli · 2015-03-27T05:20:52Z

@matthew-brett made the requested changes, sans the question about the error check.

matthew-brett · 2015-03-27T06:14:20Z

OK - thanks very much for all your work on this.

The last remaining thing is that we are deleting the image even if it is passed in as an image object, in the list, and that could be bad. We either want to delete the image only if we created it by loading the filename, or (probably better) we could use img.get_data(caching='unchanged') so we don't fill up the cache in the image if it is already empty, and the image objects will stay really small.
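The effect of the caching argument can be modeled with a toy class. `LazyImage` below is a hypothetical stand-in and not nibabel's implementation; only the `caching='fill'` and `caching='unchanged'` values are taken from the `get_data` call discussed here.

```python
class LazyImage:
    """Hypothetical stand-in for an image whose data loads on demand."""

    def __init__(self, loader):
        self._loader = loader  # callable that reads the data
        self._cache = None     # filled only when caching == "fill"

    @property
    def in_memory(self):
        return self._cache is not None

    def get_data(self, caching="fill"):
        if caching not in ("fill", "unchanged"):
            raise ValueError("caching must be 'fill' or 'unchanged'")
        if self._cache is not None:
            return self._cache
        data = self._loader()
        if caching == "fill":
            self._cache = data  # the image object now holds the data
        return data


img = LazyImage(lambda: [1, 2, 3])
first = img.get_data(caching="unchanged")
cached_after_unchanged = img.in_memory
second = img.get_data()
cached_after_fill = img.in_memory
```

In this model, reading with `caching="unchanged"` returns the data but leaves an empty cache empty, so image objects passed in by the caller stay small.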

Also you kindly removed the expect_error from the try branch, but it is undefined in some tests now.

I put in a pull request to your branch just now with fixes for these, and some optional refactoring. If you prefer my version, please go ahead and merge, otherwise - fix these last things and we'll merge asap.

Thanks again...

bcipolli · 2015-03-27T06:21:10Z

@matthew-brett I pushed a change for expect_error issue.

As for the del, I believe it is proper--we're not telling Python to delete the image; we're simply deleting the (local) reference to the image. If the image was passed in as an object, the calling code will still have a reference and the image data will remain in memory; if it was loaded locally, then that reference is the only one, and the object will be marked for garbage collection. If that's right, then there's no need for any special logic.
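This reasoning can be checked with weak references. A minimal sketch, assuming CPython's reference counting and a hypothetical `FakeImage` class in place of a real image:

```python
import weakref


class FakeImage:
    """Hypothetical stand-in for an image object."""

    def __init__(self, name):
        self.name = name


def consume(items):
    """Handle each item, then drop the local reference with del."""
    refs = []
    for item in items:
        # Strings play the role of filenames that are loaded locally.
        img = FakeImage(item) if isinstance(item, str) else item
        refs.append(weakref.ref(img))
        del img  # removes only the local name, not the object itself
    return refs


kept = FakeImage("passed in by the caller")
refs = consume([kept, "loaded from a filename"])

caller_object_alive = refs[0]() is kept
local_object_freed = refs[1]() is None  # relies on CPython refcounting
```

The object passed in by the caller survives because the caller still holds a reference, while the locally created object is reclaimed once its only reference is deleted.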

Thanks for the pull request / refactor. I prefer my version with the initialization within the loop, as it potentially avoids loading the first image's data twice. Since we worked hard to improve efficiency in this version, I think that small optimization is worthwhile. But as with you, my preference isn't strong; happy to defer if you feel any obstacle.

Thanks also for your work, sorry I didn't get this done more smoothly! Let me know if, after reading these comments, you think there are still some changes to make, and I'll be glad to take care of them ASAP.

matthew-brett · 2015-03-27T06:31:09Z

For the del - yes, thinking about it, the loop iteration will make a new reference to the object, which may have its only other reference in the passed list. On the other hand, this function will fill the image array cache of the passed image objects, which may be desirable or not desirable. I think probably not.

For the load data twice if doing initialization outside the loop - could you explain? I think there's only one 'get_data' call in the code.

I prefer my code because - well who doesn't prefer their own code? - but also because I have an instinctive allergy to doing i == 1 initialization in the loop. But this is your work, so this is your call.

bcipolli · 2015-03-27T06:52:23Z

> On the other hand, this function will fill the image array cache of the passed image objects, which may be desirable or not desirable. I think probably not.

I believe I understand. I also believe this issue existed in the old version of the code. If so, can we open a new issue to address it?

> For the load data twice if doing initialization outside the loop - could you explain? I think there's only one 'get_data' call in the code.

I probably simply don't know the nibabel code well enough. I assumed the data was loaded when load(img) was called; sounds like it's always deferred to get_data. In that case, the in-loop initialization is totally unnecessary and just more confusing.

bcipolli · 2015-03-27T06:54:33Z

Just merged your code; given what was discussed above, I think it solves the remaining issues. Let's be done with this! :)

matthew-brett · 2015-03-27T07:08:54Z

Thank you for being so patient with my nit-picks and slowness, and thanks very much for taking the time to do this. I think the test fails are spurious and it's time to merge...

MRG: Add an 'axis' parameter to concat_images, plus two tests Add ability to concatenate images over given axis, with tests. Closes #207

Add an 'axis' parameter to concat_images, plus two tests.

c6c6a1d

matthew-brett reviewed Feb 24, 2015

Ben Cipollini added 2 commits February 24, 2015 15:42

Try again, this time with lists and more tests...

2626884

Add greater coverage of different shapes.

c39caac

matthew-brett reviewed Feb 25, 2015

Ben Cipollini added 4 commits March 11, 2015 14:06

Make this work for all 3D and 4D combinations possible, across all ax…

afaa5fe

…es possible. Test extensively.

Allow mixed files and objects.

3331a51

Improve efficiency: load img0 once, del reference, and don't check af…

41732f4

…fine on first image.

Add a final comment.

47fc8f0

matthew-brett reviewed Mar 11, 2015

Ben Cipollini added 3 commits March 12, 2015 15:59

Similar bug in axis=int pathway, due to np.concatenate "smartness".

49b353a

Test 2D - 5D; remove some tests to increase speed.

99f1168

Convert exceptions to string.

0567ac3

Remove default argument.

e9298cf

effigies mentioned this pull request Mar 25, 2015

ENH: Add image_like function for SpatialImages #300

Closed

matthew-brett reviewed Mar 25, 2015

Small code review tweaks.

84d990d

RF: try doing image 1 concat stuff outside loop

188c7ea

Try doing the checks on the first image etc outside the body of the loop.

Merge pull request #1 from matthew-brett/concat-image-outside-loop

69aa16f

MRG: suggested refactoring of concat checks

matthew-brett added a commit that referenced this pull request Mar 27, 2015

Merge pull request #298 from bcipolli/issue-207

b0a8006

MRG: Add an 'axis' parameter to concat_images, plus two tests Add ability to concatenate images over given axis, with tests. Closes #207

matthew-brett merged commit b0a8006 into nipy:master Mar 27, 2015

bcipolli deleted the issue-207 branch July 8, 2015 17:34

bcipolli restored the issue-207 branch July 8, 2015 17:34

bcipolli deleted the issue-207 branch July 8, 2015 17:35

grlee77 pushed a commit to grlee77/nibabel that referenced this pull request Mar 15, 2016

Merge pull request nipy#298 from bcipolli/issue-207

10c5d7f

MRG: Add an 'axis' parameter to concat_images, plus two tests Add ability to concatenate images over given axis, with tests. Closes nipy#207
