FIX: make_image should not modify original array #29892

rcomer · 2025-04-09T20:11:18Z

PR summary

Fixes #29891. The problem is that we divide the RGB part of the array by its alpha on each draw. The simple fix is to make a copy but I assume for efficiency we should only copy when necessary.

If we didn't start with RGBA, we already made a copy
If alpha was an array, we already made a copy
If the alpha channel is one everywhere, it does not matter

PR checklist

"closes #0000" is in the body of the PR description to link the related issue
new and changed code is tested
[n/a] Plotting related features are demonstrated in an example
[n/a] New Features and API Changes are noted with a directive and release note
[n/a] Documentation complies with general and docstring guidelines

jklymak · 2025-04-09T21:51:03Z

lib/matplotlib/tests/test_image.py

@@ -281,6 +281,17 @@ def test_imshow_alpha(fig_test, fig_ref):
    ax3.imshow(rgbau)


+def test_imshow_rgba_multi_draw():


You do this conditionally - may be helpful to test the conditions where you are not doing the copy to demonstrate that they work fine on redraw

dstansby · 2025-04-10T07:11:12Z

This adds another copy of the input image, which for medium to large images is not neglegible. Running the script below I get a peak memory usage of ~450 MB, but with this PR I get a peak usage of ~580MB (as expected the difference is ~one copy of the input image, which is ~120MB). ~~The performance of resizing the window is also much worse, I presume because it's making a copy of the array every time the window size changes.~~ < ignore this

I guess the other way to fix this is to multiply in-place, then divide in place after the resample. That would avoid the extra memory, but if it's constantly happening during a window resize I could imagine it has performace implications, and perhaps floating point errors could accumulate pretty fast with all the multiplications and divisions?

I can't think of another option to solve this without going back to applying alpha in resampled space (ie undoing #29776). Perhaps @anntzer has some thoughts as the original PR author?

import matplotlib.pyplot as plt
import numpy as np

np.random.seed(42)
rgba = np.random.random((2000, 2000, 4))

fig, ax = plt.subplots()
ax.imshow(rgba)
fig.canvas.draw()

anntzer · 2025-04-10T10:54:01Z

The fix looks correct to me, sorry for introducing the bug.

re: performance:

It's clear to me that unpremultiplied filtering is wrong because agg explicitly clips the RGB values to no more than alpha after the premultiplication (Filter images in premultiplied alpha mode. #29776 (comment)), which only makes sense in premultiplied space; if we really don't want to premultiply we should then at least remove that clipping.
In an initial version I wrote the premultiplication by upcasting uint8 just to uint16 e52c150 for int input, which should save a lot of memory, but following review (Filter images in premultiplied alpha mode. #29776 (review)) I moved everything to float for simplicity. I could bring back these narrower types to save memory.
It may be possible to save memory by merging the premultiplication step into the C++ filtering code; it's probably doable but a quick check suggests it's still a bit of work.

Thoughts?

rcomer · 2025-04-10T11:46:04Z

I guess we can add "input was uint8" to my list of reasons not to copy, since we already cast to float.

jklymak · 2025-04-10T16:00:58Z

In my opinion #29776 should remain. The previous results were incorrect, and I think having correct results should trump efficiency.

We copy the user data in _normalize_image_array and set that to self._A. It seems to me the proper thing to do it recopy the user data, not make another version of our self._A. I'm not sure how inefficient that is, but doing that at the correct level would make it so we only ever have one extra copy.

dstansby · 2025-04-10T19:21:47Z

👍 to doing the right thing at the expense of memory - at a glance I think there might be ways to save creating extra memory for the alpha channel, but for simplicity I think worth getting this in as is and then optimizing (if possible) later.

The problem is that we divide the RGB part of the array by its alpha on each draw. The simple fix is to make a copy but I assume for efficiency we should only copy when necessary. * If we didn't start with float RGBA, we already made a copy * If alpha was an array, we already made a copy * If the alpha channel is one everywhere, it does not matter

QuLogic · 2025-04-11T05:46:33Z

It may be possible to save memory by merging the premultiplication step into the C++ filtering code; it's probably doable but a quick check suggests it's still a bit of work.

Either #29453, or some not-yet-pushed version of it, does attempt to do some of this deduplication. I don't have time to look at that again until I finish the text stuff, but it may yet be a possibility to reduce the memory footprint, which can be done after this PR at least fixes the bug.

jklymak · 2025-04-11T06:08:48Z

That's fine as long as we remember to remove the extra copy this introduces.

QuLogic · 2025-04-11T07:51:00Z

I thought #29776 removed a copy (from _rgb_to_rgba), so we should be net even with this PR? At least for now.

jklymak · 2025-04-11T19:58:33Z

Sure? But we still will want to remove this copy?

tacaswell · 2025-04-13T02:28:29Z

I just want to clarify the issue is that we are modifying our copy of the array not the object the user passed to us.

dstansby · 2025-04-13T09:42:08Z

I just want to clarify the issue is that we are modifying our copy of the array not the object the user passed to us.

Yes, this is correct

github-actions bot added the topic: images label Apr 9, 2025

rcomer added Release critical For bugs that make the library unusable (segfaults, incorrect plots, etc) and major regressions. PR: bugfix Pull requests that fix identified bugs labels Apr 9, 2025

jklymak requested a review from anntzer April 9, 2025 21:49

jklymak reviewed Apr 9, 2025

View reviewed changes

rcomer added this to the v3.11.0 milestone Apr 10, 2025

rcomer force-pushed the array-alpha branch from 2684bae to 507f8a4 Compare April 10, 2025 20:34

tacaswell approved these changes Apr 13, 2025

View reviewed changes

dstansby approved these changes Apr 13, 2025

View reviewed changes

dstansby merged commit 349aa96 into matplotlib:main Apr 13, 2025
41 checks passed

rcomer deleted the array-alpha branch April 13, 2025 10:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

FIX: make_image should not modify original array #29892

FIX: make_image should not modify original array #29892

Uh oh!

rcomer commented Apr 9, 2025 •

edited

Loading

Uh oh!

jklymak Apr 9, 2025

Uh oh!

dstansby commented Apr 10, 2025 •

edited

Loading

Uh oh!

anntzer commented Apr 10, 2025

Uh oh!

rcomer commented Apr 10, 2025

Uh oh!

jklymak commented Apr 10, 2025

Uh oh!

dstansby commented Apr 10, 2025

Uh oh!

QuLogic commented Apr 11, 2025

Uh oh!

jklymak commented Apr 11, 2025

Uh oh!

QuLogic commented Apr 11, 2025

Uh oh!

jklymak commented Apr 11, 2025

Uh oh!

tacaswell commented Apr 13, 2025

Uh oh!

dstansby commented Apr 13, 2025

Uh oh!

Uh oh!

Uh oh!

		@@ -281,6 +281,17 @@ def test_imshow_alpha(fig_test, fig_ref):
		ax3.imshow(rgbau)


		def test_imshow_rgba_multi_draw():

Uh oh!

FIX: make_image should not modify original array #29892

FIX: make_image should not modify original array #29892

Uh oh!

Conversation

rcomer commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR summary

PR checklist

Uh oh!

jklymak Apr 9, 2025

Choose a reason for hiding this comment

Uh oh!

dstansby commented Apr 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anntzer commented Apr 10, 2025

Uh oh!

rcomer commented Apr 10, 2025

Uh oh!

jklymak commented Apr 10, 2025

Uh oh!

dstansby commented Apr 10, 2025

Uh oh!

QuLogic commented Apr 11, 2025

Uh oh!

jklymak commented Apr 11, 2025

Uh oh!

QuLogic commented Apr 11, 2025

Uh oh!

jklymak commented Apr 11, 2025

Uh oh!

tacaswell commented Apr 13, 2025

Uh oh!

dstansby commented Apr 13, 2025

Uh oh!

Uh oh!

Uh oh!

rcomer commented Apr 9, 2025 •

edited

Loading

dstansby commented Apr 10, 2025 •

edited

Loading