
[MRG] Fixes test_scale_and_stability by clipping small values #13903


Merged: 25 commits into scikit-learn:master, May 23, 2019

Conversation

thomasjpfan
Member

@thomasjpfan thomasjpfan commented May 18, 2019

Reference Issues/PRs

Fixes #6279

What does this implement/fix? Explain your changes.

Clips small values in _PLS.fit.

This has been causing errors in our nightly tests (scipy-dev). It will start to cause errors in our regular tests now that scipy 1.3.0 is out.
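In spirit, the clipping looks something like this (a minimal sketch; the helper name and exact threshold are illustrative, not the PR's code):

```python
import numpy as np

def clip_small_values(Y):
    # Illustrative sketch: zero out entries below machine epsilon for
    # Y's dtype, so they cannot later be inverted into huge numbers.
    Y = Y.copy()
    Y[np.abs(Y) < np.finfo(Y.dtype).eps] = 0.0
    return Y

Y = np.array([[1e-17, 1.0],
              [-1e-18, 2.0]])
print(clip_small_values(Y))
# [[0. 1.]
#  [0. 2.]]
```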

@qinhanmin2014
Member

This test is still failing.
What has changed in numpy/scipy?

@thomasjpfan
Member Author

It's this PR that changed the default behavior of cond in pinv2: scipy/scipy#10067
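For context, the cutoff controls which singular values get inverted. pinv2 has since been removed from SciPy, so here is the same effect shown with numpy.linalg.pinv and its rcond parameter (illustrative, not the scikit-learn code):

```python
import numpy as np

# A matrix with one tiny singular value.
A = np.diag([1.0, 1e-14])

# Loose cutoff: the tiny singular value is treated as zero and ignored.
loose = np.linalg.pinv(A, rcond=1e-10)

# Tight cutoff: the tiny singular value is inverted, producing ~1e14.
tight = np.linalg.pinv(A, rcond=1e-15)

print(loose[1, 1])  # the direction is dropped
print(tight[1, 1])  # huge values leak into the pseudo-inverse
```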

@thomasjpfan
Member Author

I changed the tol for the failing estimator, CCA. This fixed the issue locally; let's see what our CI thinks.

@qinhanmin2014 qinhanmin2014 left a comment
Member

I'm not familiar with that algorithm, but this seems reasonable as long as CI is green.

@qinhanmin2014 qinhanmin2014 added this to the 0.21.2 milestone May 18, 2019
@thomasjpfan thomasjpfan changed the title [MRG] "Fixes" test_scale_and_stability by reducing decimals [MRG] "Fixes" test_scale_and_stability by changing tol May 18, 2019
@thomasjpfan thomasjpfan changed the title [MRG] "Fixes" test_scale_and_stability by changing tol [MRG] Fixes test_scale_and_stability by changing tol May 18, 2019
@qinhanmin2014
Member

Hmm, I'm surprised to see something like

Max absolute difference: 0.06155424
Max relative difference: 1.

Is there a bug? Will be grateful if you can briefly explain what's happening here.

@qinhanmin2014
Member

I quickly went through the code; it's strange that scaling the same data multiple times introduces such a big difference.

@thomasjpfan
Member Author

This is likely a 32-bit numerical issue, since the tests pass for the other environments.

@thomasjpfan
Member Author

When I set the tolerance too low, it went below np.finfo('f').eps (about 1.19e-07), which caused the 32-bit float comparisons in _nipals_twoblocks_inner_loop to fail.
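A quick check of the numbers involved (illustrative):

```python
import numpy as np

eps32 = np.finfo('f').eps
print(eps32)  # the smallest resolvable relative step in float32

# A convergence tolerance below float32 eps can never be reached:
tol = 1e-8
x = np.float32(1.0)
y = x + np.float32(tol)  # the increment rounds away entirely
print(y == x)  # True: a tol-sized change is invisible in float32
```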

@qinhanmin2014
Member

Heading to bed and thanks for your hard work here :)
Honestly I'm OK to skip the test on 32-bit machines and keep the issue open. This does not seem to be a serious issue.

@thomasjpfan thomasjpfan changed the title [MRG] Fixes test_scale_and_stability by changing tol [WIP] Fixes test_scale_and_stability by changing tol May 19, 2019
@NicolasHug
Member

Quick recap of our pair-debugging session with @thomasjpfan:

  • it's not just Windows or 32-bit: we can reproduce it on Linux and macOS with scipy 1.3
  • it's definitely caused by scipy/scipy#10067 (MAINT: Fix the cutoff value inconsistency for pinv2 and pinvh), which changes the default cond parameter of pinv2: small eigenvalues (smaller than cond) used to be ignored. On 1.3 they are no longer ignored, which produces some extremely large values in the pseudo-inverse matrix Y_pinv (since pinv2 divides by these small eigenvalues).

There are two solutions:

  • Stop relying on the default cond in our calls to pinv2
  • Change the tolerance in the test (the solution we went with)
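The first option would have looked roughly like this (a sketch only; pinv2 has since been removed from SciPy, so numpy.linalg.pinv with an explicit rcond stands in for it, and the 10 * eps cutoff is an assumed value):

```python
import numpy as np

def pinv_with_cutoff(a, rtol=10 * np.finfo(np.float64).eps):
    # Singular values below rtol * max(singular_value) are treated as
    # zero, mimicking the pre-1.3 pinv2 default instead of relying on
    # whatever the library's current default is.
    return np.linalg.pinv(a, rcond=rtol)

A = np.diag([1.0, 1e-17])
print(pinv_with_cutoff(A)[1, 1])  # the 1e-17 singular value is dropped
```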

@thomasjpfan thomasjpfan changed the title [WIP] Fixes test_scale_and_stability by changing tol [MRG] Fixes test_scale_and_stability by changing tol May 20, 2019
@qinhanmin2014
Member

@NicolasHug @thomasjpfan tests are still failing

@thomasjpfan
Member Author

This error stems from how _nipals_twoblocks_inner_loop computes y_weights when Y contains small values with different signs:

X = np.array([[1.0, 2.0],
              [4.0, 3.0]])

# negative y[1, 0]
y_n = np.array([[1.0e-15, 1.0],
                [-1.0e-14, 2.0]])

# positive y[1, 0]
y_p = np.array([[1.0e-15, 1.0],
                [1.0e-14, 2.0]])

_nipals_twoblocks_inner_loop(X, y_n, mode="B")
# array([[-4.06091728e+12], [1.24670160e+00]])

_nipals_twoblocks_inner_loop(X, y_p, mode="B")
# array([[4.22666900e+12], [1.23841402e+00]])

Flipping the sign of a single ~1e-14 entry flips the sign of a ~1e12 weight. The PR fixes this issue by replacing the small values with zero.
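A hedged sketch of that fix: columns of Y whose entries are all near machine epsilon carry only sign noise, so they are zeroed before the inner loop runs (the column-wise test and the 10 * eps threshold here are assumptions, not the exact diff):

```python
import numpy as np

def clip_tiny_columns(Yk):
    # Zero out Y columns made up entirely of near-epsilon values: such a
    # column carries no signal, only sign noise.
    Yk = Yk.copy()
    eps = np.finfo(Yk.dtype).eps
    tiny_cols = np.all(np.abs(Yk) < 10 * eps, axis=0)
    Yk[:, tiny_cols] = 0.0
    return Yk

Y = np.array([[1.0e-16, 1.0],
              [-1.0e-16, 2.0]])
print(clip_tiny_columns(Y))
# [[0. 1.]
#  [0. 2.]]
```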

@jnothman jnothman mentioned this pull request May 21, 2019
@NicolasHug NicolasHug left a comment
Member

So changing cond in the call to pinv2 doesn't work?

I'm fine with this anyway; I don't think there's much more we can do.

@qinhanmin2014 you might want to have another look since this has changed since you approved it.

@jnothman jnothman left a comment
Member

This requires a what's new (0.21.2) entry.

Do the original tests now pass with the small value clipping?

Does the eigenvalue conditioning issue relate to #12145?

@thomasjpfan
Member Author

thomasjpfan commented May 22, 2019

So changing cond in the call to pinv2 doesn't work?

I think we would need two conditions, one for single precision and another for double precision (the double-precision cutoff would need to be scaled up more). This is how it was done in pinv2 before scipy 1.3.0.

Does the eigenvalue conditioning issue relate to #12145?

It is related. The cond parameter in pinv2 determines whether an eigenvalue is "significant". scipy 1.3.0 fixed a bug that made it more inclusive, i.e. smaller eigenvalues are now considered "significant".

Do the original tests now pass with the small value clipping?

Yes. The value clipping forces the eigenvalue to zero, thus forcing pinv2 to consider it "insignificant".
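This can be checked directly (illustrative): zeroing the tiny entries removes the tiny singular value, so any pseudo-inverse cutoff then treats that direction as insignificant.

```python
import numpy as np

Y = np.array([[1.0e-16, 1.0],
              [-1.0e-16, 2.0]])

# Before clipping: a tiny but nonzero second singular value survives.
print(np.linalg.svd(Y, compute_uv=False))

# After clipping, the second singular value collapses to (numerically)
# zero, so no cutoff can mistake it for a significant direction.
Y_clipped = np.where(np.abs(Y) < np.finfo(Y.dtype).eps, 0.0, Y)
print(np.linalg.svd(Y_clipped, compute_uv=False))
```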

@thomasjpfan thomasjpfan reopened this May 22, 2019
@jnothman
Member

If the original tests pass, why have we modified them?

@thomasjpfan thomasjpfan changed the title [MRG] Fixes test_scale_and_stability by changing tol [MRG] Fixes test_scale_and_stability by clipping small values May 22, 2019
@jnothman jnothman merged commit c418761 into scikit-learn:master May 23, 2019
@jnothman
Member

Thanks for the fix, @thomasjpfan


Successfully merging this pull request may close these issues.

test_pls.test_scale_and_stability failure
5 participants