[MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case #19300

joclement · 2021-01-29T15:09:47Z

Reference Issues/PRs

Fixes small bug in #16625

What does this implement/fix? Explain your changes.

A small typo
A bug when an actual "multiclass" type is passed, which detected as a "binary" type

Details on the bug:
I stumbled upon this problem when using the LOGO cross validator. When not all the classes of a multiclass problem are contained in the y_true parameter, the top_k_accuracy_score function raises the following error:

ValueError: y should be a 1d array, got an array of shape (4, 4) instead.

In order to replicate this bug I added a test, which fails, if my fix is not added. The error above is caused by that test.
The reason is that the function call type_of_target(y_true) determines the type "binary", if the number of unique values in y_true is <= 2.

I believe that this error doesn't need to happen, if the labels parameter is passed. Then the function can determine the classes and the type of the problem by inspecting that parameter as well.

thomasjpfan

Thank you for the PR @flyingdutchman23 !

Please add an entry to the change log at doc/whats_new/v1.0.rst with tag |Fix|. Like the other entries there, please reference this pull request with :pr: and credit yourself (and other contributors if applicable) with :user:.

sklearn/metrics/_ranking.py

sklearn/metrics/tests/test_ranking.py

joclement · 2021-01-29T16:06:05Z

Thank you @thomasjpfan for reviewing this PR.

joclement · 2021-03-19T10:39:45Z

I accidently closed this PR, because it was on my main branch. I will open a new one with the same changes.

github-actions bot added the module:metrics label Jan 29, 2021

thomasjpfan reviewed Jan 29, 2021

View reviewed changes

sklearn/metrics/_ranking.py Outdated Show resolved Hide resolved

sklearn/metrics/tests/test_ranking.py Outdated Show resolved Hide resolved

joclement changed the title ~~Fix top_k_accuracy_score neglecting labels for "multiclass" case~~ WIP: Fix top_k_accuracy_score neglecting labels for "multiclass" case Jan 29, 2021

joclement changed the title ~~WIP: Fix top_k_accuracy_score neglecting labels for "multiclass" case~~ Fix top_k_accuracy_score neglecting labels for "multiclass" case Jan 29, 2021

joclement changed the title ~~Fix top_k_accuracy_score neglecting labels for "multiclass" case~~ [MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case Jan 30, 2021

joclement closed this Mar 19, 2021

joclement mentioned this pull request Mar 19, 2021

[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case #19721

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case #19300

[MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case #19300

Uh oh!

joclement commented Jan 29, 2021 •

edited

Loading

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Uh oh!

joclement commented Jan 29, 2021

Uh oh!

joclement commented Mar 19, 2021

Uh oh!

Uh oh!

Uh oh!

[MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case #19300

[MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case #19300

Uh oh!

Conversation

joclement commented Jan 29, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

joclement commented Jan 29, 2021

Uh oh!

joclement commented Mar 19, 2021

Uh oh!

Uh oh!

joclement commented Jan 29, 2021 •

edited

Loading