Unification of shape checks #250

Open
msluszniak opened this issue Apr 6, 2024 · 9 comments
Labels
suggestion ideas and/or plans to put forward for consideration.

Comments

@msluszniak
Contributor

As mentioned here.

@msluszniak msluszniak added the suggestion ideas and/or plans to put forward for consideration. label Apr 6, 2024
@krstopro
Member

Is this really completed?

@josevalim
Contributor

Errr, wrong issue.

@josevalim josevalim reopened this May 16, 2024
@josevalim
Contributor

At the same time, I am not sure if there is something to unify here? I think checking for shapes should be done per module, as they can provide better error messages within their context?

@krstopro
Member

> At the same time, I am not sure if there is something to unify here? I think checking for shapes should be done per module, as they can provide better error messages within their context?

Could be the case. My idea was to have several functions for the checks that keep repeating, e.g. checking that x is of rank 2, or that x and y agree on the first axis, etc.
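The kind of reusable helpers suggested here could look like the following sketch. The names `check_rank` and `check_matching_axis` are hypothetical, not Scholar's API, and the sketch is in Python for concreteness (Scholar itself is Elixir); note how the error messages are necessarily generic, which is the trade-off raised in the next reply.

```python
# Hypothetical generic shape-check helpers; names and messages are
# illustrative, not part of Scholar or Nx.

def check_rank(shape, expected_rank):
    """Raise if a tensor shape does not have the expected rank."""
    if len(shape) != expected_rank:
        raise ValueError(
            f"expected tensor of rank {expected_rank}, got rank {len(shape)}"
        )

def check_matching_axis(shape_x, shape_y, axis=0):
    """Raise if two tensor shapes disagree on a given axis."""
    if shape_x[axis] != shape_y[axis]:
        raise ValueError(
            f"expected tensors to agree on axis {axis}, "
            f"got {shape_x[axis]} and {shape_y[axis]}"
        )

check_rank((10, 3), 2)               # passes: rank 2
check_matching_axis((10, 3), (10,))  # passes: both have 10 on axis 0
```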

@josevalim
Contributor

The problem is that the error message ends up being generic: "expected tensors to have matching leading dimensions" while today we can say "expected the data to have the same dimension as the query" etc. We actually had those helpers inside Nx itself at the beginning but we removed them because of that once we introduced case. :)

@JoaquinIglesiasTurina
Contributor

I am not sure whether this comment is fully on-topic, but for linear models I find the behaviour of the API inconsistent when it comes to data shapes.

You can fit all linear models with a target shaped {n_samples, 1}, but when you call predict, some models raise an ArgumentError with the following message:
"dot/zip expects shapes to be compatible, dimension 1 of left-side (1) does not equal dimension 0 of right-side (10)".
The remaining models have a multioutput option, so they can take an {n_samples, 1} shaped target, leading to this inconsistency.

The models yielding this error are:

  • Scholar.Linear.BayesianRidgeRegression
  • Scholar.Linear.IsotonicRegression
  • Scholar.Linear.LinearRegression
  • Scholar.Linear.SVM

The models not yielding this error are:

  • Linear.PolynomialRegression
  • Linear.RidgeRegression

A livebook showcasing this behaviour of linear models is available here.

I think all linear models should work fine with {n_samples, 1} and {n_samples} shaped targets. If you all agree with that, I would be happy to work on this issue of linear models.
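The mismatch described above can be reconstructed in NumPy, whose dot semantics match Nx's here (an analogy for illustration, not Scholar's actual predict code): coefficients fitted from a {n_samples, 1} target keep an extra axis, and a predict step written for flat coefficients then mis-orients the dot.

```python
import numpy as np

n_samples, n_features = 10, 3
x = np.ones((n_samples, n_features))

# Coefficients fitted from a {n_samples, 1} target keep an extra axis:
coef = np.ones((n_features, 1))

# A dot written for 1-D coefficients then fails, analogous to the
# ArgumentError above: dimension 1 of left (1) != dimension 0 of right (10).
try:
    np.dot(coef, x)  # {3, 1} . {10, 3} is incompatible
except ValueError as e:
    print(e)

# With a flat {n_features} coefficient vector, the same call site works:
coef_flat = coef.ravel()
np.dot(x, coef_flat)  # {10, 3} . {3} -> {10}
```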

@msluszniak
Contributor Author

Sure, that is really strange behaviour and not supposed to happen, so feel free to work on it. I'd appreciate that :)

@JoaquinIglesiasTurina
Contributor

I've taken a further look at the issue with linear models. I think there is a decision to be made on how to handle the situation:

  • Should the output of an {n_samples, 1} and {n_samples} shaped target be the same? Or should it be different?

Meaning, RidgeRegression returns {1, n_samples} shaped coefficients for {n_samples, 1} shaped targets, while for {n_samples} targets it returns {n_samples} shaped coefficients.
This is the behaviour of scikit's ordinary least squares. It can be achieved by changing some Nx.dot/2 calls to Nx.dot/4 calls.

A different approach would be to shape-check the target and flatten it prior to fitting the model. This would ensure that {n_samples, 1} and {n_samples} shaped targets yield equally shaped coefficients.
This approach has some points of significance:

  • Inconsistency with scikit's API (I don't know if this is a problem).
  • It raises the question: how do we handle linear models with multioutput options?
  • It is potentially a breaking change to RidgeRegression's API.

Mathematically, I favor the second option: when linear models are described mathematically, the target is a column vector, and fitting a column vector should yield the same results as fitting a plain vector.
However, that is the riskier approach; it can lead to some inconsistencies, and leaving things as they are is the safer choice.

Looking forward to your comments.
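The second option (flatten before fitting) can be sketched as follows, with NumPy least squares standing in for Scholar's fit. The `fit` helper and its shape check are hypothetical, shown only to illustrate that both target shapes would then yield identically shaped coefficients.

```python
import numpy as np

# Sketch of the proposed approach: flatten a {n_samples, 1} target before
# fitting so it behaves exactly like a {n_samples} target.
rng = np.random.default_rng(0)
x = rng.normal(size=(20, 3))
true_coef = np.array([1.0, 2.0, 3.0])
y_1d = x @ true_coef          # target shaped {n_samples}
y_2d = y_1d.reshape(-1, 1)    # same target, shaped {n_samples, 1}

def fit(x, y):
    # The proposed shape check: a single-column target is flattened.
    if y.ndim == 2 and y.shape[1] == 1:
        y = y.ravel()
    coef, *_ = np.linalg.lstsq(x, y, rcond=None)
    return coef

# Both target shapes now yield {n_features} shaped coefficients.
assert fit(x, y_1d).shape == fit(x, y_2d).shape == (3,)
```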

@msluszniak
Contributor Author

I think we may try the second direction, and if it introduces some breaking changes to RidgeRegression, we will fix them.

4 participants