Implement masking to control how embedded points are updated #620

matthieuheitz · 2021-03-15T17:49:57Z

No description provided.

coveralls · 2021-03-24T22:06:42Z

Coverage decreased (-1.7%) to 87.245% when pulling c35db5f on matthieuheitz:fixedpoints into f86c922 on lmcinnes:master.

lmcinnes

Looks good so far. I would like to consider different names than pin_mask. It is a small thing, and naming is hard (I don't have a particularly good alternative suggestion), but good names help users a lot. My best suggestion at the moment is inertia or sample_inertia (similar to sample_weight used in other sklearn models). Of course we would need then then pass through 1.0 - inertia but that seems tractable. I would welcome your thoughts however.

lmcinnes · 2021-03-29T16:17:55Z

umap/umap_.py

@@ -2671,7 +2727,7 @@ def fit_transform(self, X, y=None):
        r_emb: array, shape (n_samples)
            Local radii of data points in the embedding (log-transformed).
        """
-        self.fit(X, y)
+        self.fit(X, y, pin_mask)


Probably best to make this a keyword arg in case we get more fit parameters later (such as a init fit param).

Suggested change

self.fit(X, y, pin_mask)

self.fit(X, y, pink_mask=pin_mask)

lmcinnes · 2021-03-29T16:22:56Z

Note that the remaining test failures are due to how coveralls plays with azure, so you can safely ignore them.

Another note: I may start a new 0.6dev branch and target this at that so we can merge it in sooner and work on it more easily without messing with the main branch. Let me know if you think that would be a good idea.

jondo · 2021-03-30T17:26:25Z

I have now tested this, and I found out that I additionally needed to block this rescaling such that my points (which have coordinates slightly outside [0, 10]) stay fixed.

jondo · 2021-03-30T18:02:20Z

Also, I would like to base my pinned initial embedding on the spectral embedding, and I suggest this change to get it.

Update: The suggested change was merged into master 👍

lmcinnes · 2021-03-30T19:07:55Z

Matthieu raised the rescaling issue with me elsewhere. It is a little tricky as the actual init does need to land in a reasonable spot, or the resulting embedding can go very badly. Leaving the rescaling in ensured that we had a sensible starting point. Otherwise there is the question of whether we leave it to the user -- it is not hard to accidentally provide a bad initialization that produces unexpected results and is hard to diagnose as to what is going wrong. I was hoping to avoid that if possible. Perhaps a reasonable option would be to come up with some semi-reasonable checks and warn if the provided initialization is troublesome?

jondo · 2021-03-31T09:08:39Z

I also think that warning instead of rescaling is the way to go. Perhaps "one of the coordinate ranges is outside [8, 12] (i.e. more than 20 % off)" is a semi-reasonable condition? This assumes that the layout optimization is independent of absolute embedding location.

kruus · 2021-05-04T20:03:38Z

A colleague working on an interactive visualization tool had an interesting scenario. A user could (say) inspect and then drag certain points to left/right and "pin" the x-axis of the dragged points while leaving other axes of the dragged points adaptable.

Supporting also a 2-D nsamples x dim pin_mask seems to describe the capability he was looking for.
In this case, we want the behavior of gradient weights other_mask and current_mask to change from scalars into vector gradient multipliers.

jondo mentioned this pull request Mar 22, 2021

Possibility to fix points in the low embedding #606

Open

matthieuheitz force-pushed the fixedpoints branch from 82fbb28 to c35db5f Compare March 24, 2021 21:29

lmcinnes reviewed Mar 29, 2021

View reviewed changes

Implement masking to control how embedded points are updated

c7fe35e

matthieuheitz force-pushed the fixedpoints branch from c35db5f to c7fe35e Compare June 16, 2021 21:29

lmcinnes mentioned this pull request Jul 5, 2021

Supervised UMAP using already projected points as regression #712

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement masking to control how embedded points are updated #620

Implement masking to control how embedded points are updated #620

matthieuheitz commented Mar 15, 2021

coveralls commented Mar 24, 2021

lmcinnes left a comment

lmcinnes Mar 29, 2021

lmcinnes commented Mar 29, 2021

jondo commented Mar 30, 2021

jondo commented Mar 30, 2021 •

edited

Loading

lmcinnes commented Mar 30, 2021

jondo commented Mar 31, 2021

kruus commented May 4, 2021

Implement masking to control how embedded points are updated #620

Are you sure you want to change the base?

Implement masking to control how embedded points are updated #620

Conversation

matthieuheitz commented Mar 15, 2021

coveralls commented Mar 24, 2021

lmcinnes left a comment

Choose a reason for hiding this comment

lmcinnes Mar 29, 2021

Choose a reason for hiding this comment

lmcinnes commented Mar 29, 2021

jondo commented Mar 30, 2021

jondo commented Mar 30, 2021 • edited Loading

lmcinnes commented Mar 30, 2021

jondo commented Mar 31, 2021

kruus commented May 4, 2021

jondo commented Mar 30, 2021 •

edited

Loading