ENH: Hierarchical clustering of the correlation matrix #19

celprov · 2023-03-24T10:02:16Z

Implement a new feature to perform hierarchical clustering on the correlation matrix before plotting it.

The hierarchical clustering can be activated by the flag sort in plot_corrmat(), which is by default set to False, such that the default behavior of this function remains unchanged.

fix : correct the alignment of the labels on the horizontal axis

…he correlation matrix before plotting it

oesteban

looking good. main caveat is whether pandas is not an overkill

mriqc_learn/viz/metrics.py

oesteban · 2023-03-24T10:11:24Z

mriqc_learn/viz/metrics.py

+        # Build a new dataframe with the sorted columns
+        for idx, i in enumerate(data.columns[labels_order]):
+            if idx == 0:
+                clustered = pd.DataFrame(data[i])
+            else:
+                df_to_append = pd.DataFrame(data[i])
+                clustered = pd.concat([clustered, df_to_append], axis=1)
+        data = clustered


Numpy should be sufficient to reorder, something like

Suggested change

# Build a new dataframe with the sorted columns

for idx, i in enumerate(data.columns[labels_order]):

if idx == 0:

clustered = pd.DataFrame(data[i])

else:

df_to_append = pd.DataFrame(data[i])

clustered = pd.concat([clustered, df_to_append], axis=1)

data = clustered

data = np.take(data, labels_order, axis=0)

Q2 - don't you want to also sort the rows?

Wow very fast, thanks.
The panda implementation reorder both the rows and the columns.

Okay, I think np.take will then work for you with something like (labels_order, labels_order) or zip((labels_order, labels_order)) for the indexes and no axis argument.

Unfortunately, none of the suggestions work and with a quick search on internet, I couldn't figure out how to reorder both rows and columns in a np.array. I thus suggest we keep the panda implementation.

I think this is easier than you think:

reordered_idx = (0, 1, 2, 4, 5, 3, 6, 7, 8, 9) data.take(indices=reordered_idx, axis=0).take(indices=reordered_idx, axis=1)

The only caveat is that you need to do the reordering on the full correlation matrix, and only after the reordering drop the upper triangle (if you want to do so).

Thanks a lot, it works with this suggestion. I really could not figure out how to do the reordering on np.array.
It indeed greatly simplifies the code.
Can I merge the PR now?

Co-authored-by: Oscar Esteban <[email protected]>

celprov · 2023-10-25T21:22:25Z

@oesteban time to revive this. I think it is ready to just merge it, as I could run the code on the IQMs from the IXI dataset and since we already worked on reviews long time back.

Here is a resulting correlation matrix plot

celprov requested a review from oesteban March 24, 2023 10:02

enh : implement a new feature to perform hierarchical clustering on t…

a373e37

…he correlation matrix before plotting it

celprov force-pushed the enh/clustering_corrmat branch from c66f057 to a373e37 Compare March 24, 2023 10:08

oesteban reviewed Mar 24, 2023

View reviewed changes

Apply suggestions from code review to rename flag

00ea0e8

Co-authored-by: Oscar Esteban <[email protected]>

celprov mentioned this pull request Mar 24, 2023

ENH : Add notebook to plot IQMs correlation TheAxonLab/defacing-and-qc-analysis#19

Merged

celprov force-pushed the enh/clustering_corrmat branch from d273c1e to 97fd28b Compare March 24, 2023 12:59

sty : simplify the reordering of the correlation matrix

3f10199

celprov force-pushed the enh/clustering_corrmat branch from 97fd28b to 3f10199 Compare March 24, 2023 13:10

celprov requested a review from oesteban October 25, 2023 21:15

fix: TypeError launched by ax.spines[:]

81867f0

celprov force-pushed the enh/clustering_corrmat branch from 33e999f to 81867f0 Compare October 25, 2023 21:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Hierarchical clustering of the correlation matrix #19

ENH: Hierarchical clustering of the correlation matrix #19

celprov commented Mar 24, 2023 •

edited

Loading

oesteban left a comment

oesteban Mar 24, 2023

celprov Mar 24, 2023

oesteban Mar 24, 2023

celprov Mar 24, 2023

oesteban Mar 24, 2023

celprov Mar 24, 2023

celprov commented Oct 25, 2023

ENH: Hierarchical clustering of the correlation matrix #19

Are you sure you want to change the base?

ENH: Hierarchical clustering of the correlation matrix #19

Conversation

celprov commented Mar 24, 2023 • edited Loading

oesteban left a comment

Choose a reason for hiding this comment

oesteban Mar 24, 2023

Choose a reason for hiding this comment

celprov Mar 24, 2023

Choose a reason for hiding this comment

oesteban Mar 24, 2023

Choose a reason for hiding this comment

celprov Mar 24, 2023

Choose a reason for hiding this comment

oesteban Mar 24, 2023

Choose a reason for hiding this comment

celprov Mar 24, 2023

Choose a reason for hiding this comment

celprov commented Oct 25, 2023

celprov commented Mar 24, 2023 •

edited

Loading