Pushing bug fix for metacat #487

shubham-s-agarwal · 2024-09-04T09:20:33Z

2-phase learning for MetaCAT utilises data_undersampled.
Fixed a bug in the eval function, which was incorrectly using the data_undersampled instead of the full_data

2-phase learning for MetaCAT utilises data_undersampled. Fixed a bug in the eval function, which was incorrectly using the data_undersampled instead of the full_data

mart-r

Just the lazy logging formatting.
Also, perhaps that should be a logger.debug instead?

PS:
At initial look it seems like you're changing the order of the returned arguments in the method as well as when unpacking. And that wouldn't change the behaviour. But for the lines changed, that is the intention.
What the change actually does is force the use of encode_category_values in line 354 to use the full data rather than the undersampled data.
So there's a little bit of a confusing path of fixing an issue. In order to fix an issue of unpacking data in line 354, the order of the returned arguments is changed everywhere else.
But this order probably does make sense overall.

mart-r · 2024-09-04T09:24:20Z

medcat/utils/meta_cat/data_utils.py

@@ -210,6 +210,8 @@ def encode_category_values(data: Dict, existing_category_value2id: Optional[Dict
    for i in range(len(data)):
        if data[i][2] in category_value2id.values():
            label_data_[data[i][2]] = label_data_[data[i][2]] + 1
+
+    logger.info(f"Original label_data: {label_data_}")


We want to use lazy formatting for logging. So instead of an f-string, use %s formatting.

Oh yes I'll fix that!
Regarding the ordering of unpacking, I was going to only change the order for link 354, however Tom pointed out that it would make the most sense to have it full_data and then undersampled_data
That's why the long route

Should I make the change for all logging statements to use lazy logging?

Generally don't want to change the purpose of a PR.
But I don't think there's too many. So why not.

As for unpacking - yes, I agree. This order makes more sense in general. Just thought I'd mention it.

mart-r

Lazy logging means allowing the logger to parse in the % formats if (and only if) the logged message has a handler (i.e if it needs to be shown/written somewhere).

So a messages gets logged as follows:

logger.info("Messages about %s and nr %d", "topic 1", 4)

shubham-s-agarwal · 2024-09-04T14:07:13Z

Fixed the lazy logging formatting

mart-r

Looks good to me.

* Pushing bug fix for metacat 2-phase learning for MetaCAT utilises data_undersampled. Fixed a bug in the eval function, which was incorrectly using the data_undersampled instead of the full_data * Pushing change for lazy logging * Pushing update for lazy logging * Pushing lint fix

Pushing bug fix for metacat

c83780e

2-phase learning for MetaCAT utilises data_undersampled. Fixed a bug in the eval function, which was incorrectly using the data_undersampled instead of the full_data

shubham-s-agarwal requested a review from mart-r September 4, 2024 09:20

shubham-s-agarwal self-assigned this Sep 4, 2024

mart-r requested changes Sep 4, 2024

View reviewed changes

Pushing change for lazy logging

231cccb

shubham-s-agarwal marked this pull request as draft September 4, 2024 12:44

mart-r requested changes Sep 4, 2024

View reviewed changes

shubham-s-agarwal added 2 commits September 4, 2024 14:22

Pushing update for lazy logging

a75a7c6

Pushing lint fix

f4341df

shubham-s-agarwal marked this pull request as ready for review September 4, 2024 14:06

mart-r approved these changes Sep 4, 2024

View reviewed changes

shubham-s-agarwal merged commit 6127f77 into master Sep 5, 2024
8 checks passed

shubham-s-agarwal deleted the metacat_bug_fix branch September 5, 2024 08:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pushing bug fix for metacat #487

Pushing bug fix for metacat #487

shubham-s-agarwal commented Sep 4, 2024

mart-r left a comment

mart-r Sep 4, 2024

shubham-s-agarwal Sep 4, 2024

shubham-s-agarwal Sep 4, 2024

mart-r Sep 4, 2024

mart-r left a comment

shubham-s-agarwal commented Sep 4, 2024

mart-r left a comment

Pushing bug fix for metacat #487

Pushing bug fix for metacat #487

Conversation

shubham-s-agarwal commented Sep 4, 2024

mart-r left a comment

Choose a reason for hiding this comment

mart-r Sep 4, 2024

Choose a reason for hiding this comment

shubham-s-agarwal Sep 4, 2024

Choose a reason for hiding this comment

shubham-s-agarwal Sep 4, 2024

Choose a reason for hiding this comment

mart-r Sep 4, 2024

Choose a reason for hiding this comment

mart-r left a comment

Choose a reason for hiding this comment

shubham-s-agarwal commented Sep 4, 2024

mart-r left a comment

Choose a reason for hiding this comment