You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I attempted to replicate my problem in the code snippet above. There are 5966 candidate pairs and my ECM classifier returns 5836 of them as matches.
Problem: I want to use ECMClassifier for Entity matching. However, when I apply it to my dataset, ALL the candidate pairs are identified as matches, which is unfortunate.
Is there some parameter I can set to tweak the threshold for match vs non-match, or am I missing something else here?
The text was updated successfully, but these errors were encountered:
I think the threshold for binarizing is too low and you are thus converting all the feature vectors to 1 and getting all matches. Try increasing the binarize threshold
I attempted to replicate my problem in the code snippet above. There are 5966 candidate pairs and my ECM classifier returns 5836 of them as matches.
Problem: I want to use
ECMClassifier
for Entity matching. However, when I apply it to my dataset, ALL the candidate pairs are identified as matches, which is unfortunate.Is there some parameter I can set to tweak the threshold for match vs non-match, or am I missing something else here?
The text was updated successfully, but these errors were encountered: