You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I thought ranking was based on aggregate scores (0.2 * iptm + 0.8 * ptm - 100 * clash)?
If so, model_idx_2 (with "aggregate_score": 0.28714820742607117) should be rank_0, but it is rank_3.
The results from the .json files (with their aggregate_score):
- model_idx_2 (0.28714820742607117) is rank_3 (should be rank_0)
- model_idx_4 (0.24289762973785400) is rank_1
- model_idx_3 (0.24094939231872559) is rank_0 (should be rank_2)
- model_idx_0 (0.23602318763732910) is rank_2 (should be rank_3)
- model_idx_1 (0.21727515757083893) is rank_4
We merged a change that we're hoping fixes this issue. Please let us know if this issues recurs (N.B. we cache jobs, so if you specify the exact same inputs as last time, there won't be any change to the outputs--you'll need to test with new inputs).
I thought ranking was based on aggregate scores (0.2 * iptm + 0.8 * ptm - 100 * clash)?
If so,
model_idx_2
(with"aggregate_score": 0.28714820742607117
) should berank_0
, but it isrank_3
.The results from the .json files (with their aggregate_score):
Here are the .json files:
The text was updated successfully, but these errors were encountered: