You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please let me know if I should separate this issue in three issues. The pattern I noticed is exactly the same.
I do not know if this is specific to the Toronto Star or a more general problem.
In four instances (one of which will instead be discussed in a separate issue because there are other challenges), I find there are two or more headline Hublot elements for the same frontpage. Interestingly, some of these headlines do include duplicates, and some frontpages in the past day were associated with only one headline as they should, so the issue does not happen all the time. The headline elements associated with the same frontpage have only a few differences with each other: (1) the hashed_html is different; (2) timestamps are different and non-overlapping (but, collectively, they do match with the frontpage); (3) the lake item's final numbers are different.
This is normal. In situations where modifications are made to the article, the frontpage will remain the same but a different headline will be pushed to the lake.
If the hashed_html are different, then the code is finding differences in the content. This could be a bug, but the code is behaving as it should in situations where it sees different content related to the same frontpage.
Yes, that makes sense. Maybe the differences are about some other items, such as "next recommended article"? If this doubles the amount of data found in radarplus/headlines this might be something worth inquiring eventually. But maybe the steps required to fix it would be too burdensome to be undertaken.
Please let me know if I should separate this issue in three issues. The pattern I noticed is exactly the same.
I do not know if this is specific to the Toronto Star or a more general problem.
In four instances (one of which will instead be discussed in a separate issue because there are other challenges), I find there are two or more headline Hublot elements for the same frontpage. Interestingly, some of these headlines do include duplicates, and some frontpages in the past day were associated with only one headline as they should, so the issue does not happen all the time. The headline elements associated with the same frontpage have only a few differences with each other: (1) the hashed_html is different; (2) timestamps are different and non-overlapping (but, collectively, they do match with the frontpage); (3) the lake item's final numbers are different.
Instance #1:
Instance #2:
Instance #3:
Screenshots from instance #3 only (since the pattern is very similar for instances #1 and #2; happy to add more screenshots if needed):
The text was updated successfully, but these errors were encountered: