Stop storing FileError items from Kingfisher Collect in the database #366

jpmckinney · 2022-04-20T17:40:33Z

open-contracting/kingfisher-collect#917 (comment)

Since this version of Process stores the Scrapyd job ID, it's easy to use scrapy-log-analyzer to parse the log file itself. This avoids errors being introduced by Process, the network, etc.

jpmckinney · 2022-06-08T17:53:26Z

~~Hmm, the job ID is stored by the data registry. It is sent to create_collection from Collect's spider_opened callback, but the ID is not yet stored. See #341~~ Done

jpmckinney · 2024-04-12T05:15:21Z

Obviously, as part of this, we would also stop sending messages for file errors from Collect.

Edit: There is also some logic in collectionstatus that we can remove: `.exclude(data__has_key="http_error")`

As part of this, we can delete `collection_note` rows `WHERE note LIKE 'Couldn''t download %'`
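A minimal sketch of that cleanup, for illustration only. It assumes a `collection_note` table with a `note` text column (the `id` column and the sample rows are made up), and it uses an in-memory SQLite database as a stand-in for the real database. The helper name `delete_download_failure_notes` is hypothetical. Binding the pattern as a parameter avoids having to double the apostrophe by hand.

```python
import sqlite3


def delete_download_failure_notes(conn):
    """Delete notes that start with "Couldn't download " and return how many rows were removed."""
    cursor = conn.execute(
        "DELETE FROM collection_note WHERE note LIKE ?",
        # Passed as a bound parameter, so the apostrophe needs no '' escaping.
        ("Couldn't download %",),
    )
    conn.commit()
    return cursor.rowcount


# Stand-in schema and data (assumptions, not the project's actual schema).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE collection_note (id INTEGER PRIMARY KEY, note TEXT)")
conn.executemany(
    "INSERT INTO collection_note (note) VALUES (?)",
    [
        ("Couldn't download http://example.com/1.json",),
        ("Couldn't download http://example.com/2.json",),
        ("Empty package",),
    ],
)

deleted = delete_download_failure_notes(conn)
remaining = [row[0] for row in conn.execute("SELECT note FROM collection_note ORDER BY id")]
```

Note that SQLite's `LIKE` is case-insensitive for ASCII by default, whereas PostgreSQL's is case-sensitive, so it is worth running a `SELECT count(*)` with the same `WHERE` clause against the real database before deleting.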

jpmckinney added refactor feature Relating to loading data from the web API or CLI command and removed refactor labels Jun 8, 2022

jpmckinney added this to the Priority milestone Jun 8, 2022

jpmckinney changed the title ~~Stop storing collection errors in the database~~ Stop storing FileError items from Kingfisher Collect in the database Jul 4, 2023

jpmckinney added the blocked label Jul 4, 2023

jpmckinney mentioned this issue Apr 10, 2024

Acceptance criteria - Kingfisher Process open-contracting/data-registry#30

Open

jpmckinney modified the milestones: Database changes, Priority Apr 12, 2024

jpmckinney removed the blocked label Apr 12, 2024

jpmckinney added database Changes to the database (adding indices, renaming columns) and removed feature Relating to loading data from the web API or CLI command labels Apr 17, 2024

jpmckinney added a commit that referenced this issue Nov 2, 2024

fix: Stop creating collection_file rows for file errors, #366

b5f7bda
