You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've read through all the readme files in this project but I can't find the answer to a simple question: what are the tests that are run against the images?
I'm trying to determine if its worth the effort installing and running this software but there is not enough information provided (that I could easily find) to allow me to make this call.
The system checks that the files pass a number of tests and displays the results in a Shiny dashboard.
It would be very useful to have some indication of what these tests are and what sort of errors they identify. Either in terms of listing the various classes of tests or listing them all. I have no idea how many or how detailed they are.
Background: 700K herbarium sheets scanned by Picturae that have not been QA'ed or processed into any downline system yet.
The text was updated successfully, but these errors were encountered:
The software is still under heavy development, so it is not optimized for easy deployment yet. I'll make a note to add these details to the documentation.
Deploying it now requires knowledge of Python, Flask, Linux, and MySQL to be able to run it.
The tests are configurable per project, but include:
raw_pair: Is there a raw file in the 'raw_files_path'
valid_name: Filename is in the list of allowed names
unique_file: Name is not repeated in the project
dupe_elsewhere: Check name against the dupe_elsewhere table, from other projects
jhove: Run jhove validation
magick: Run validation test with Imagemagick
tifpages: Check the number of pages in the tif, typically a thumbnail
tif_compression: Check if tif is compressed using LZW
derivative: Check for a derivative file in 'derivative_files_path'
The mention of Shiny is from an old version. The current version uses a Python/Flask application for the dashboard.
We are doing a major overhaul this Summer and will have a more stable version in a few weeks.
I've read through all the readme files in this project but I can't find the answer to a simple question: what are the tests that are run against the images?
I'm trying to determine if its worth the effort installing and running this software but there is not enough information provided (that I could easily find) to allow me to make this call.
All I found were a number of instances of:
It would be very useful to have some indication of what these tests are and what sort of errors they identify. Either in terms of listing the various classes of tests or listing them all. I have no idea how many or how detailed they are.
Background: 700K herbarium sheets scanned by Picturae that have not been QA'ed or processed into any downline system yet.
The text was updated successfully, but these errors were encountered: