Drexel Metadata

Goal

To develop a tool to check the validity of metadata associated with an image, and generate things that are missing. Also includes various geometric and statistical properties on the mask generated over the biological specimen presented.

Functionality

Object detection is currently being performed on 5 detection classes (fish, fish eyes, rulers, and the twos and threes found on rulers). The current setup is performed on the INHS and UWZM biological specimen image repositories.

Current Criteria

Image must contain a fish species (no eels, seashells, butterflies, seahorses, snakes, etc).
Image must contain only 1 of each class (except eyes).
Specimen body must lie alone the image plane from a side view.
Ruler must be consistent (only two ruler types, INHS + UWZM, were used in training set).
Fish must not be obscured by another object (petri dish for example).
Whole body of fish must be present (no heads, tails, or standalone features).
Fish body must not be folded and should have no curvature.

These do not need to be adhered to if properly set up/modified for a specific use case.

Dependencies

Every dependency is stored in a Pipfile. To set this up, run the following commands:

pip install pipenv
pipenv install

There may be OS dependent installations one may need to perform.

Training

Setup:

Create a COCO JSON training set using the images and labels.
1. This is done currently using makesense in their polygon object detection mode.
2. The labels currently used are: fish, ruler, eye, two, three in that exact order.
3. Save as a COCO JSON after performing manual segmentations and labeling. Then, place in datasets.
In the config directory, create a JSON file with a key name of the image directory on your local system, and then a value of an array of dataset names in the datasets folder.
1. For multiple image collections, have multiple keys.
2. For multiple datasets for the same collection, append to the respective value array.
3. For example: {"image_folder_name": ["dataset_name.json"]}.
In the train script, set the conf variable at the top of the main() function to load the JSON file name created in the previous step.
Create a text file named overall_prefix.txt file in the config folder. This file should have the absolute path to the directory in which all the image repositories will be stored.
1. Currently it is /usr/local/bgnn/. There are various image folders like tulane, inhs_filtered, uwzm_filtered, etc.
To edit the learning rate, batch size, or any other base training configuration, edit the base training configurations file.
To edit the number of iterations, dropoffs, or any model specific configurations, edit the model training configurations file.

Finally, to train the model, run the following command:

pipenv run python3 train_model.py

Metadata Generation

The metadata generated is extremely specific to our use case. In addition, we perform additional image processing techniques to improve our accuracies that may not work for other use cases. These include:

Image scaling when a fish is detected but not an eye, in an attempt to lower missing eyes.
Selection of highest confidence fish bounding box given our criterion of single fish in an image.
Contrast enhancement (CLAHE)

The metadata generated produces various statistical and geometric properties of a biological specimen image or collection in a JSON format. When a single file is passed, the data is yielded to the console (stdout). When a directory is passed, the data is stored in a JSON file.

Model

The trained model is available as "Drexel_metadata_generator" at https://datacommons.tdai.osu.edu/dataverse/fish-traits/. The model can be downloaded from that website or via the dva command line utility. To download from the command line install dva then run the following command:

dva download --url https://datacommons.tdai.osu.edu/ doi:10.5072/FK2/MMX6FY .

The above command will download the file and verify the checksum.

Running

To generate the metadata, run the following command:

pipenv run python3 gen_metadata.py [file_or_dir_name]

Usage:

gen_metadata.py [-h] [--device {cpu,cuda}] [--outfname OUTFNAME] [--maskfname MASKFNAME] [--visfname VISFNAME]
                       file_or_directory [limit]

The limit parameter will limit the number of files processed in the directory. The limit positional argument is only applicable when passing a directory.

Device Configuration

By default gen_metadata.py requires a GPU (cuda). To use a CPU instead pass the --device cpu argument to gen_metadata.py.

Single File Usage

The following three arguments are only supported when processing a single image file:

--outfname <filename> - When passed the script will save the output metadata JSON to <filename> instead of printing to the console (the default behavior when processing one file).
--maskfname <filename> - Enables logic to save an output mask to <filename> for the single input file.
--visfname <filename> - Changes the script to save the output visualization to <filename> instead of the hard coded location.

These arguments are meant to simplify adding gen_metadata.py to a workflow that process files individually.

Running with Singularity

A Docker container is automatically built for each drexel_metadata release. This container has the requirements installed and includes the model file. To run the singularity container for a specific version follow this pattern:

singularity run docker://ghcr.io/hdr-bgnn/drexel_metadata:<release> gen_metadata.py ...

Properties Generated

Property	Association	Type	Explanation
has_fish	Overall Image	Boolean	Whether a fish was found in the image.
fish_count	Overall Image	Integer	The quantity of fish present.
has_ruler	Overall Image	Boolean	Whether a ruler was found in the image.
ruler_bbox	Overall Image	4 Tuple	The bounding box of the ruler (if found).
scale*	Overall Image	Float	The scale of the image in $\frac{\mathrm{pixels}}{\mathrm{cm}}$.
bbox	Per Fish	4 Tuple	The top left and bottom right coordinates of the bounding box for a fish.
background.mean	Per Fish	Float	The mean intensity of the background within a given fish's bounding box.
background.std	Per Fish	Float	The standard deviation of the background within a given fish's bounding box.
foreground.mean	Per Fish	Float	The mean intensity of the foreground within a given fish's bounding box.
foreground.std	Per Fish	Float	The standard deviation of the foreground within a given fish's bounding box.
contrast*	Per Fish	Float	The contrast between foreground and background intensities within a given fish's bounding box.
centroid	Per Fish	4 Tuple	The centroid of a given fish's bitmask.
primary_axis*	Per Fish	2D Vector	The unit length primary axis (eigenvector) for the bitmask of a given fish.
clock_value*	Per Fish	Integer	Fish's primary axis converted into an integer "clock value" between 1 and 12.
oriented_length*	Per Fish	Float	The length of the fish bounding box in centimeters.
mask	Per Fish	2D Matrix	The bitmask of a fish in 0's and 1's.
pixel_analysis_failed	Per Fish	Boolean	Whether the pixel analysis process failed for a given fish. If true, detectron's mask and bounding box were used for metadata generation.
score	Per Fish	Float	The percent confidence score output by detectron for a given fish.
has_eye	Per Fish	Boolean	Whether an eye was found for a given fish.
eye_center	Per Fish	2 Tuple	The centroid of a fish's eye.
side*	Per Fish	String	The side (i.e. 'left' or 'right') of the fish that is facing the camera (dependent on finding its eye).
area	Per Fish	Float	Area of fish in $\mathrm{cm^2}$.
cont_length	Per Fish	Float	The longest contiguous length of the fish in centimeters.
cont_width	Per Fish	Float	The longest contiguous width of the fish in centimeters.
convex_area	Per Fish	Float	Area of convex hull image (smallest convex polygon that encloses the fish) in $\mathrm{cm^2}$.
eccentricity	Per Fish	Float	Ratio of the focal distance over the major axis length of the ellipse that has the same second-moments as the fish.
extent	Per Fish	Float	Ratio of pixels of fish to pixels in the total bounding box. Computed as $\frac{\mathrm{area}}{\mathrm{rows} * \mathrm{cols}}$
feret_diameter_max	Per Fish	Float	The longest distance between points around the fish’s convex hull contour.
kurtosis	Per Fish	2D Vector	The sharpness of the peaks of the frequency-distribution curve of mask pixel coordinates.
major_axis_length	Per Fish	Float	The length of the major axis of the ellipse that has the same normalized second central moments as the fish.
mask.encoding	Per Fish	String	The 8-way Freeman Encoding of the outline of the fish.
mask.start_coord	Per Fish	2D Vector	The starting coordinate of the Freeman encoded mask.
minor_axis_length	Per Fish	Float	The length of the minor axis of the ellipse that has the same normalized second central moments as the fish.
oriented_width	Per Fish	Float	The width of the fish bounding box in centimeters.
perimeter	Per Fish	Float	The approximation of the contour in centimeters as a line through the centers of border pixels using 8-connectivity.
skew	Per Fish	2D Vector	The measure of asymmetry of the frequency-distribution curve of mask pixel coordinates.
solidity	Per Fish	Float	The ratio of pixels in the fish to pixels of the convex hull image.
std	Per Fish	Float	The standard deviation of the mask pixel coordinate distribution.

Associated Publication

J. Pepper, J. Greenberg, Y. Bakiş, X. Wang, H. Bart and D. Breen, "Automatic Metadata Generation for Fish Specimen Image Collections," 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2021, pp. 31-40, doi: 10.1109/JCDL52503.2021.00015.

Kevin Karnani, Joel Pepper, Yasin Bakis et al. Computational Metadata Generation Methods for Biological Specimen Image Collections, 27 April 2022, PREPRINT (Version 1) available at Research Square https://doi.org/10.21203/rs.3.rs-1506561/v1

Authors

Joel Pepper

Kevin Karnani

Name		Name	Last commit message	Last commit date
Latest commit History 348 Commits
.github/workflows		.github/workflows
.idea		.idea
config		config
datasets		datasets
gen_metadata_mini		gen_metadata_mini
paper		paper
.gitignore		.gitignore
CITATION.cff		CITATION.cff
Dockerfile		Dockerfile
LICENSE		LICENSE
Pipfile		Pipfile
README.md		README.md
Training_model_notes		Training_model_notes
brightness.py		brightness.py
check_labels.py		check_labels.py
contrast.py		contrast.py
error_check.py		error_check.py
error_check.sh		error_check.sh
gen_metadata.py		gen_metadata.py
gen_metadata.sh		gen_metadata.sh
picture_viewer.ui		picture_viewer.ui
pixel_analysis.ui		pixel_analysis.ui
pixel_analysis_checker.py		pixel_analysis_checker.py
plot.py		plot.py
profiler.py		profiler.py
pyproject.toml		pyproject.toml
tag_result_checker.py		tag_result_checker.py
train.sh		train.sh
train_iterations.sh		train_iterations.sh
train_model.py		train_model.py
train_model_labels.py		train_model_labels.py
validate.py		validate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Drexel Metadata

Goal

Functionality

Current Criteria

Dependencies

Training

Metadata Generation

Model

Running

Device Configuration

Single File Usage

Running with Singularity

Properties Generated

Associated Publication

Authors

About

Releases 6

Packages

Contributors 7

Languages

License

hdr-bgnn/drexel_metadata

Folders and files

Latest commit

History

Repository files navigation

Drexel Metadata

Goal

Functionality

Current Criteria

Dependencies

Training

Metadata Generation

Model

Running

Device Configuration

Single File Usage

Running with Singularity

Properties Generated

Associated Publication

Authors

About

Resources

License

Stars

Watchers

Forks

Releases 6

Packages 0

Contributors 7

Languages

Packages