I would like to contribute code I wrote that logs video predictions to a file.
This is useful for extracting raw actions (class labels and confidence scores) over video segments.
The idea is simple. I override the `draw_clip_range` method of the demo visualizer to report predicted actions and bounding boxes to a `predictions.log` file instead of writing them onto the output video frames.

To preserve the original behavior of the demo visualizer, I add an option, `DEMO.OUTPUT_DISPLAY`, that by default runs the original demo code (e.g., displaying the video in a window or writing it to a file). When it is set to `False`, the logging override is used instead: what would otherwise be drawn on the frames is written as text to the log file under `OUTPUT_DIR`.

The resulting log will be similar to the following:
Each newly sampled "clip section" is marked with `<clip/task-id> [<start-frame>, <end-frame>]`, followed by the predicted actions for each detected bounding box.

The results above were obtained using an AVA checkpoint and classes, with the Detectron2 predictor for bounding boxes.
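
For readers who want a concrete picture of the log layout described above, here is a minimal, self-contained sketch. It is not the code in this PR: `format_clip_log` and `append_to_log` are hypothetical helper names, and the indentation and number formatting are assumptions. Only the `<clip/task-id> [<start-frame>, <end-frame>]` header followed by per-box actions is taken from the description.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]   # (x1, y1, x2, y2), assumed layout
Prediction = Tuple[str, float]            # (class label, confidence)


def format_clip_log(
    task_id: int,
    start_frame: int,
    end_frame: int,
    boxes: Sequence[Box],
    predictions: Sequence[Sequence[Prediction]],
) -> str:
    """Render one clip section as text (hypothetical format).

    The first line is the clip header; each box is then listed with the
    actions predicted for it.
    """
    if len(boxes) != len(predictions):
        raise ValueError("expected one prediction list per bounding box")

    lines: List[str] = [f"{task_id} [{start_frame}, {end_frame}]"]
    for box, preds in zip(boxes, predictions):
        coords = ", ".join(f"{c:.1f}" for c in box)
        lines.append(f"  box [{coords}]")
        for label, score in preds:
            lines.append(f"    {label}: {score:.3f}")
    return "\n".join(lines)


def append_to_log(log_path: str, entry: str) -> None:
    """Append one formatted clip section to the log file."""
    with open(log_path, "a", encoding="utf-8") as f:
        f.write(entry + "\n")
```

A caller would build the path to `predictions.log` under the output directory and call `append_to_log(path, format_clip_log(...))` once per sampled clip.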
The `top-k` mode and `k=5` were used to generate these results, but the output adjusts accordingly with `thres` mode or other values of `k`, in the same manner as the original visualizer.
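
To illustrate how the two modes change which actions end up in the log, the sketch below selects labels from a score vector in either `top-k` or `thres` mode. It is an assumption-laden illustration, not the visualizer's actual code: `select_actions` is a hypothetical name, and the default threshold value is arbitrary.

```python
from typing import List, Sequence, Tuple


def select_actions(
    scores: Sequence[float],
    class_names: Sequence[str],
    mode: str = "top-k",
    k: int = 5,
    thres: float = 0.7,  # arbitrary default chosen for this sketch
) -> List[Tuple[str, float]]:
    """Return (label, score) pairs to report for one bounding box.

    "top-k" keeps the k highest-scoring classes; "thres" keeps every
    class whose score is at least `thres`. Results are sorted by
    descending score in both modes.
    """
    if len(scores) != len(class_names):
        raise ValueError("scores and class_names must have the same length")

    ranked = sorted(zip(class_names, scores), key=lambda p: p[1], reverse=True)
    if mode == "top-k":
        return ranked[:k]
    if mode == "thres":
        return [(name, score) for name, score in ranked if score >= thres]
    raise ValueError(f"unknown mode: {mode!r}")
```

With `mode="top-k"` every box yields exactly `k` entries (or fewer if there are fewer classes), whereas `mode="thres"` can yield any number, including none.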