This repository is the official implementation of the MVA2021 paper: Leveraging Frequency Based Salient Spatial Sound Localization to Improve 360° Video Saliency Prediction.
- GNU Octave-5.2.0
- Python 3.6 (see requirements.txt for required packages)
- FFmpeg
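Before installing the Python packages, it can help to confirm that the external tools are reachable. The following is a minimal sketch, assuming the executables are named `octave`, `ffmpeg`, and `python3` on your PATH; the helper `missing_tools` is hypothetical and not part of this repository. It does not check versions.

```python
import shutil


def missing_tools(tools):
    """Return the subset of `tools` that cannot be found on the PATH."""
    # shutil.which returns None when no matching executable is found.
    return [tool for tool in tools if shutil.which(tool) is None]


# Assumed executable names for the requirements listed above.
REQUIRED_TOOLS = ["octave", "ffmpeg", "python3"]
```

Calling `missing_tools(REQUIRED_TOOLS)` returns an empty list when all three tools are found.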
python3 -m pip install -r requirements.txt
octave -W MCSR/Main.m
python3 ambisonic_saliency/main.py <<path containing *_saliency.mat>> <<output path>>
python3 uv_visualization/fixmap2salmap.py -i ambisonic_saliency_predictions/pred_<<video name>> -o <<folder containing videos/video name/>> -r <<output height x width>>
python3 fusion.py
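The commands above can also be collected into a small driver script. This is a sketch only, assuming the steps are meant to run in the order listed and from the repository root. The function names and parameters (`build_pipeline_commands`, `run_pipeline`, `saliency_mat_dir`, `video_dir`, `resolution`) are hypothetical and not part of this repository. The `resolution` value is passed through unchanged, so supply it in whatever format `fixmap2salmap.py` expects.

```python
import subprocess


def build_pipeline_commands(saliency_mat_dir, output_dir, video_name,
                            video_dir, resolution):
    """Build the argument lists for each step, mirroring the commands above."""
    return [
        ["octave", "-W", "MCSR/Main.m"],
        ["python3", "ambisonic_saliency/main.py", saliency_mat_dir, output_dir],
        ["python3", "uv_visualization/fixmap2salmap.py",
         "-i", f"ambisonic_saliency_predictions/pred_{video_name}",
         "-o", video_dir,
         "-r", resolution],
        ["python3", "fusion.py"],
    ]


def run_pipeline(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(cmd, check=True)
```

With `dry_run=True` (the default) the script only prints the commands, which is a convenient way to check the paths before anything is executed.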