Visione's Capabilities #25

MalikAhmed2 · 2024-08-12T10:19:21Z

MalikAhmed2
Aug 12, 2024

I'm curious about Visione's abilities. Can Visione identify faces and logos in videos? Can I search for videos based on a specific face or logo? And, does Visione work with audio too, like turning speech into text?

fabiocarrara · 2024-08-19T08:05:30Z

fabiocarrara
Aug 19, 2024
Maintainer

Hi @MalikAhmed2 ,

right now, VISIONE's visual search is supported by the following global image descriptors: OpenCLIP, ALADIN, CLIP2Video, and DINOv2.
None of them are specifically designed to identify/recognize faces or logos.

For faces, you might be able to search for some celebrities and other public figures that ended up in CLIP's training set, but that's all... no custom face search.
For logos, you might also be able to search for custom ones using VISIONE's query-by-example powered by DINOv2, which performs reasonably well in instance retrieval tasks.

Concerning audio, it is not analyzed in VISIONE (yet).

0 replies

MalikAhmed2 · 2024-08-19T08:26:00Z

MalikAhmed2
Aug 19, 2024
Author

Is it possible to add different modules, such as face recognition, to VISIONE?

1 reply

fabiocarrara Aug 20, 2024
Maintainer

Yes, the analysis part is kind of modular (based on dockerized services, see visione/services/analysis) and thought for extension, but it might get tricky to integrate it with the indexing and UI modules. If you are willing to contribute with a PR, we can provide support on this.

MalikAhmed2 · 2024-08-20T09:52:10Z

MalikAhmed2
Aug 20, 2024
Author

I'm eager to contribute in the future when my schedule allows. Keep up the excellent work!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Visione's Capabilities #25

{{title}}

Replies: 3 comments 1 reply

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Visione's Capabilities #25

MalikAhmed2 Aug 12, 2024

Replies: 3 comments · 1 reply

fabiocarrara Aug 19, 2024 Maintainer

MalikAhmed2 Aug 19, 2024 Author

fabiocarrara Aug 20, 2024 Maintainer

MalikAhmed2 Aug 20, 2024 Author

MalikAhmed2
Aug 12, 2024

Replies: 3 comments 1 reply

fabiocarrara
Aug 19, 2024
Maintainer

MalikAhmed2
Aug 19, 2024
Author

fabiocarrara Aug 20, 2024
Maintainer

MalikAhmed2
Aug 20, 2024
Author