Camera Vision realtime still not support ? #11

skyxiaobai · 2024-09-10T05:45:58Z

I tested found vision parts just show front camera view, but vision frame not realtime comunicated with LLM, So, Any plan to support this?

marcus-daily · 2024-09-12T17:08:45Z

Thanks for the question @skyxiaobai. Vision is only supported with some models, such as Claude Sonnet. Also please make sure you have "Voice and Vision" selected in the settings.

skyxiaobai · 2024-09-13T01:35:44Z

Thank you for your reminder. This is what I am doing. I am using GPT-4o by default and the RTVI Android SDK. After generating the APK and running it on the phone, I can see that the camera opens, but I found out that during the conversation, the content from the camera cannot be recognized. For example, in a scenario where you ask, "Can you see what I am doing?" the camera content is not detected. Actually, my goal is to utilize the camera's real-time video stream for conversation, similar to a video chat function. However, after testing, I found that only voice is real-time. Thanks again for your support.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Camera Vision realtime still not support ? #11

Camera Vision realtime still not support ? #11

skyxiaobai commented Sep 10, 2024

marcus-daily commented Sep 12, 2024

skyxiaobai commented Sep 13, 2024

Camera Vision realtime still not support ? #11

Camera Vision realtime still not support ? #11

Comments

skyxiaobai commented Sep 10, 2024

marcus-daily commented Sep 12, 2024

skyxiaobai commented Sep 13, 2024