Chat with GPT-4o with Webcam

I trust everyone can't wait to start building something with a fantastic GPT-4o multimodal model from OpenAI. At the Microsoft Build 2024 event, Microsoft announced the general availability of GPT-4o with text and image capability. I tried to quickly build something to make traditional conversational chat more fun by adding a camera so the model could see.

This is a simple prototype code repository sharing with the basic web UI. I used simple components such as Flask, CV2 (OpenCV), OpenAI client, and the rest of the simple baseline libraries to make this. You need to create a deployment of GPT-4o in Azure OpenAI Services.

This program logic is simple: the webpage will ask permission to turn on the camera. It would automatically take a snapshot of the image every second. Then, you can type in some text in the chat windows, and it will post the text together with the image to the GPT-4o multimodel model at the same time and then get a response. So you can ask some questions like what you saw.

The response from GPT-4o is not fast at the moment, but it works, and I look forward to the service adding audio and video support soon.

Enjoy!

Prerequisites

Python 3.6+
Flask
OpenAI
cv2
numpy

Installing & Make it Run

Clone the repository:

git clone https://github.com/easonlai/webcam_chat_with_aoai_gpt4o.git

Install the dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
python run.py
```

Usage

Navigate to the home page (localhost:5000 by default).
Give access to the camera, then show and ask it something.
GPT-4o will process the image and the text at the same time and give the result.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
git-images		git-images
templates		templates
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chat with GPT-4o with Webcam

Prerequisites

Installing & Make it Run

Usage

Let's have some fun

About

Releases

Packages

Languages

easonlai/webcam_chat_with_aoai_gpt4o

Folders and files

Latest commit

History

Repository files navigation

Chat with GPT-4o with Webcam

Prerequisites

Installing & Make it Run

Usage

Let's have some fun

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages