Ollama MiniApp is a web application for interacting with language models (LLMs) such as Llama, Gemma, or Mistral, as well as vision-language models (VLMs) for image description, all through an intuitive user interface. The project is built with a FastAPI backend and a Vue.js frontend, and uses Ollama to run the models efficiently.
- Intuitive User Interface: A modern, responsive design for interacting with language models.
- Model Selection: Choose from a list of predefined models directly in the interface.
- Image Description: Generate descriptions for uploaded images.
- Extensible Backend: Built on FastAPI to facilitate robust API management.
- Dockerized Deployment: Quick setup and deployment using Docker Compose.
- Customizable Models: Manage and configure models easily with environment variables.
- Framework: FastAPI
- Key Components:
  - `app/core`: Core utilities for configuration and logging.
  - `app/api/v1`: API routes for chat, image descriptions, and model management.
  - `app/services`: Service layer for handling API interactions with Ollama (see the sketch below).
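For illustration, a chat route in `app/api/v1` might delegate to a service in `app/services` that calls Ollama's `/api/generate` endpoint. This is a hypothetical sketch, not the project's actual code: the route path, function names, and request schema are assumptions; only the Ollama API call itself follows Ollama's documented interface.

```python
# Hypothetical sketch (not the project's actual code): a chat route in
# app/api/v1 delegating to a service-layer helper that calls Ollama.
import os

import httpx
from fastapi import APIRouter
from pydantic import BaseModel

OLLAMA_API_URL = os.getenv("OLLAMA_API_URL", "http://ollama:11434")

router = APIRouter()


class ChatRequest(BaseModel):
    model: str   # e.g. "gemma:2b", one of the models listed in LLM_MODELS
    prompt: str


async def generate_completion(model: str, prompt: str) -> str:
    """Service-layer call to Ollama's /api/generate endpoint."""
    async with httpx.AsyncClient(timeout=120) as client:
        response = await client.post(
            f"{OLLAMA_API_URL}/api/generate",
            json={"model": model, "prompt": prompt, "stream": False},
        )
        response.raise_for_status()
        return response.json()["response"]


@router.post("/chat")
async def chat(request: ChatRequest) -> dict:
    answer = await generate_completion(request.model, request.prompt)
    return {"response": answer}
```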
- Framework: Vue.js
- Structure:
  - `src/pages`: Individual pages for the chat, home, and image description functionalities.
  - `src/components`: Reusable components for chat and image interactions.
  - `src/api`: API utility functions for backend communication.
- Managed via Ollama, supporting the latest LLM and VLM models.
- Docker: Install Docker and Docker Compose.
- Node.js: Required only if you intend to run the frontend locally, outside Docker.
Use the following steps for local development with hot-reloading:
- Clone the Repository:

  ```bash
  git clone https://github.com/akdavid/ollama_miniapp.git
  cd ollama_miniapp
  ```

- Set Up the Environment:

  ```bash
  cp .env.example .env
  ```

- Start the Development Environment:

  ```bash
  docker compose up --build
  ```

- Access the Interface:
  - Frontend: http://localhost:8080
  - Backend API Docs: http://localhost:8000/docs
In development mode:
- Changes to the frontend code will trigger hot-reloading.
- Backend code updates will restart the container automatically.
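If you want to confirm programmatically that both services are reachable after startup, a quick check like the following works. It only assumes the two default URLs listed above.

```python
# Quick sanity check that the dev stack is up (assumes the default ports above).
import httpx

ENDPOINTS = {
    "frontend": "http://localhost:8080",
    "backend docs": "http://localhost:8000/docs",
}

for name, url in ENDPOINTS.items():
    try:
        status = httpx.get(url, timeout=5).status_code
        print(f"{name}: {url} -> HTTP {status}")
    except httpx.HTTPError as exc:
        print(f"{name}: {url} -> unreachable ({exc})")
```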
Use these steps to build and deploy the application for production:
- Clone the Repository:

  ```bash
  git clone https://github.com/akdavid/ollama_miniapp.git
  cd ollama_miniapp
  ```

- Set Up the Environment:

  ```bash
  cp .env.example .env
  ```

- Build and Start the Production Environment:

  ```bash
  docker compose -f docker-compose.prod.yml up
  ```

- Access the Interface:
  - Frontend: http://localhost
  - Backend API Docs: http://localhost:8000/docs
In production mode:
- The frontend is built and served as static files through Nginx.
- The backend runs with FastAPI, optimized for production.
- Select a Model: Use the dropdown menu on the chat page.
- Send a Message: Enter a message and click "Send."
- Receive Responses: View responses in a real-time conversation format.
- Upload an Image: Use the form to upload an image file.
- Generate Description: Click "Send" to receive a detailed description of the image.
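Under the hood, vision-language requests to Ollama attach the image as base64 data. The snippet below illustrates that call pattern against Ollama's `/api/generate` endpoint directly; the model name, file path, and prompt are placeholders, and the app's own upload endpoint may differ.

```python
# Illustration of a VLM request made directly to Ollama (not the app's API):
# /api/generate accepts base64-encoded images for multimodal models.
import base64

import httpx

OLLAMA_API_URL = "http://localhost:11434"  # assumes the container port is published to the host
VLM_MODEL = "llava:7b"                     # assumption; match your VLM_MODEL setting

with open("photo.jpg", "rb") as f:         # placeholder image file
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = httpx.post(
    f"{OLLAMA_API_URL}/api/generate",
    json={
        "model": VLM_MODEL,
        "prompt": "Describe this image in detail.",
        "images": [image_b64],
        "stream": False,
    },
    timeout=300,
)
response.raise_for_status()
print(response.json()["response"])
```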
- Port Already in Use:
  - The port used for the Ollama container (`11434`) may already be in use by the Ollama application running natively on your system.
  - To stop the native Ollama service, use the following command:

    ```bash
    systemctl stop ollama.service
    ```
- Insufficient RAM for Image Descriptions:
  - Generating image descriptions may fail if there is not enough available RAM.
  - Solutions:
    - Free up RAM by closing unnecessary applications.
    - Use a smaller model, configured via the `VLM_MODEL` environment variable.
To delete a model from the Ollama container, use these commands:

```bash
docker exec -it ollama_miniapp-ollama-1 bash
ollama rm <model_name>
```
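If you prefer not to exec into the container, Ollama also exposes model management over HTTP. The snippet below lists and deletes models through that API; it assumes the container's port `11434` is reachable from the host, and the model name is just an example.

```python
# Manage Ollama models over HTTP instead of exec'ing into the container.
# Assumes Ollama's port 11434 is published to the host.
import httpx

OLLAMA_API_URL = "http://localhost:11434"

# List installed models.
tags = httpx.get(f"{OLLAMA_API_URL}/api/tags", timeout=30).json()
for model in tags.get("models", []):
    print(model["name"])

# Delete a model by name (equivalent to `ollama rm <model_name>`).
httpx.request(
    "DELETE",
    f"{OLLAMA_API_URL}/api/delete",
    json={"name": "mistral:7b"},  # example model name
    timeout=30,
).raise_for_status()
```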
- Chat with a language model (LLM).
- Dropdown for model selection.
- Image description with a vision-language model (VLM).
- [OPS] Improve CI/CD for tests and code formatting.
- [FRONT] Add unit tests for the frontend.
- [FRONT] Add code formatting.
- [FULL] Add button to add a new LLM model from the web interface.
- [FULL] Improve error handling for image description failures due to insufficient RAM or model limitations.
- [BACK] Add support for GPU inference in the Ollama container.
- [FRONT] Enhance the interface with real-time feedback for responses.
- [BACK] Include the chat history in the LLM prompt (conversation persistence).
- [FRONT] Add Light/Dark modes.
- [FULL] Add a prompt for the image description.
- [FRONT] Add drag and drop for the uploaded image.
- [FRONT] Try Tailwind CSS.
- [NA] Add a documentation using Sphinx or MkDocs.
- [FRONT] Add markdown visualization of chat responses.
Do not hesitate to contribute! :)
Configure these variables in `.env`:
- `LLM_MODELS`: A comma-separated list of available language models (e.g., `gemma:2b,llama3.2:3b,mistral:7b`).
- `OLLAMA_API_URL`: The base URL for Ollama's API (default: `http://ollama:11434`).
- `VLM_MODEL`: The vision-language model used for image descriptions.
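As an illustration of how `app/core` might consume these variables, a pydantic-settings class could parse them as shown below. This is an assumption about the configuration layer, not the actual project code; the default values are illustrative.

```python
# Hypothetical sketch of a settings class reading the .env variables above.
# Field names mirror the documented variables; defaults are illustrative only.
from pydantic_settings import BaseSettings, SettingsConfigDict


class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env")

    llm_models: str = "gemma:2b,llama3.2:3b,mistral:7b"
    ollama_api_url: str = "http://ollama:11434"
    vlm_model: str = "llava:7b"  # assumption; set VLM_MODEL in .env

    @property
    def llm_model_list(self) -> list[str]:
        """LLM_MODELS parsed into a list of model names."""
        return [m.strip() for m in self.llm_models.split(",") if m.strip()]


settings = Settings()
print(settings.llm_model_list)
```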
Contributions are welcome! If you'd like to add features or fix issues:
- Fork the repository.
- Create a feature branch.
- Make your changes and test locally.
- Submit a pull request with a clear explanation of your changes.
This project is licensed under the MIT License.