This repo provides the code for serving the instruction-finetuned IGEL model for German snippet generation in production. It can be used to spin up a simple HTTP server that handles snippet generation.
NOTE: There is an even faster server for this model, built with the Potassium framework. Please use the Potassium Server for optimal performance.
Curious to get your hands on an IGEL server capable of German news snippet generation?
You can check it out with Docker:

- Run `docker build -t snip-igel-model-server . && docker run -it snip-igel-model-server` to build and run the Docker container.
Or you can check it out manually:
- Run `pip3 install -r requirements.txt` to install dependencies.
- Run `python3 server.py` to start the server.
- Run `python3 test.py` in a different terminal session to test against it.
Note: The model requires a GPU with ~15 GB of memory for generation!
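If you want to see what a request against the locally running server might look like, here is a minimal sketch. The port, endpoint, and `prompt` field are assumptions; check `test.py` for the exact URL and payload schema.

```python
# Hedged sketch of a request against the locally running server.
# Assumptions: the server listens on http://localhost:8000 and accepts a JSON
# payload with a "prompt" field -- see test.py for the exact endpoint and schema.
import requests

payload = {"prompt": "Schreibe einen kurzen Teaser für den folgenden Artikel: ..."}

response = requests.post("http://localhost:8000/", json=payload, timeout=300)
response.raise_for_status()

print(response.json())
```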
- `app.py` contains the code to load and run the model for inference (see the sketch after this list).
- You can run a simple test with `test.py`!
- If deploying with Docker: `download.py` is a script to download our finetuned model weights at build time.
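For orientation, loading the model and generating a snippet in `app.py` roughly follows the usual Transformers pattern. The snippet below is only a hedged sketch: the local weights path, the causal-LM model class, the prompt template, and the generation parameters are all assumptions — consult `app.py` and `download.py` for the actual setup.

```python
# Hedged sketch of loading the finetuned model and generating a snippet.
# Assumptions: weights were saved locally (e.g. by download.py) under ./model
# and a causal-LM architecture is used -- see app.py for the real configuration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_DIR = "./model"  # hypothetical path; download.py defines the real location

tokenizer = AutoTokenizer.from_pretrained(MODEL_DIR)
model = AutoModelForCausalLM.from_pretrained(MODEL_DIR, torch_dtype=torch.float16)
model = model.to("cuda")  # the ~15 GB GPU memory requirement applies here

def generate_snippet(article_text: str, max_new_tokens: int = 128) -> str:
    # The prompt template is an assumption; IGEL is instruction-finetuned,
    # so the real template in app.py may differ.
    prompt = f"Fasse den folgenden Artikel in einem kurzen Teaser zusammen:\n{article_text}\n"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Strip the prompt tokens and decode only the newly generated snippet.
    return tokenizer.decode(output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
```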
This repo provides you with a functioning HTTP server for our finetuned snip-igel-500 model. You can use it as is, or package it up with the provided Dockerfile and deploy it to your favorite container hosting provider!
We are currently running this code on Banana, where you can get 1 hour of model hosting for free. Feel free to choose a different hosting provider. In the following section we provide instructions for deployment with Banana.
- Fork this repo
- Log in to the Banana App
- Select your forked repo for deployment
It'll then be built from the Dockerfile, optimized, and deployed on Banana's serverless GPU cluster.
You can monitor build-time and runtime logs by clicking the logs button in the model view on the Banana Dashboard.
When the build and optimization have finished successfully, you will find your credentials printed in the build logs.
Your model is now updated and deployed!
It remains callable with the same credentials:
```
API_KEY=Your-Personal-Api-Key
MODEL_KEY=Your-Personal-Model-Key
```
You need these keys to hook up the web app with the model.
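If you want to call the deployed model directly (outside the web app), the call might look roughly like this. This is a hedged sketch assuming the `banana_dev` Python client's `banana.run(api_key, model_key, model_inputs)` interface and a `prompt` input field; check the Banana SDK docs and the web-app repository for the exact payload.

```python
# Hedged sketch of calling the deployed model via the banana_dev client.
# The run() signature and the "prompt" input field are assumptions --
# consult the Banana SDK docs and the web-app repo for the exact interface.
import banana_dev as banana

api_key = "Your-Personal-Api-Key"
model_key = "Your-Personal-Model-Key"

model_inputs = {"prompt": "Schreibe einen kurzen Teaser für den folgenden Artikel: ..."}

out = banana.run(api_key, model_key, model_inputs)
print(out)
```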
To set up the frontend, follow the instructions in the web-app repository.