ui.Chat() now correctly handles new ollama.chat() return value introduced in ollama 0.4 #1787

Merged
cpsievert merged 8 commits into main from ollama-0.4-fix on Nov 26, 2024

Conversation

cpsievert (Collaborator)

Ollama 0.4 changed the return type of ollama.chat() (ChatResponse) from a TypedDict to a pydantic.BaseModel. As a result, passing that return value directly to .append_message() or .append_message_stream() no longer works (as in the example app below).

This PR fixes the issue, adds a unit test, and fixes another failing unit test.

# ------------------------------------------------------------------------------------
# A basic Shiny Chat example powered by Ollama.
# To run it, you'll need an Ollama server running locally.
# To download and run the server, see https://github.com/ollama/ollama
# To install the Ollama Python client, see https://github.com/ollama/ollama-python
# ------------------------------------------------------------------------------------

import ollama

from shiny.express import ui

# Set some Shiny page options
ui.page_opts(
    title="Hello Ollama Chat",
    fillable=True,
    fillable_mobile=True,
)

# Create and display empty chat
chat = ui.Chat(id="chat")
chat.ui()


# Define a callback to run when the user submits a message
@chat.on_user_submit
async def _():
    # Get messages currently in the chat
    messages = chat.messages(format="ollama")
    # Create a response message stream
    # Assumes you've run `ollama run llama3` to start the server
    response = ollama.chat(
        model="llama3",
        messages=messages,
        stream=True,
    )
    # Append the response stream into the chat
    await chat.append_message_stream(response)
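
For context on the breakage, here is a quick illustrative sketch of the return-type difference the fix has to account for. It assumes a local Ollama server with llama3 available, as in the example above; the dict-style access on the new model relies on the __getitem__() behavior discussed later in this thread.

import ollama

# A single (non-streaming) call to illustrate the return-type change
response = ollama.chat(
    model="llama3",
    messages=[{"role": "user", "content": "Hello!"}],
)

# ollama < 0.4: `response` is a plain dict (ChatResponse was a TypedDict)
# ollama >= 0.4: `response` is a pydantic ChatResponse model, which still
# supports dict-style access because ollama defines __getitem__() on it
print(type(response))
print(response["message"]["content"])  # works on both old and new versions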

@cpsievert cpsievert requested a review from gadenbuie November 26, 2024 17:18
Comment on lines 48 to 67
 def get_default_tokenizer() -> TokenizersTokenizer:
     try:
         from tokenizers import Tokenizer

         return Tokenizer.from_pretrained("bert-base-cased")  # type: ignore
-    except ImportError:
-        raise ValueError(
-            "Failed to download a default tokenizer. "
-            "A tokenizer is required to impose `token_limits` on `chat.messages()`. "
-            "To get a generic default tokenizer, install the `tokenizers` "
-            "package (`pip install tokenizers`). "
-        )
-    except Exception as e:
-        raise ValueError(
-            "Failed to download a default tokenizer. "
-            "A tokenizer is required to impose `token_limits` on `chat.messages()`. "
-            "Try downloading a different tokenizer using "
-            "`tokenizers.Tokenizer.from_pretrained()`. "
-            f"Error: {e}"
-        ) from e
+    except Exception:
+        pass
+
+    return None
cpsievert (Collaborator, Author):

This change is orthogonal to the main fix of this PR, but it's a good idea, and I ran into it because bert-base-cased temporarily went offline for a couple of the test runs.

gadenbuie (Collaborator):

Are token_limits always imposed, or is that opt-in or opt-out? It's hard to follow the thread back to where the user would choose that, and you might want to include that information in the message, e.g. if it's possible to disable token limits.

cpsievert (Collaborator, Author):

No, it's opt-in

gadenbuie (Collaborator):

Seems like the second message is missing some instruction about how to choose a specific tokenizer that was in the original message. I also think it'd be nice to include how to turn off the need for a tokenizer, but that might be obvious from the code you'd write to get here.
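
To make the opt-in behavior discussed above concrete, here is a minimal sketch of both knobs: supplying an explicit tokenizer to ui.Chat() and opting in to token_limits when reading messages. The tokenizer= argument and the (max_tokens, reserve) tuple shape are assumptions about the shiny API, not something established by this diff.

from tokenizers import Tokenizer

from shiny.express import ui

# Supplying an explicit tokenizer avoids any dependence on downloading the
# bert-base-cased default inside get_default_tokenizer()
chat = ui.Chat(id="chat", tokenizer=Tokenizer.from_pretrained("gpt2"))
chat.ui()


@chat.on_user_submit
async def _():
    # token_limits is opt-in: leave it at its default and no tokenizer is
    # needed; pass a limit (assumed here to be a (max_tokens, reserve) tuple)
    # to trim older messages before sending them to the model
    messages = chat.messages(format="ollama", token_limits=(4096, 400))
    ...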

Comment on lines +239 to +244
if isinstance(ChatResponse, dict):
    return "message" in message and super().can_normalize(
        message["message"]
    )
else:
    return isinstance(message, ChatResponse)
gadenbuie (Collaborator):

Is some of the context here that you have a message normalizer for Pydantic models?

gadenbuie (Collaborator):

I don't think you necessarily need to leave a comment, but it'd be helpful for my understanding of the code if you just quickly explained how this fixes the problem (beyond the simple explanation that before it was a dict and now it isn't, that part I get).

cpsievert (Collaborator, Author) commented Nov 26, 2024:

DictNormalizer works for either case since ollama defines __getitem__() on the pydantic model. I suppose that is a weird/subtle thing that requires extra context, and it'd be nice to take advantage of stronger pydantic typing, but I opted for the minimal change (especially if we're going to support older versions)
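
To spell that out with a self-contained sketch: a pydantic model that forwards dict-style access to attribute access keeps the existing dict-based normalization working. The classes below are stand-ins for illustration, not ollama's actual implementation.

from typing import Any

from pydantic import BaseModel


class SubscriptableModel(BaseModel):
    # Stand-in for the dict-style access ollama 0.4 adds to its pydantic models
    def __getitem__(self, key: str) -> Any:
        # Forward dict-style lookups to attribute access, so code written
        # against the old TypedDict return value keeps working
        return getattr(self, key)


class Message(SubscriptableModel):
    role: str
    content: str


class ChatResponse(SubscriptableModel):
    message: Message


resp = ChatResponse(message=Message(role="assistant", content="Hello!"))
print(resp["message"]["content"])  # dict-style access, as with ollama < 0.4
print(resp.message.content)        # attribute access on the pydantic model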

gadenbuie (Collaborator) left a comment:

My comments are all small and non-blocking

@cpsievert cpsievert enabled auto-merge (squash) November 26, 2024 19:44
@cpsievert cpsievert merged commit 46d8ab8 into main Nov 26, 2024
40 checks passed
@cpsievert cpsievert deleted the ollama-0.4-fix branch November 26, 2024 19:50