Add support for Anthropic prompt caching #1755

wch · 2024-11-02T13:54:25Z

This change adds support for Anthopic's beta prompt caching feature.

cpsievert · 2024-11-04T16:53:13Z

Here's a minimal example derived https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching

from anthropic import AsyncAnthropic
from app_utils import load_dotenv

from shiny.express import ui

load_dotenv()
llm = AsyncAnthropic()

chat = ui.Chat(id="chat")
chat.ui()
chat.update_user_input(value="Analyze the major themes in 'Pride and Prejudice'")


@chat.on_user_submit
async def _():
    response = await llm.beta.prompt_caching.messages.create(
        model="claude-3-5-sonnet-20241022",
        max_tokens=1024,
        stream=True,
        system=[
            {
                "type": "text",
                "text": "You are an AI assistant tasked with analyzing literary works. Your goal is to provide insightful commentary on themes, characters, and writing style.\n",
            },
            {
                "type": "text",
                "text": "<the entire contents of 'Pride and Prejudice'>",
                "cache_control": {"type": "ephemeral"},
            },
        ],
        messages=[{"role": "user", "content": chat.user_input()}],
    )
    await chat.append_message_stream(response)

cpsievert · 2024-11-04T16:55:45Z

shiny/ui/_chat_normalize.py

+            if isinstance(chunk, RawPromptCachingBetaMessageStartEvent):
+                return True
+
+            return False


This change to can_normalize_chunk() looks good.

If we want to also support this feature for the non-streaming case, we'll want to also update can_normalize() to know about PromptCachingBetaMessage (i.e., the return that you get with llm.beta.prompt_caching.messages.create(..., stream=False))

Add support for Anthropic prompt caching

c5f9551

wch requested a review from cpsievert November 2, 2024 13:54

cpsievert mentioned this pull request Nov 4, 2024

Change chat custom normalizer error message to point to correct function #1754

Open

cpsievert reviewed Nov 4, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Anthropic prompt caching #1755

Add support for Anthropic prompt caching #1755

wch commented Nov 2, 2024

cpsievert commented Nov 4, 2024

cpsievert Nov 4, 2024 •

edited

Loading

Add support for Anthropic prompt caching #1755

Are you sure you want to change the base?

Add support for Anthropic prompt caching #1755

Conversation

wch commented Nov 2, 2024

cpsievert commented Nov 4, 2024

cpsievert Nov 4, 2024 • edited Loading

Choose a reason for hiding this comment

cpsievert Nov 4, 2024 •

edited

Loading