This repository has been archived by the owner on Jul 11, 2023. It is now read-only.

adding exp backoff retry decorator to openai embedding and completion calls #12

Open
wants to merge 3 commits into base: main

Conversation

cfortuner
Owner

Adds a new utility -> Retry!

  • Updated the OpenAI provider's embedOne, generate, and stream methods to use Retry!

It's a TypeScript decorator that lets you retry a call up to N times:

  @retry(3)
  async generate(
    promptText: string,
    options: GenerateCompletionOptions = DEFAULT_COMPLETION_OPTIONS
  ) {
    try {
      if (options.s
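
The decorator implementation itself isn't shown above; as a rough illustration, a minimal method decorator along these lines, assuming simple exponential backoff between attempts (the actual utility may differ), could look like:

function retry(maxRetries: number, baseDelayMs = 1000) {
  return function (
    _target: unknown,
    _propertyKey: string,
    descriptor: PropertyDescriptor
  ) {
    const original = descriptor.value;
    descriptor.value = async function (...args: unknown[]) {
      let attempt = 0;
      while (true) {
        try {
          return await original.apply(this, args);
        } catch (error) {
          attempt += 1;
          if (attempt > maxRetries) throw error;
          // Exponential backoff between attempts: 1s, 2s, 4s, ...
          const delayMs = baseDelayMs * 2 ** (attempt - 1);
          await new Promise((resolve) => setTimeout(resolve, delayMs));
        }
      }
    };
    return descriptor;
  };
}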

Let me know what you think!

@vercel

vercel bot commented Feb 14, 2023

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name: docs-promptable | Status: ❌ Failed | Updated: Feb 14, 2023 at 7:26PM (UTC)

@cfortuner mentioned this pull request Feb 14, 2023
Contributor

@mathisobadia left a comment


Making some comments here to clarify what I already said on Discord.


private embedMany = async (
@retry(3)
Contributor


I would remove this line as it would retry the whole batch

Comment on lines -265 to -268
this.api.createEmbedding({
...options,
input: text.replace(/\n/g, " "),
})
Contributor


Replace this with a call to this.embedOne. Since embedOne has the retry decorator, each individual call will be retried on failure instead of the whole batch.
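
A rough sketch of that change, matching the shape of the existing method (EmbedManyOptions and DEFAULT_EMBED_OPTIONS are placeholder names, and embedOne is assumed to carry @retry(3)):

private embedMany = async (
  texts: string[],
  options: EmbedManyOptions = DEFAULT_EMBED_OPTIONS
) => {
  // Each text goes through this.embedOne, which has the retry decorator,
  // so a failure only retries that single embedding rather than the whole batch.
  return Promise.all(texts.map((text) => this.embedOne(text, options)));
};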

Contributor

@mathisobadia left a comment


Another comment on checking whether the error is throttling before retrying: you don't want to wait up to 10 seconds before realizing that the request is malformed or that your API key is wrong.

logger.log(chalk.red(`Maximum retries exceeded`));
throw error; // re-throw error if maximum retries exceeded
} else {
logger.log(chalk.yellow(`Retrying...`));
Contributor


I think you might want to retry only when the error indicates the request was throttled. That would make this function less abstract, since we'd have to assume the error is an Axios error and do a check that looks like:

if (error.response.status === 429)

So this would only work for Axios errors, but I think it makes sense for now, since this is only used with the OpenAI client, which uses Axios under the hood.
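
As a sketch, a retry helper restricted to throttling errors could look like this (assuming an Axios-style error shape, which is what the OpenAI client surfaces):

import axios from "axios";

async function retryOnThrottle<T>(fn: () => Promise<T>, maxRetries = 3): Promise<T> {
  let attempt = 0;
  while (true) {
    try {
      return await fn();
    } catch (error) {
      const throttled =
        axios.isAxiosError(error) && error.response?.status === 429;
      // Fail fast on anything that isn't throttling (malformed request, bad API key, ...)
      if (!throttled || attempt >= maxRetries) throw error;
      attempt += 1;
      // Exponential backoff before the next attempt: 2s, 4s, 8s, ...
      await new Promise((resolve) => setTimeout(resolve, 1000 * 2 ** attempt));
    }
  }
}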

@yourbuddyconner
Contributor

Another option for retry logic here that is probably more performant / better supported is the async-retry library.

Here's an example implementation from my codebase, wrapping the openai SDK:

import retry from "async-retry";

export const openaiCompletion = trace("openaiCompletion", _openaiCompletion);
async function _openaiCompletion(prompt: string, model: string = "text-davinci-003", temperature: number = 1, nTokens: number = 500): Promise<string> {
    const response = await retry(
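        // async-retry passes `bail`, which can be called to skip further retries
        // for errors that shouldn't be retried (not wired up in this example)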
        async (bail) => {
            return openai.createCompletion({
                model: model,
                prompt,
                temperature: temperature,
                max_tokens: nTokens,
                top_p: 1,
                frequency_penalty: 0,
                presence_penalty: 0
            })
        },
        {
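            // retries: 8 with factor: 4 and minTimeout: 1000 gives roughly
            // exponential backoff: waits of about 1s, 4s, 16s, ... between attempts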
            retries: 8,
            factor: 4,
            minTimeout: 1000,
            // onRetry: (error: any) => console.log(error)
        }
    )
    const text = response.data.choices[0].text
    return text!
}

Thoughts:

  • Tracing this with promptable is useful, as an aside.
  • Lets you pass an arbitrary error handler callback (not wired up here)
  • Retry logic (e.g. retries and factor) can be parameterized and exposed to the user to turn the knobs
  • Not a lot of overhead to add, just an extra import.

I would probably opt to add this at the base ModelProvider level, and I think it's worth considering implementing this as a function decorator so that the retry logic can be added without too much additional boilerplate. That said, thinking deeply about retry logic on an API-specific basis is a really good idea, because what is good for OpenAI APIs might not hold for other service providers.
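
For illustration only, a provider-level sketch along those lines (the class shape and option names here are assumptions, not the existing promptable API):

import retry from "async-retry";

// Hypothetical knobs surfaced to users of the provider.
interface ProviderRetryOptions {
  retries?: number;    // number of retry attempts
  factor?: number;     // exponential backoff factor
  minTimeout?: number; // delay before the first retry, in ms
}

abstract class ModelProvider {
  constructor(protected retryOptions: ProviderRetryOptions = {}) {}

  // Shared wrapper so every provider call gets the same retry behavior.
  protected withRetry<T>(fn: () => Promise<T>): Promise<T> {
    return retry(
      async (bail) => {
        try {
          return await fn();
        } catch (error: any) {
          // Only retry throttling (HTTP 429); bail out on other API errors.
          if (error?.response?.status && error.response.status !== 429) {
            bail(error);
            return undefined as never;
          }
          throw error;
        }
      },
      {
        retries: this.retryOptions.retries ?? 8,
        factor: this.retryOptions.factor ?? 4,
        minTimeout: this.retryOptions.minTimeout ?? 1000,
      }
    );
  }
}

A concrete provider could then wrap its API calls in this.withRetry(() => ...) and still override the policy where a particular service's rate limits call for something different.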

@cfortuner
Owner Author

Just updating here: holding off on adding this for now; we have some other ideas we'd like to try that might be better.

@yourbuddyconner
Contributor

Cool @cfortuner lmk if you want me to review the solution when you have a PR.

@ymansurozer

Having read @mathisobadia's comments, I think that makes more sense. There is a trade-off here:

  • With batching, we save on request count, so fewer requests count toward the requests-per-minute rate limit.
  • One by one, we save processing time because we don't lose time when a whole batch is rejected, but this means many more requests count toward the requests-per-minute rate limit.

One more thing to consider is how to handle embedding workloads that exceed 250k tokens per minute. If we batch, we have to construct each array to stay below that limit. One by one, since a single embedding request cannot exceed the model's max token length, we are safe; even if the total exceeds 250k tokens per minute, the retry policy would handle it.

So I'd say reducing the time needed to process embeddings and being able to handle large inputs are more important (at least for my case). But maybe we could have processing policies to handle both.
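
On the batch-construction point: a rough sketch of grouping texts so each request stays under a token budget (countTokens is passed in as a hypothetical helper, e.g. backed by a tokenizer):

function batchByTokenBudget(
  texts: string[],
  countTokens: (text: string) => number,
  budget: number
): string[][] {
  const batches: string[][] = [];
  let current: string[] = [];
  let used = 0;
  for (const text of texts) {
    const cost = countTokens(text);
    // Start a new batch when adding this text would exceed the budget.
    if (current.length > 0 && used + cost > budget) {
      batches.push(current);
      current = [];
      used = 0;
    }
    current.push(text);
    used += cost;
  }
  if (current.length > 0) batches.push(current);
  return batches;
}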

4 participants