avoid expensive initialization #18
We're experiencing this as well -- requiring this package takes ~600ms on my M1 MBP.
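(The timing output from the original comment was lost in extraction. As a rough sketch of how such a measurement can be reproduced locally, assuming a CommonJS entry point in Node:)

```ts
// Sketch only, not from the original comment: time the cold load of the package.
console.time('require gpt-tokenizer');
require('gpt-tokenizer');
console.timeEnd('require gpt-tokenizer');
// Prints something like "require gpt-tokenizer: 612ms" on a cold start.
```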
Would it be hard to lazily require the encodings only once they're first needed?
Same issue on my end.
For Cloudflare Workers I suggest you look at this:
To get around the 400ms startup time limit of Cloudflare Workers, I just import the library within `fetch`:

```ts
export default {
  async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
    const { encode } = await import('gpt-tokenizer');
    // ....
  }
}
```

Regarding the other library suggested by @thdoan, I couldn't get it to work.
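(A possible refinement of the snippet above; this is a sketch, not from the thread. It memoizes the import so the promise is created once per isolate while still keeping the load out of the Workers startup phase. The handler body is illustrative.)

```ts
// Sketch: lazily load gpt-tokenizer once per isolate, outside Workers startup.
// Note: the module registry already caches dynamic imports, so the memoization
// mainly documents intent and avoids re-evaluating the import expression.
let tokenizer: Promise<typeof import('gpt-tokenizer')> | undefined;

const loadTokenizer = () => (tokenizer ??= import('gpt-tokenizer'));

export default {
  async fetch(request: Request): Promise<Response> {
    const { encode } = await loadTokenizer();
    const tokens = encode(await request.text()); // hypothetical usage
    return new Response(`token count: ${tokens.length}`);
  },
};
```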
When designing the library, the decision was made to keep the tokenizer loadable synchronously. You could try experimenting with V8's code cache, introduced in Node 22.1.0; startup should be much faster with it enabled. Here's more info about this. We could also experiment with an alternative way of storing the encodings so that parsing is simpler and lighter on resources. I'd need to profile to see what is causing the bulk of the startup time right now. Suggestions and PRs welcome, as I'm constrained on time at the moment.
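(For reference, a minimal sketch of opting into the compile cache; my example, not from the thread. Node 22.1.0 exposes it via the `NODE_COMPILE_CACHE` environment variable, and Node 22.8.0+ adds a programmatic API.)

```ts
// Sketch, assuming Node >= 22.8.0 for the programmatic API.
// On 22.1.0–22.7.x, set NODE_COMPILE_CACHE=<dir> in the environment instead.
import { enableCompileCache } from 'node:module';

// Persist compiled bytecode to disk so later startups skip parse/compile work.
enableCompileCache();

// Load the heavy module after enabling the cache so it can benefit.
const { encode } = await import('gpt-tokenizer');
console.log(encode('hello world').length);
```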
fixes #18 (or at least attempts to do so)
fixes #18
chore: preliminary benchmarking tool
feat: update tokenization format to make the package smaller
docs: update README
🎉 This issue has been resolved in version 2.3.0 🎉 The release is available on:
Your semantic-release bot 📦🚀
The new version should be much faster to load and lighter (data shown for the GPT-3.5-turbo tokenizer).
Amazing work!
Hello,
I'm using the following:
which seems to slow down initialization enough that I can't deploy to Cloudflare Workers with this library. Is there a way to lazily initialize things?