
Auto truncate for vertex embedding is broken by TokenCountBatchingStrategy #1831

Open
GregoireW opened this issue Nov 26, 2024 · 2 comments
Labels: bug (Something isn't working), vertex

@GregoireW

Bug description

I tried to use Vertex embedding and tested a big document, with the auto-truncate option set to true. This corresponds to these options:

(embedded code reference not captured)

But I got an exception from TokenCountBatchingStrategy:

(embedded code reference not captured)

What should I do in this situation?

Environment

Spring AI 1.0.0-M4 / jdk21

Steps to reproduce

Use vertex embedding with the "auto truncate" option, and test with a large payload.

Expected behavior

Success, or at least some guidance in the documentation on how to make it work.

Minimal Complete Reproducible example

var document = new Document("go ".repeat(50000));
vectorStore.add(List.of(document));

@sobychacko sobychacko added this to the 1.0.0-M5 milestone Nov 27, 2024
@sobychacko sobychacko self-assigned this Nov 27, 2024
@sobychacko
Contributor

@GregoireW, which vector store are you using? To fix this issue, you need to provide a custom BatchingStrategy bean in your application. The default TokenCountBatchingStrategy implementation uses the default context-window size set by OpenAI (8191 tokens). You need to adjust the max token size when using different embedding models. Here is an example of overriding this bean:

@Bean
@ConditionalOnMissingBean(BatchingStrategy.class)
BatchingStrategy chromaBatchingStrategy() {
    return new TokenCountBatchingStrategy(EncodingType.CL100K_BASE, maxInputTokenCount, 0.1);
}

See the javadoc on TokenCountBatchingStrategy for more details: https://github.com/spring-projects/spring-ai/blob/main/spring-ai-core/src/main/java/org/springframework/ai/embedding/TokenCountBatchingStrategy.java
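For the Vertex case specifically, a fuller sketch of such an override as a standalone configuration class (the class name, bean name, and 2048-token limit below are placeholders for illustration, not values taken from the Spring AI or Vertex docs; check your embedding model's actual input limit):

```java
import com.knuddels.jtokkit.api.EncodingType;
import org.springframework.ai.embedding.BatchingStrategy;
import org.springframework.ai.embedding.TokenCountBatchingStrategy;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class EmbeddingBatchConfig {

    // Placeholder limit; replace with your Vertex embedding model's
    // documented maximum input token count.
    private static final int MAX_INPUT_TOKEN_COUNT = 2048;

    @Bean
    BatchingStrategy vertexBatchingStrategy() {
        // CL100K_BASE only approximates the model's real tokenizer,
        // so the 0.1 reserve keeps batches 10% below the hard limit.
        return new TokenCountBatchingStrategy(EncodingType.CL100K_BASE, MAX_INPUT_TOKEN_COUNT, 0.1);
    }
}
```

The vector store auto-configuration picks up this bean in place of the default strategy, so batches are sized for the Vertex model rather than for OpenAI's 8191-token window.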

@GregoireW
Author

I use pgVector storage.

If the way to go is to create a BatchingStrategy, I think the documentation on the auto-truncate feature should say so explicitly.

Or, if making this work with other embedding models is still a work in progress, then this would be corrected later.

I'll let you close this issue, or mark it as a "todo" if you prefer.

@markpollack markpollack added bug Something isn't working vertex labels Dec 1, 2024