LlamaIndex Instrumentation #24
Conversation
nit: this file is long 😅
Can you split into multiple files based on the part of llamaindex you instrument?
I've moved the LLM instrumentation into a separate class and genericWrapper into utils. I tried to refactor LlamaIndexInstrumentation similarly to how it's done in Python. However, the _wrap method is part of the InstrumentationBase class.
packages/instrumentation-llamaindex/src/custom-llm-instrumentation.ts
if (shouldSendPrompts(plugin.config)) {
  span.setAttribute(
    `${SpanAttributes.LLM_COMPLETIONS}.0.role`,
    result.message.role,
  );
  span.setAttribute(
    `${SpanAttributes.LLM_COMPLETIONS}.0.content`,
    result.message.content,
  );
Two points here:
- Are you sure role exists here?
- Can't there be more than one completion?
- Are you sure role exists here?

Yes, it's a non-optional field of the ChatMessage interface:
interface ChatMessage {
  content: any;
  role: MessageType;
}
- Can't there be more than one completion?

I just checked the return type of chat; it can also return an AsyncGenerator. I'll update the chat and completion wrappers to handle streaming responses.
@galkleinman I've pushed changes to add support for capturing streaming responses.
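For context, here is a minimal sketch of one way streaming capture can be handled, assuming the wrapped chat/complete call may return an AsyncGenerator of string chunks. The helper name, the chunk type, and the SpanAttributes import path are illustrative and not necessarily what this PR implements:

import type { Span } from "@opentelemetry/api";
import { SpanAttributes } from "@traceloop/ai-semantic-conventions"; // import path assumed

// Re-yield streamed chunks to the caller and only record the completion
// and end the span once the stream is exhausted.
async function* accumulateStreamingResponse(
  span: Span,
  stream: AsyncGenerator<string>,
): AsyncGenerator<string> {
  let fullText = "";
  try {
    for await (const chunk of stream) {
      fullText += chunk;
      yield chunk; // pass the chunk through unchanged
    }
    span.setAttribute(
      `${SpanAttributes.LLM_COMPLETIONS}.0.content`,
      fullText,
    );
  } finally {
    span.end();
  }
}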
for (const key in moduleExports) {
  const cls = (moduleExports as any)[key];
  if (this.isLLM(cls.prototype)) {
This one will result in duplicate spans.

Explanation:
We instrument OpenAI (for example) independently of LlamaIndex. Since LlamaIndex uses the OpenAI SDK itself, a call to OpenAI will produce duplicate spans: one reported by the LlamaIndex instrumentation and one by the OpenAI instrumentation.

Judging by the name CustomLLMInstrumentation, I guess you saw my Python implementation. How did I solve this kind of issue there? In Python, LlamaIndex has a class called CustomLLM, which is intended as the base class for LLMs other than the trivial/commercial ones (OpenAI, Cohere, Anthropic, etc.); one example is Ollama. So I instrumented only the classes that inherit from CustomLLM. The commercial ones, which inherit from LLM, don't get instrumented by LlamaIndex and therefore don't produce a span here, but rather produce spans through their designated instrumentations.

Unfortunately, I checked, and it seems there isn't such a CustomLLM base class in LlamaIndexTS (validate me). Maybe we should have an exclusion list of LLMs that have their own instrumentation (a rough sketch follows below).
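For illustration, the exclusion could be a simple check inside the patch loop, roughly like this (the set contents and placement are hypothetical, not part of this PR):

// Hypothetical: classes whose SDKs already have dedicated instrumentations
// and would otherwise produce duplicate spans.
const EXTERNALLY_INSTRUMENTED = new Set(["OpenAI", "Anthropic"]);

for (const key in moduleExports) {
  const cls = (moduleExports as any)[key];
  if (EXTERNALLY_INSTRUMENTED.has(key)) {
    continue; // let the dedicated instrumentation report this call
  }
  if (this.isLLM(cls.prototype)) {
    // ...wrap chat/complete as before
  }
}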
But here only the calls instrumented in LlamaIndex are getting captured.
Code I used for testing:
import * as traceloop from "@traceloop/node-server-sdk";
import { OpenAI } from "llamaindex";

traceloop.initialize({
  appName: "sample_llamaindex",
  apiKey: process.env.TRACELOOP_API_KEY,
  disableBatch: true,
});

class SampleLlamaIndex {
  @traceloop.workflow("sample_query")
  async query() {
    const openai = new OpenAI();
    const res = await openai.complete('How are you?');
    return res;
  }
}

traceloop.withAssociationProperties(
  { user_id: "12345", chat_id: "789" },
  async () => {
    const sampleLlamaIndex = new SampleLlamaIndex();
    const result = await sampleLlamaIndex.query();
    console.log(result);
  },
);
  `${lodash.snakeCase(className)}.completion`,
);

span.setAttribute(SpanAttributes.LLM_VENDOR, "llamaindex");
Is there a way to figure out the actual LLM vendor here? I'd say that if we're using OpenAI, for example, then writing OpenAI here is more correct.
I can replace it with the class name of the LLM; I think that's the closest thing to the vendor. We've mapped LLM_VENDOR to the class name in the Python instrumentation as well (a rough sketch of that mapping is below).
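For example, something along these lines, where llm stands for the wrapped LLM instance (a sketch only; the variable name and the fallback value are assumptions):

// Sketch: use the LLM class name (e.g. "OpenAI", "Ollama") as the vendor
// instead of the hardcoded "llamaindex" string.
const vendor = llm.constructor?.name ?? "llamaindex";
span.setAttribute(SpanAttributes.LLM_VENDOR, vendor);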
Fixes: #22
/claim #22
Testing:
llamaindex.mp4
I had to skip the embedding part in this video because my daily quota for the embedding API was exhausted. I'll update the testing video once I receive a new quota.