Supporting local models for jupyter-ai with Ollama #868
Conversation
…l","tinyllama","qwen2"!
# Conflicts:
#   docs/source/users/index.md
#   packages/jupyter-ai-magics/jupyter_ai_magics/embedding_providers.py
for more information, see https://pre-commit.ci
How do I use this update?
Ref: https://jupyter-ai.readthedocs.io/en/latest/contributors/index.html
Thanks @767472021 for this PR. I left some comments herein. Please refer to PR #646 as well. Nice work! I have also tested the code on a few Ollama models.
Thank you very much for adding this PR, which also complements PR #646.

[1] It would be nice to add more documentation on Ollama, directing the user to install it first (and how), and pointing to the necessary Ollama resources. Specific examples of the Ollama CLI commands needed would help the user; see the note by @pedrogutobjj above for an example. Hopefully, people who use Ollama with its CLI will know what to do, but the docs should reflect the requirement that the local Ollama server must be started with ollama serve.

[2] Since the number of models in Ollama is very large, the drop-down list can become unwieldy. Would it be possible to implement Ollama the same way Hugging Face is implemented, with an input box for the model in the chat settings? A similar approach would also be needed for the choice of Embedding Model.
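For illustration, a minimal sketch of how such a registry-style provider could be declared. It borrows the models = ["*"] / registry = True pattern and the TextField base-URL field from the Hugging Face provider in jupyter_ai_magics/providers.py; the exact attribute and field names in the final implementation may differ.

from langchain_community.llms import Ollama

from jupyter_ai_magics.providers import BaseProvider, TextField


class OllamaProvider(BaseProvider, Ollama):
    id = "ollama"
    name = "Ollama"
    # models = ["*"] together with registry = True is how the Hugging Face
    # provider exposes a free-form model-ID input box instead of a fixed
    # drop-down list in the chat settings.
    models = ["*"]
    registry = True
    model_id_key = "model"
    # Optional override for a non-default local server address.
    fields = [TextField(key="base_url", label="Base API URL (optional)", format="text")]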
@@ -728,6 +730,84 @@ async def _acall(self, *args, **kwargs) -> Coroutine[Any, Any, str]:
        return await self._call_in_executor(*args, **kwargs)


class OllamaProvider(BaseProvider, Ollama):
Please see the similar implementation in PR #646, which uses a much simpler class, and check whether all the functions defined in your file from line 760 onwards are necessary or are actually called to produce the response.
    def _send_request(self, endpoint: str, data: dict) -> dict:
        """Send a POST request to the specified Ollama API endpoint."""
        url = f"{self.base_url}/{endpoint}"
        print("url is : ", url)
Do you need this print statement? If so, use self.log.info(), as is the approach in much of the code base.
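A hedged sketch of what that could look like, assuming the remainder of _send_request posts the payload with requests and returns the decoded JSON (the body below is illustrative, not the exact code in this PR):

    def _send_request(self, endpoint: str, data: dict) -> dict:
        """Send a POST request to the specified Ollama API endpoint."""
        url = f"{self.base_url}/{endpoint}"
        # Log through the provider's logger rather than printing to stdout.
        self.log.info("Sending Ollama request to %s", url)
        response = requests.post(url, json=data)
        response.raise_for_status()
        return response.json()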
@@ -18,6 +18,7 @@
    Union,
)

import requests
If the functions in your new class can be removed, then you will not need this line.
    def _generate(self, prompt: str, model: str = "mistral", stop: list = None) -> str:
        """Generate text using the /generate endpoint."""
        data = {
            "model": model,
            "prompt": prompt,
            **({"stop": stop} if stop else {}),
        }
        response = self._send_request("api/generate", data)
        return response.get("response", "")

    def _chat(self, messages: list, model: str = "mistral") -> str:
        """Chat using the /chat endpoint."""
        data = {
            "model": model,
            "messages": messages,
        }
        response = self._send_request("api/chat", data)
        return response.get("response", "")

    def _call(self, prompt: str = None, messages: list = None, **kwargs) -> str:
        """Determine which API endpoint to use based on input and call it."""
        print(self.base_url)
        if prompt is not None:
            return self._generate(prompt=prompt, **kwargs)
        elif messages is not None:
            return self._chat(messages=messages, **kwargs)
        else:
            raise ValueError("Either 'prompt' or 'messages' must be provided.")
Is there a need to implement the Ollama class methods yourself? The LangChain Ollama chat model already provides these methods, and forking the code might deviate from the official LangChain LLMs.
https://python.langchain.com/v0.1/docs/integrations/chat/ollama/
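For reference, a minimal usage sketch of the LangChain community integration called directly. It assumes a local Ollama server on the default port and that the mistral model has already been pulled; this is not code from the PR.

from langchain_community.llms import Ollama

# The community class already wraps the HTTP calls to the local server, so a
# provider subclass normally does not need its own request helpers.
llm = Ollama(model="mistral", base_url="http://localhost:11434")
print(llm.invoke("Say hello in one short sentence."))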
Agreed. I noted the same above, and I have also tested that removing these functions does not impair the functionality.
How long will it take to finalize the Ollama implementation in Jupyter AI and release the update to everyone?
Thanks very much for revisiting this important enhancement, which was raised in PR #646 and has now been implemented and merged. Closing this one out now, and very grateful for your contribution.
Add Ollama, now supporting the following local models: "gemma", "gemma2", "llama2", "llama3", "phi3", "mistral", "tinyllama", and "qwen2"!
Magic commands can also be used (learn, ask, ...).
Testing is complete and everything works well.
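A hypothetical notebook usage sketch, assuming the provider registers under the ollama prefix and that ollama pull mistral and ollama serve have been run locally (the model ID and prefix here are illustrative):

# First cell: load the magics extension
%load_ext jupyter_ai_magics

# Later cell: the %%ai cell magic must be the first line of its cell
%%ai ollama:mistral
Summarize what a Python context manager does in two sentences.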