Deion Is it possible to use Conversational Search (RAG) with

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

CC: <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url=

You can now customize the openai api base url: <a href="https://typesense.org/docs/26.

[Feature Request] Conversational Search (RAG) with a local LLM about typesense HOT 10 OPEN

elliot-sawyer commented on June 29, 2024 1

[Feature Request] Conversational Search (RAG) with a local LLM

from typesense.

Comments (10)

jasonbosco commented on June 29, 2024 3

I misspoke earlier. Turns out that we actually added support for vLLM through which you can run several local LLMs. Just haven't documented it yet.

Will post a link here once we update the docs.

from typesense.

piccaso commented on June 29, 2024 1

I misspoke earlier. Turns out that we actually added support for vLLM through which you can run several local LLMs. Just haven't documented it yet.

Will post a link here once we update the docs.

Thats huge. If you manage to make this easily accessible it could be quite the hype.
Looking forward to try it!

from typesense.

piccaso commented on June 29, 2024

It takes only care of the R part of RAG but yes, custom models and using GPU are supported.
And you also have the option to generate the embeddings yourself and store them.

Check out all the subtopics of this part of the documentation:
https://typesense.org/docs/26.0/api/vector-search.html#index-embeddings

from typesense.

jasonbosco commented on June 29, 2024

@piccaso Typesense does support the "AG" part of RAG, by integrating with ChatGPT / Cloduflare APIs: https://typesense.org/docs/26.0/api/conversational-search-rag.html

@elliot-sawyer We don't yet have a way to integrate with local LLMs. But I'll leave this open as a feature request.

from typesense.

jasonbosco commented on June 29, 2024

May I know which local LLMs you're looking for?

from typesense.

elliot-sawyer commented on June 29, 2024

I don't have a particular one in mind yet - would any of the Typesense models on HuggingFace be appropriate? I'll have an NVIDIA A100 available in a couple of months to do some Typesense work with, but only on the stipulation that I use a locally downloaded LLM (no network or API keys).

from typesense.

Ku3mi41 commented on June 29, 2024

@jasonbosco It's nice to know that this is being done. I already started testing this myself, without documentation (heh) and ran into an authorization problem. Now api_key is not used by vLLM at all, could you add this? In case when LLM inference on different server it's is important to have authentication.

from typesense.

jasonbosco commented on June 29, 2024

CC: @ozanarmagan

from typesense.

0x4139 commented on June 29, 2024

I misspoke earlier. Turns out that we actually added support for vLLM through which you can run several local LLMs. Just haven't documented it yet.

Will post a link here once we update the docs.

It would be great if we could use any OpenAI compatible APIs like Ollama LLama.cpp VLLM and etc, i think that would save everybody a lot of headache, we just need a way to set the openai api url in the creation of the model.

from typesense.

jasonbosco commented on June 29, 2024

You can now customize the openai api base url: https://typesense.org/docs/26.0/api/vector-search.html#using-openai-compatible-apis

from typesense.

[Feature Request] Conversational Search (RAG) with a local LLM about typesense HOT 10 OPEN

Comments (10)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent