Comments (10)
I misspoke earlier. Turns out that we actually added support for vLLM through which you can run several local LLMs. Just haven't documented it yet.
Will post a link here once we update the docs.
from typesense.
I misspoke earlier. Turns out that we actually added support for vLLM through which you can run several local LLMs. Just haven't documented it yet.
Will post a link here once we update the docs.
Thats huge. If you manage to make this easily accessible it could be quite the hype.
Looking forward to try it!
from typesense.
It takes only care of the R part of RAG but yes, custom models and using GPU are supported.
And you also have the option to generate the embeddings yourself and store them.
Check out all the subtopics of this part of the documentation:
https://typesense.org/docs/26.0/api/vector-search.html#index-embeddings
from typesense.
@piccaso Typesense does support the "AG" part of RAG, by integrating with ChatGPT / Cloduflare APIs: https://typesense.org/docs/26.0/api/conversational-search-rag.html
@elliot-sawyer We don't yet have a way to integrate with local LLMs. But I'll leave this open as a feature request.
from typesense.
May I know which local LLMs you're looking for?
from typesense.
I don't have a particular one in mind yet - would any of the Typesense models on HuggingFace be appropriate? I'll have an NVIDIA A100 available in a couple of months to do some Typesense work with, but only on the stipulation that I use a locally downloaded LLM (no network or API keys).
from typesense.
@jasonbosco It's nice to know that this is being done. I already started testing this myself, without documentation (heh) and ran into an authorization problem. Now api_key
is not used by vLLM at all, could you add this? In case when LLM inference on different server it's is important to have authentication.
from typesense.
CC: @ozanarmagan
from typesense.
I misspoke earlier. Turns out that we actually added support for vLLM through which you can run several local LLMs. Just haven't documented it yet.
Will post a link here once we update the docs.
It would be great if we could use any OpenAI compatible APIs like Ollama LLama.cpp VLLM and etc, i think that would save everybody a lot of headache, we just need a way to set the openai api url in the creation of the model.
from typesense.
You can now customize the openai api base url: https://typesense.org/docs/26.0/api/vector-search.html#using-openai-compatible-apis
from typesense.
Related Issues (20)
- Parent of nested object array is the array of object itself and not the parent object HOT 1
- add a sortable field
- Requesting support for Scandinavian letters (ä, ö, æ, ø, å) HOT 2
- Empty matched_tokens when synonyms matched
- USD symbol works in symbols_to_index but EUR / GBP symbols do not
- Malformed JSON returned for bad infix search request HOT 4
- Feature Request: Fallback options for remote embedding services HOT 2
- `stem: true` prevents `filter_by` from returning results HOT 3
- Querying facets only cuts off value string HOT 3
- BALANCED parentheses filter query will trigger `"Could not parse the filter query.` HOT 2
- The desired functionality should support MultiPolygon to determine if a point falls within a specific area, enabling the retrieval of the code associated with that area.
- "Bad JSON" for POST collections request HOT 1
- Typesense increase num_documents after restart HOT 1
- Integration for Haystack.
- exclude search keywords in override rules HOT 13
- [Feature Request] Set query_by and num_typos in overrides
- Feauture request: Support for multiple references to the same collection in a particular collection
- Unable to filter for existent/non-existent attributes HOT 1
- Updating docker stack crashes cluster HOT 4
- Conversation Search using existing embedding HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from typesense.