devflowinc / qdrant-svd-vs-lantern-vs-pgvector

This project forked from devflowinc/clickhouse-vs-qdrant-comparison

A comparison of Qdrant (SVD) to lanterndb (usearch + postgres) and pgvector. TL;DR: lanterndb and pgvector are viable replacements, but they need tuning against Qdrant to be accurate.

Home Page: https://github.com/arguflow/arguflow

License: GNU General Public License v3.0

Jupyter Notebook 100.00%

qdrant-svd-vs-lantern-vs-pgvector's Introduction

Arguflow

Offering a product suite for putting arbitrary models into production semantic search and retrieval-augmented LLM-chat experiences on your company's data

Documentation | Competitive Debate Search Demo | Competitive Debate Chat Demo | Discord

Pgvector and lanterndb/usearch are now nearly as fast and accurate as an SVD (Qdrant)

The objective of this notebook was to assess the feasibility of replacing our system's SVD, Qdrant, with pgvector or lanterndb (roughly postgresql + usearch). Using one of these postgresql-based OLTP solutions would give us a single transactional database with schema and transaction support for both objects and vectors, eliminating the need to join against an external database during our various search operations.

Star us on GitHub at github.com/arguflow/arguflow!

Findings

Both pgvector and lanterndb are nearly as fast as Qdrant and can be equally accurate after tuning. In practice, this means you should first load your vectors into both Qdrant and pgvector or lanterndb, then tweak your HNSW index params, m and ef_construction, until the postgres solution is just as accurate as Qdrant. After that, move forward with postgres alone.
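The tuning loop described above can be sketched in Python. This is a minimal illustration, not the notebook's actual code: it treats Qdrant's top-k result ids as the reference, scores the postgres results against them, and builds one pgvector index statement per (m, ef_construction) pair. The table name `items`, the column name `embedding`, the cosine operator class, and all function names are assumptions made for the example; lanterndb uses its own index syntax, which is not shown here.

```python
from itertools import product


def recall_at_k(reference_ids, candidate_ids, k):
    """Fraction of the reference top-k that also appears in the candidate top-k."""
    reference = set(reference_ids[:k])
    if not reference:
        return 0.0
    return len(reference & set(candidate_ids[:k])) / len(reference)


def hnsw_index_sql(table, column, m, ef_construction):
    """Build a pgvector HNSW index statement for one parameter combination.

    The table and column names are hypothetical placeholders.
    """
    return (
        f"CREATE INDEX ON {table} USING hnsw ({column} vector_cosine_ops) "
        f"WITH (m = {m}, ef_construction = {ef_construction});"
    )


def parameter_grid(m_values, ef_construction_values):
    """Every (m, ef_construction) pair to try during tuning."""
    return list(product(m_values, ef_construction_values))
```

For each pair in the grid you would rebuild the index, rerun the same queries against both systems, and keep the smallest setting whose recall against Qdrant is acceptable for your workload.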

If you are not already using postgres and do not have requirements for an ACID-compliant solution, then we would still recommend Qdrant. It has a lot of convenience features, supports quantization, and does not require tuning to be accurate.

Speed Comparisons

[Figure: pgsolutions vs qdrant]

Accuracy comparisons

[Figure: pgvector accuracy]

[Figure: lanterndb accuracy]

Replicating our results

  1. Download the dataset via this link. This is roughly the DebateSum dataset, but with some improved parsing logic and dedup detection, as noted in our docs.
  2. Place the dataset in the same directory as this notebook.
  3. docker compose up -d
  4. cat .env.dist > .env
  5. Run all cells to reproduce our findings.
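If you prefer to script the file-related steps, here is a small Python sketch. It only checks that the dataset sits next to the notebook (step 2) and copies `.env.dist` to `.env` (step 4); starting the containers is still done with the docker command above. The function name `prepare_workspace` and the `dataset_name` argument are hypothetical, since the dataset's file name depends on what you downloaded.

```python
import shutil
from pathlib import Path


def prepare_workspace(notebook_dir: Path, dataset_name: str) -> Path:
    """Verify the dataset is in place and create .env from .env.dist.

    dataset_name is whatever file you downloaded in step 1 (an assumption
    of this sketch, not a name taken from the repository).
    """
    dataset = notebook_dir / dataset_name
    if not dataset.is_file():
        raise FileNotFoundError(f"expected the dataset at {dataset}")

    env_dist = notebook_dir / ".env.dist"
    env = notebook_dir / ".env"
    # Same effect as `cat .env.dist > .env`: overwrite .env with the template.
    shutil.copyfile(env_dist, env)
    return env
```

Call it with the directory that contains the notebook, then open the notebook and run all cells.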

qdrant-svd-vs-lantern-vs-pgvector's People

Contributors

skeptrunedev, cdxker, vid277

qdrant-svd-vs-lantern-vs-pgvector's Issues

feature: test pgvector v0.5.0 against Qdrant

Context: These benchmarks will not use granularity, so you should be able to drastically simplify the tables that get created.

Steps:
