Comments (7)

mayooear commented on May 15, 2024

Embeddings cost $0.0004 / 1K tokens, so they are very cost-effective. (1 token is approx. 3/4 of a word.)
Using OpenAI's [tokenizer](https://platform.openai.com/tokenizer?view=bpe), you can see how tokens are calculated.
For example, a 50-page PDF is approx. 25,000 words, which is approx. 33,000 tokens, or about $0.013 to embed (33 x $0.0004).

For context, embedding 1.2 million tokens costs approx. 0.48 USD.
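
The arithmetic above can be sketched in a few lines of Python. This is only a rough estimate: the rate and the 3/4-words-per-token rule of thumb are taken from this comment, and the function names are made up for illustration. For exact counts you would run the text through a real tokenizer instead of dividing word counts.

```python
# Rate quoted in this thread: $0.0004 per 1K tokens for embeddings.
EMBEDDING_USD_PER_1K_TOKENS = 0.0004
# Rule of thumb from this thread: 1 token is roughly 3/4 of a word.
WORDS_PER_TOKEN = 0.75


def estimate_tokens(word_count: int) -> int:
    """Rough token count from a word count (heuristic, not a real tokenizer)."""
    return round(word_count / WORDS_PER_TOKEN)


def embedding_cost_usd(token_count: int) -> float:
    """Cost in USD of embedding token_count tokens at the quoted rate."""
    return token_count / 1000 * EMBEDDING_USD_PER_1K_TOKENS


if __name__ == "__main__":
    tokens = estimate_tokens(25_000)  # the 50-page PDF example
    print(tokens, round(embedding_cost_usd(tokens), 4))
    print(round(embedding_cost_usd(1_200_000), 2))  # the 1.2M-token example
```

For the 25,000-word example this gives about 33,000 tokens and a little over one cent; the 1.2 million token case comes out at 0.48 USD, matching the figure above.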

With respect to Pinecone pricing, the free tier is very generous; paid/production-level pricing is listed on Pinecone's pricing page.

As for the gpt-4 calls, I will continue to review the intermediate steps and get back on that shortly.

from gpt4-pdf-chatbot-langchain.

mayooear commented on May 15, 2024

> Hello,
>
> Could you give us an idea of the total costs for a 56-page document given one query:
>
>   • creating the embeddings (a one-time step)
>   • storing the embeddings in Pinecone
>   • matching a query of 250 tokens against the embeddings: cost of ADA, and cost of the query to Pinecone
>   • the first query to GPT-4: chat history + the query
>   • the second query to GPT-4: standalone question + relevant documents
>
> It seems like a lot of queries, so it would be very helpful to have an idea of these costs.
>
> Btw, thank you for this tutorial!

Let me look into this and get back to you shortly.
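
In the meantime, the per-query steps listed in the question can be laid out as a small calculator. This is a sketch under stated assumptions, not a measured result: only the embedding rate comes from this thread. The two GPT-4 rates are placeholder defaults to be checked against OpenAI's pricing page, every token count in the example is hypothetical, prompt-template overhead is ignored, and Pinecone storage and query costs are left out because they depend on the plan.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """USD per 1K tokens. Only the embedding rate is quoted in this thread."""
    embedding: float = 0.0004      # from the thread
    gpt4_prompt: float = 0.03      # ASSUMPTION: placeholder, check OpenAI pricing
    gpt4_completion: float = 0.06  # ASSUMPTION: placeholder, check OpenAI pricing


def usd(tokens: int, rate_per_1k: float) -> float:
    """Cost in USD of `tokens` tokens at a per-1K-token rate."""
    return tokens / 1000 * rate_per_1k


def per_query_cost(
    rates: Rates,
    query_tokens: int,
    history_tokens: int,
    standalone_tokens: int,
    context_tokens: int,
    answer_tokens: int,
) -> dict[str, float]:
    """Break one chat turn into the steps listed in the question."""
    steps = {
        # Embedding the query so it can be matched against the stored vectors.
        "embed_query": usd(query_tokens, rates.embedding),
        # First GPT-4 call: chat history + query in, standalone question out.
        "gpt4_condense": usd(history_tokens + query_tokens, rates.gpt4_prompt)
        + usd(standalone_tokens, rates.gpt4_completion),
        # Second GPT-4 call: standalone question + retrieved documents in, answer out.
        "gpt4_answer": usd(standalone_tokens + context_tokens, rates.gpt4_prompt)
        + usd(answer_tokens, rates.gpt4_completion),
    }
    steps["total"] = sum(steps.values())
    return steps


if __name__ == "__main__":
    # All token counts except the 250-token query are hypothetical.
    breakdown = per_query_cost(
        Rates(),
        query_tokens=250,
        history_tokens=500,
        standalone_tokens=60,
        context_tokens=3000,
        answer_tokens=400,
    )
    for step, cost in breakdown.items():
        print(f"{step}: ${cost:.4f}")
```

Whatever rates are plugged in, the structure makes one thing visible: the query embedding is a rounding error next to the two GPT-4 calls, and the second call dominates because it carries the retrieved documents in its prompt.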

alfredo-f commented on May 15, 2024

This is a fantastic idea!

Maybe adding a small counter of dollars spent in the front-end can save you from a heart attack when the credit card bill rolls in
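
A minimal sketch of the bookkeeping behind such a counter, written in Python for brevity rather than in the project's front-end code. The class and method names are hypothetical, and the per-1K-token rates have to be supplied by the caller.

```python
class SpendCounter:
    """Running tally of estimated API spend for one session."""

    def __init__(self) -> None:
        self.total_usd = 0.0
        self.by_label: dict[str, float] = {}

    def add(self, label: str, tokens: int, usd_per_1k_tokens: float) -> float:
        """Record one API call and return its estimated cost in USD."""
        cost = tokens / 1000 * usd_per_1k_tokens
        self.total_usd += cost
        self.by_label[label] = self.by_label.get(label, 0.0) + cost
        return cost

    def display(self) -> str:
        """Text a UI could show next to the chat box."""
        return f"${self.total_usd:.4f} spent this session"


if __name__ == "__main__":
    counter = SpendCounter()
    counter.add("embedding", 33_000, 0.0004)  # rate quoted in this thread
    print(counter.display())
```

The token counts would come from the usage data the API returns with each response, so the counter is only as accurate as the rates it is given.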

sebastienfi commented on May 15, 2024

About Pinecone pricing, it would be possible to switch to pgvector for self-hosting.

kimjongbing commented on May 15, 2024

Also curious about this! Not sure how much money I'd burn through if I used this.

databill86 commented on May 15, 2024

> About Pinecone pricing, it would be possible to switch to pgvector for self-hosting.

I think there are a good number of vector database alternatives referenced by OpenAI in the ChatGPT retrieval plugin repository. They didn't mention pgvector, but I wonder if it's possible to plug Weaviate or Redis in here.

dosubot commented on May 15, 2024

Hi, @databill86! I'm Dosu, and I'm here to help the gpt4-pdf-chatbot-langchain team manage their backlog. I wanted to let you know that we are marking this issue as stale.

From what I understand, you are requesting information on the total costs associated with processing a 56-page PDF document with one query. There have been discussions about the cost-effectiveness of embeddings and the pricing of Pinecone, as well as a suggestion to switch to pgvector for self-hosting. However, the issue remains unresolved.

Before we close this issue, we wanted to check with you if it is still relevant to the latest version of the gpt4-pdf-chatbot-langchain repository. If it is, please let us know by commenting on the issue. Otherwise, feel free to close the issue yourself, or it will be automatically closed in 7 days.

Thank you for your understanding and contribution to the project!
