Git Product home page Git Product logo

Comments (8)

shenlei1020 avatar shenlei1020 commented on July 17, 2024

感谢您的反馈~

  1. 首先bce的两个模型都是bert base规模,比其他large规模的模型,效率高大约3倍;

  2. 关于这两个模型高效推理,我们最近release出了一个版本(推理框架onnxruntime-gpu):

  1. 如果在集成上述高效推理方案有任务问题,欢迎在此issue提出!

from bcembedding.

deku0818 avatar deku0818 commented on July 17, 2024

感谢您的反馈~

  1. 首先bce的两个模型都是bert base规模,比其他large规模的模型,效率高大约3倍;
  2. 关于这两个模型高效推理,我们最近release出了一个版本(推理框架onnxruntime-gpu):
  1. 如果在集成上述高效推理方案有任务问题,欢迎在此issue提出!

感谢!这部分代码已经跑起来了,我发现推理的结果是不一样的,是因为模型转换丢失精度吗?

from bcembedding.

deku0818 avatar deku0818 commented on July 17, 2024

sentences = ["This is a test sentence.", "Another sentence for embedding."]
image
image
我进行了简单的测试,发现onnx 推理比SentenceTransformer还慢是为什么?

from bcembedding.

shenlei1020 avatar shenlei1020 commented on July 17, 2024

1、你的onnx模型是自己转的,还是直接下载我们开源的onnx模型?
2、如果是下载我们在qanything开源的onnx模型,是没问题的。注意是qanything最新放出来的embedding和reranker的onnx模型。
3、onnx和torch的结果稍许差异是正常的,看一下cos相似度是不是0.99,是的话就没问题。
4、推理慢的原因可能是:a、你的onnxruntime是cpu版本的,解决方案是先卸载onnxruntime,再pip install onnxruntime-gpu;b、确定onnxruntime-gpu的包是对的,用我们的推理代码,应该不会有问题的,注意看一下CUDAExecutionProvider是否成功。

from bcembedding.

deku0818 avatar deku0818 commented on July 17, 2024

1、你的onnx模型是自己转的,还是直接下载我们开源的onnx模型? 2、如果是下载我们在qanything开源的onnx模型,是没问题的。注意是qanything最新放出来的embedding和reranker的onnx模型。 3、onnx和torch的结果稍许差异是正常的,看一下cos相似度是不是0.99,是的话就没问题。 4、推理慢的原因可能是:a、你的onnxruntime是cpu版本的,解决方案是先卸载onnxruntime,再pip install onnxruntime-gpu;b、确定onnxruntime-gpu的包是对的,用我们的推理代码,应该不会有问题的,注意看一下CUDAExecutionProvider是否成功。

感谢指导,已经定位到原因了,应该是是cuda libcublasLt.so.11的问题,导致只使用了CPU

from bcembedding.

deku0818 avatar deku0818 commented on July 17, 2024

image
image
快了3倍

from bcembedding.

deku0818 avatar deku0818 commented on July 17, 2024

余弦相似度: 0.9999987920724557

from bcembedding.

deku0818 avatar deku0818 commented on July 17, 2024

image image 快了3倍

后面简单跑了下,平均快了将近20倍

from bcembedding.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.