Comments (5)
The error message "TensorRT EP failed to create engine from network" indicates that something went wrong when the TRT EP called TRT's API buildSerializedNetwork(). Since it happens when dealing with a large image, I suspect it's due to OOM.
Could you increase trt_max_workspace_size and see whether that helps? The default is 1 GB.
Also, a quick question: can you repro the issue using trtexec?
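For readers following along, here is a minimal sketch of how the workspace size could be raised from Python. It assumes the ONNX Runtime Python API with the TensorRT EP available; the helper names `tensorrt_providers` and `create_session` and the model path are hypothetical, while `trt_max_workspace_size` is the documented provider option and takes a value in bytes.

```python
GIB = 1024 ** 3  # trt_max_workspace_size is expressed in bytes


def tensorrt_providers(workspace_gib=1):
    """Build a providers list requesting a TensorRT workspace of `workspace_gib` GiB.

    Hypothetical helper for illustration; it only assembles the arguments.
    """
    if workspace_gib < 1:
        raise ValueError("workspace_gib must be at least 1")
    trt_options = {"trt_max_workspace_size": workspace_gib * GIB}
    # CUDA and CPU providers are listed as fallbacks for nodes TensorRT does not take.
    return [
        ("TensorrtExecutionProvider", trt_options),
        "CUDAExecutionProvider",
        "CPUExecutionProvider",
    ]


def create_session(model_path, workspace_gib=1):
    """Create an inference session with the options above (requires onnxruntime-gpu)."""
    # Imported lazily so the helper above can be inspected without onnxruntime installed.
    import onnxruntime as ort

    return ort.InferenceSession(model_path, providers=tensorrt_providers(workspace_gib))
```

Calling `create_session("model.onnx", workspace_gib=4)` would then request a 4 GiB workspace. Whether a larger workspace avoids the engine build failure depends on the model and the GPU memory actually available.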
from onnxruntime.
I tried with trt_max_workspace_size (https://onnxruntime.ai/docs/execution-providers/TensorRT-ExecutionProvider.html#trt_max_workspace_size) set to 2G, 4G, and 8G, with the same result. I also get an additional warning if it is set greater than 1G.
Hmm, that's strange. Could you share the code that sets trt_max_workspace_size?
Please see the example code here.
As for trtexec, some models are not fully TRT eligible, and that seems to be the case for your model, so trtexec won't be able to run it. How about trying trtexec with TRT 10?
Could you share the proxy model so that we can repro the issue on our side? Or could you point to a public model that reproduces the issue?
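A rough sketch of how a trtexec repro command could be assembled is shown here. It assumes a trtexec build recent enough to accept `--memPoolSize=workspace:<MiB>` (older releases used a different workspace flag); the function name `trtexec_command` and the model path are hypothetical, and the command is only built, not executed.

```python
import shlex


def trtexec_command(onnx_path, workspace_mib=4096, verbose=True):
    """Return the argument list for a trtexec run against an ONNX model.

    Hypothetical helper: it assembles the command but does not run it.
    """
    args = [
        "trtexec",
        f"--onnx={onnx_path}",
        # Workspace pool size in MiB (assumed flag spelling, see the note above).
        f"--memPoolSize=workspace:{workspace_mib}",
    ]
    if verbose:
        # Verbose logging helps show which layer fails during the engine build.
        args.append("--verbose")
    return args


# Render the command as a single shell-safe string for copy and paste.
print(shlex.join(trtexec_command("model.onnx")))
```

If the same engine build failure shows up here, the problem is likely on the TensorRT side rather than in the ONNX Runtime EP.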