Comments (2)
Errors when trying to export the model to ONMX:
-
When using the original code to convert NVT offset representation of sequence features to sparse features:
"getting an error: RuntimeError: Exporting the operator repeat_interleave to ONNX opset version 9 is not supported." -
Sara/Ronay changed the torch.repeat_interleave b np.repeat and the error changed to:
"RuntimeError: Exporting the operator sparse_coo_tensor to ONNX opset version 9 is not supported."
Next steps:
- [] Julio was about to create an option to generate dense tensors for list columns. Follow-up on that and check if we are able to export the model to ONMX using the latest NVTabular version (which now provides dicts of tensors)
- [] Triton team has suggested us to try newest version of ONMX (current opset is 9)
- [] Try export using TorchScript instead of ONMX. That requires us to change our ensemble on Triton to use TorchScript instead of ONMX
- [] Use Python backend for Triton
P.s. We need the NVT preproc module on Triton to be able to work with multi-hot (list columns) - Issue on NVTabular
from transformers4rec.
I tried the ecom-rees NVT ETL notebook to generate config file and serve the model_nvt
to triton.
- One error is about datetime columns. when we have datetime object columns as input or output, we cannot generate config files, so so we need to convert these columns to timestamp column as integer dtypes.
- Once we get rid of datetime columns, we can generate the config file, Model is loaded to the server. But when I send a client request it gives error. It looks like it complains about lambda and filter ops.
Will create a GH issue on NVTabular and followup on that.
from transformers4rec.
Related Issues (20)
- [BUG] conda env import error cudf HOT 2
- [FEA] Feature to extract attention values from transformer heads
- [BUG] Incorrect scores for evaluation
- [FEA] Multi-task prediction support with Next-Item-Prediction HOT 1
- How to use Transformers4Rec with pandas HOT 2
- [QST] ValueError: For masking a categorical_module is required including an item_id.
- [QST] Projecting inputs of NextItemPredictionTask to'64' As weight tying requires the input dimension '320' to be equal to the item-id embedding dimension '64' HOT 4
- [QST] Cross-entropy and pairwise losses are supported in Next Item Prediction
- [QST] How to print metrics while training?
- RuntimeError: CUDF failure at: /__w/cudf/cudf/cpp/src/io/parquet/reader_impl_helpers.cpp:379: Invalid rowgroup index[BUG] HOT 10
- Génerating predictions HOT 5
- [BUG] Inconsistent inference and evaluation results of the XLNET-CLM even on the training set! HOT 2
- [BUG] CausalLanguageModeling masking error on last item only condition HOT 1
- [QST] Help with creating two tower model with transformers. HOT 1
- [FEA] Post context fusion using T4rec api HOT 1
- [BUG] CausalLanguageModeling do not mask last input item HOT 3
- [QST] Extracting User Representation Vectors from Pre-trained Next Item Prediction Model
- [BUG] AttributeError: 'list' object has no attribute 'output_node'" HOT 3
- Model is not generating accurate recommandations [QST]
- [BUG] RuntimeError: PyTorch execute failure: Expected Tensor but got GenericList
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from transformers4rec.