Comments (4)
Hey @dpdrmj 👋!
Thank you so much for reporting the issue/feature request 🚨.
Someone from SynapseML Team will be looking to triage this issue soon.
We appreciate your patience.
from synapseml.
Hello everyone! Can someone please help here? Does anyone know what could've caused this?
from synapseml.
val (trainingData, validationData) =
  if (get(validationIndicatorCol).isDefined && dataset.columns.contains(getValidationIndicatorCol))
    (df.filter(x => !x.getBoolean(x.fieldIndex(getValidationIndicatorCol))),
      Some(sc.broadcast(preprocessData(df.filter(x =>
        x.getBoolean(x.fieldIndex(getValidationIndicatorCol)))).collect())))
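If the validation data is large, the "collect" call uses a lot of memory: it pulls every validation row onto the driver, and the broadcast then ships a copy to each executor. You need to set driver and executor memory (spark.driver.memory and spark.executor.memory) very high.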
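To get a rough feel for how much memory that "collect" needs, here is a minimal back-of-the-envelope sketch. It assumes every feature value ends up as an 8-byte double and ignores JVM object overhead, so the real footprint will be higher. The object name, the function name, and the row and feature counts are hypothetical and are not part of SynapseML.

```scala
object ValidationMemoryEstimate {
  // Lower bound in bytes: one 8-byte double per feature value.
  // JVM object headers and Row wrappers are ignored (assumption).
  def rawBytes(validationRows: Long, numFeatures: Int): Long =
    validationRows * numFeatures.toLong * 8L

  def main(args: Array[String]): Unit = {
    // Hypothetical sizes, chosen only for illustration.
    val rows = 10000000L
    val features = 100
    val gib = rawBytes(rows, features).toDouble / (1L << 30)
    println(f"at least $gib%.2f GiB on the driver, and again on each executor")
  }
}
```

If the estimate is already close to the configured driver memory, shrinking the validation split is likely cheaper than raising the memory settings.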
from synapseml.
Setting driver.memory and executor.memory very high can fix it, but it is slow and consumes a lot of resources. I hope the SynapseML team can find a way to rewrite this code and replace the "collect".
from synapseml.