Comments (4)
@lgo-solytic , Please provide the write options you are using and the cluster name. Please provide the approx time of these runs as well
from azure-kusto-spark.
Please provide the write options you are using and the cluster name. Please provide the approx time of these runs as well
Options:
KustoSinkOptions.KUSTO_TABLE_CREATE_OPTIONS -> "CreateIfNotExist",
KustoSinkOptions.KUSTO_STAGING_RESOURCE_AUTO_CLEANUP_TIMEOUT -> "5",
KustoSinkOptions.KUSTO_WRITE_ENABLE_ASYNC -> "true",
KustoSinkOptions.KUSTO_WRITE_MODE -> "Queued"
Nothing additional in SparkIngestionProperties
Both jobs were running (streaming) for longer than 48h uninterrupted.
from azure-kusto-spark.
@lgo-solytic : BlobAlreadyReceived_BlobAlreadyFoundInBatch is a Kusto warning from : https://learn.microsoft.com/en-us/azure/data-explorer/error-codes#category-blobalreadyreceived
(it correlates to a similar case of 2 blobs getting ingested as well)
Is there any other Connector logs you see from this failure ? The logs start with KustoConnector for the Spark connector. If you find anything in that correlates, that'd be useful
There is no correlation inference we can draw between the drop in volumes related to this error. You need not use the option
KustoSinkOptions.KUSTO_WRITE_ENABLE_ASYNC -> "true", , exceptions in tasks are not propagated to driver if this is used.
from azure-kusto-spark.
@ag-ramachandran
thanks for your answer.
No unfortunately no logs that could help. Is it possible to configure the KustoConnector to log to application insights?
The same thing happened again last night. But this time only one job was running so at least we can exclude the case of two jobs colliding.
from azure-kusto-spark.
Related Issues (20)
- no option to pass in the appId/appKey with the API call for authentication in Synapse HOT 2
- Support for Scala 2.13 HOT 2
- Write to Kusto in Synapse with option "sparkIngestionPropertiesJson" always failed in spark 3.3 HOT 2
- Cannot write to ADX from Azure Databricks using Kusto connector for pyspark "com.microsoft.kusto.spark.datasource" HOT 6
- ThrottleExceptions when writing data to ADX/Kusto HOT 12
- Stuck at connecting to Kusto HOT 1
- Ingestion fails for tables with "-" in the names HOT 1
- KUSTO_MANAGED_IDENTITY_AUTH is not a member of com.microsoft.kusto.spark.common.KustoOptions and com.microsoft.kusto.spark.datasink.KustoSinkOptions HOT 7
- Importing the spark connector enables verbose logging HOT 6
- com.microsoft.azure.kusto.data.auth.CloudDependentTokenProviderBase.initializeWithCloudInfo throws Null Pointer Exception HOT 2
- Overwrite data option not working HOT 1
- Spark write to Synapse error: java.lang.NoClassDefFoundError: com/twitter/util/TimeoutException HOT 17
- Unable to Authenticate Using Managed Identity HOT 3
- ExtendedKustoClient: Some extents were not processed and we got an empty move result'1' Please open issue if you see this trace. At: https://github.com/Azure/azure-kusto-spark/issues HOT 1
- DeviceAuthentication does not exist in the JVM on Databricks Runtime 14.3 LTS HOT 2
- Dependency issues after update to maven kusto-spark_3.0_2.12:5.0.7 HOT 2
- Kusto library not working when we enabled private end-point on Databricks. Issue is <>.blob.core.windows.net Name or service not known HOT 7
- Writing data to an ADX with private endpoints HOT 5
- UnknownHostException for created storage account HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from azure-kusto-spark.