Git Product home page Git Product logo

Comments (5)

stoader avatar stoader commented on June 11, 2024

Can you attach logs? Normally that should not exit the entire spark job. It should only exit the thread sending the metrics to puahgateway.

from spark-metrics.

dahiyahimanshu avatar dahiyahimanshu commented on June 11, 2024

@stoader : Yes applciation is not exiting. Only exception in the logs every 1 minute. Not sure what will be the impact on the application performance in this degraded mode.
Driver logs

2019-07-22 22:18:21,969 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:19:20,776 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:19:20,783 WARN org.apache.spark.sql.execution.streaming.ProcessingTimeExecutor: Current batch is falling behind. The trigger interval is 5000 milliseconds, but spent 7310 milliseconds
2019-07-22 22:20:20,826 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]

Executor logs :


2019-07-22 22:17:26,571 WARN org.apache.hadoop.util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2019-07-22 22:18:41,954 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:19:41,680 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:20:41,689 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:21:41,685 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:22:41,687 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:23:41,681 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:24:41,681 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]
2019-07-22 22:25:41,684 ERROR com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp: Sending metrics failed due to: 
java.net.UnknownHostException: eaa-platform-pushgateway222
	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184) ~[na:1.8.0_102]
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[na:1.8.0_102]
	at java.net.Socket.connect(Socket.java:589) ~[na:1.8.0_102]
	at sun.net.NetworkClient.doConnect(NetworkClient.java:175) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:432) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.openServer(HttpClient.java:527) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.<init>(HttpClient.java:211) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:308) ~[na:1.8.0_102]
	at sun.net.www.http.HttpClient.New(HttpClient.java:326) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1169) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1105) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:999) ~[na:1.8.0_102]
	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:933) ~[na:1.8.0_102]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.doRequest(PushGatewayWithTimestamp.java:232) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp.pushAdd(PushGatewayWithTimestamp.java:168) [spark-metrics.jar:2.3-2.1.0]
	at com.banzaicloud.spark.metrics.sink.PrometheusSink$Reporter.report(PrometheusSink.scala:126) [spark-metrics.jar:2.3-2.1.0]
	at com.codahale.metrics.ScheduledReporter.report(ScheduledReporter.java:162) [metrics-core.jar:3.1.2]
	at com.codahale.metrics.ScheduledReporter$1.run(ScheduledReporter.java:117) [metrics-core.jar:3.1.2]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_102]
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [na:1.8.0_102]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [na:1.8.0_102]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_102]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_102]

from spark-metrics.

stoader avatar stoader commented on June 11, 2024

This should not impact the driver as publishing metrics is async. The only impact I can see is filling the logs.

What's your suggestion regarding how to handle the case when Pushgateway is down or unreachable?

from spark-metrics.

dahiyahimanshu avatar dahiyahimanshu commented on June 11, 2024

some alternatives :

  • Possibly not to log such exceptions in application logs. (But i think that would be too dirty not to know any implication)

<logger name="com.banzaicloud.metrics.prometheus.client.exporter.PushGatewayWithTimestamp" level="OFF"/>

  • May be not to show these exceptions in stderr section of driver/executor from spark ui but do capture them in driver/executor logs.

  • May be implement a circuit breaker which can decrease the frequency of logs i.e. https://martinfowler.com/bliki/CircuitBreaker.html

  • or KISS (Keep it simple & stupid)

Need your opinion.

from spark-metrics.

stoader avatar stoader commented on June 11, 2024

Agree, not logging exceptions has its implications as you won't be aware that there is an issue on the system that needs to be looked at.

Since spark uses log4j you should be able to filter out these errors log4j or direct into a different log file through log4j configuration such that these logs don't flood spark logs. Have you tried that?

from spark-metrics.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.