Comments (13)
The check you mention will only log a warning if there are not sufficient outbound peers. It will never cause a shutdown. So there must be something wrong with the call to the backend.
from lnd.
thanks, then I will investigate more
It has happened multiple times since v0.18; before that, almost never (0-2y).
from lnd.
Hmm, weird. The only thing I can imagine (that is new in 0.18) is that the call to GetPeerInfo itself fails for some reason. But I think any errors from that should be logged. What's your log level?
You should see a message that starts with INF: Health check: that contains the actual error.
from lnd.
debuglevel=info,WTCL=warn
...
2024-06-15 21:39:40.807 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:07:40.807 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:12:40.808 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:15:10.875 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:17:40.881 [CRT] SRVR: Health check: chain backend failed after 3 calls
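A quick way to read the last three lines above: the gap between consecutive entries should be roughly the 30s timeout plus the 2m0s backoff. The sketch below only parses the timestamps quoted in this comment; the helper names (`parse_timestamp`, `gaps_seconds`) are made up for illustration and are not part of lnd.

```python
from datetime import datetime

# The last three log lines quoted above, copied verbatim.
LOG_LINES = [
    "2024-06-15 22:12:40.808 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s",
    "2024-06-15 22:15:10.875 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s",
    "2024-06-15 22:17:40.881 [CRT] SRVR: Health check: chain backend failed after 3 calls",
]

# Values taken from the log text: 30s timeout, 2m0s backoff.
TIMEOUT_S = 30
BACKOFF_S = 120


def parse_timestamp(line: str) -> datetime:
    """Parse the leading 'YYYY-MM-DD HH:MM:SS.mmm' prefix of a log line."""
    return datetime.strptime(line[:23], "%Y-%m-%d %H:%M:%S.%f")


def gaps_seconds(lines: list) -> list:
    """Return the time differences, in seconds, between consecutive lines."""
    stamps = [parse_timestamp(line) for line in lines]
    return [(b - a).total_seconds() for a, b in zip(stamps, stamps[1:])]


GAPS = gaps_seconds(LOG_LINES)
for gap in GAPS:
    # Each gap should sit close to TIMEOUT_S + BACKOFF_S.
    print(f"gap: {gap:.3f}s, expected about {TIMEOUT_S + BACKOFF_S}s")
```

Both gaps come out at about 150 seconds, which fits three consecutive timed-out calls rather than one long hang.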
from lnd.
Okay, so the actual connection timed out. Did you update to bitcoind v0.27 at the same time as you updated to lnd v0.18.0? Maybe something in bitcoind changed in relation to being offline?
from lnd.
I updated bitcoind only after updating to lnd 0.18 and after the problems had started.
The node itself has already been running for over 3y (started on 2x rsp3).
I am monitoring system resources manually, but all seems fine:
the system has free memory and low CPU usage.
The bitcoind logs do not contain any special errors or warnings.
from lnd.
As mentioned above, we won't shut down based on this message. Instead, it's a warning.
from lnd.
I have now tested with a manual disconnect of 65 sec:
outgoing peers decreased to 7 after the reconnect,
and there were no special log entries.
I will monitor longer and write if I find the issue.
The node itself has never managed to stay online longer than 120h since the last update.
from lnd.
... It already shut down yesterday.
The command docker logs lnd -f --tail 100 | grep "Health check" didn't print a nextline+cd
2024-06-26 16:41:41.012 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 16:44:11.022 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 18:17:40.994 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 18:20:10.996 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 18:22:41.014 [CRT] SRVR: Health check: chain backend failed after 3 calls
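The pattern in these lines (call 1, call 2, then either silence or a critical failure after 3 calls) looks like a bounded retry loop with a timeout per call and a backoff between calls. The following is only a sketch of that general pattern under that assumption, not lnd's actual implementation; `run_health_check` and its parameters are hypothetical names, and the defaults are simply the numbers visible in the log.

```python
import time
from typing import Callable


def run_health_check(
    check: Callable[[float], bool],
    attempts: int = 3,          # "failed after 3 calls" in the log
    timeout_s: float = 30.0,    # "timed out after: 30s"
    backoff_s: float = 120.0,   # "backing off for: 2m0s"
    sleep: Callable[[float], None] = time.sleep,
    log: Callable[[str], None] = print,
) -> bool:
    """Call `check` up to `attempts` times; return True on the first success.

    `check` receives the per-call timeout and returns True when the
    backend answered in time. Between failed calls we wait `backoff_s`.
    """
    for call in range(1, attempts + 1):
        if check(timeout_s):
            return True
        if call < attempts:
            log(f"call: {call} failed, backing off for: {backoff_s:.0f}s")
            sleep(backoff_s)
    log(f"failed after {attempts} calls")
    return False


# Demo with a fake backend and a fake sleep, so nothing actually waits.
outcomes = iter([False, False, True])   # two timeouts, then a success
waited = []
recovered = run_health_check(lambda _t: next(outcomes), sleep=waited.append)
print("recovered:", recovered, "backoffs:", waited)
```

Under this reading, the 16:41 and 16:44 entries would be a run where the third call succeeded, while the 18:17 to 18:22 run failed all three calls. That interpretation is an assumption drawn from the log alone.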
from lnd.
I found the issue: a now-chatty process, storagenode from Storj, suddenly bursts in CPU usage for minutes at a time.
Thanks for the help.
from lnd.
I have now investigated a bit more: I CPU-limited the Storj process and increased the bitcoind dbcache.
What I discovered is that bitcoind uses a high amount of CPU even during a small block download.
During this period, $ time bitcoin-cli -netinfo is very laggy, with samples of 1s, 11s, 29s.
When bitcoind does a tip verification, $ time bitcoin-cli -netinfo takes even longer than 1m, with samples of 1m24s, 2m10s.
@guggero
Can the getPeers API call be the reason for the health check: chain backend timed out after: 30s message on the lnd?
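To compare such measurements against the 30s limit from the lnd log, a small timing helper can be used. This is a minimal sketch; `time_command` and `exceeds_timeout` are hypothetical helper names, and the sample values are just the timings reported in this comment.

```python
import subprocess
import time

# Timeout taken from the lnd log message ("timed out after: 30s").
HEALTH_CHECK_TIMEOUT_S = 30.0


def time_command(cmd: list) -> float:
    """Run `cmd` and return its wall-clock duration in seconds."""
    start = time.monotonic()
    subprocess.run(cmd, stdout=subprocess.DEVNULL,
                   stderr=subprocess.DEVNULL, check=False)
    return time.monotonic() - start


def exceeds_timeout(elapsed_s: float,
                    timeout_s: float = HEALTH_CHECK_TIMEOUT_S) -> bool:
    """True if a call of this duration would have hit the timeout."""
    return elapsed_s > timeout_s


# Timings reported above, converted to seconds: 1s, 11s, 29s, 1m24s, 2m10s.
REPORTED_SAMPLES_S = [1, 11, 29, 84, 130]
TOO_SLOW = [s for s in REPORTED_SAMPLES_S if exceeds_timeout(s)]
print("samples over the 30s limit:", TOO_SLOW)
```

On the node itself, something like `time_command(["bitcoin-cli", "getpeerinfo"])` would measure the RPC directly, assuming `bitcoin-cli` is on the PATH and configured for the running bitcoind. Only the two samples taken during tip verification are over the limit, and 29s is uncomfortably close to it.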
from lnd.
Can the getPeers API call be the reason for the health check: chain backend timed out after: 30s message on the lnd?
That's what I suspect too. But if that call is slow on bitcoind, maybe it's a sign that you shouldn't cram even more things onto a piece of hardware that's barely able to keep up with just Bitcoin stuff? Maybe StorJ is completely hogging the disk as well, which is slowing down block verification.
from lnd.
I found the real issue: it's the fragmentation level of the HDD.
Sequential read/write speed on the idle rsp4 is ~95MB/s;
bitcoind reached 10MB/s read, but this was already the maximum for random access.
@guggero I know this is limited hardware, but it is within the req. specs.
from lnd.