Git Product home page Git Product logo

Comments (13)

guggero avatar guggero commented on July 21, 2024

The check you mention will only log a warning if there are not sufficient outbound peers. It will never cause a shutdown. So there must be something wrong with the call to the backend.

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

thanks, then I will investigate more

It happened multiple times since v0.18,
before near never (0-2y)

from lnd.

guggero avatar guggero commented on July 21, 2024

Hmm, weird. The only thing I can imagine (that is new in 0.18) is that the call to GetPeerInfo itself fails for some reason. But I think any errors from that should be logged. What's your log level?
You should see a message that starts with INF: Health check: that contains the actual error.

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

debuglevel=info,WTCL=warn

...
2024-06-15 21:39:40.807 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:07:40.807 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:12:40.808 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:15:10.875 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-15 22:17:40.881 [CRT] SRVR: Health check: chain backend failed after 3 calls

from lnd.

guggero avatar guggero commented on July 21, 2024

Okay, so the actual connection timed out. Did you update to bitcoind v0.27 at the same time as you updated to lnd v0.18.0? Maybe something in bitcoind changed in relation to being offline?

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

I updated the bitcoind only after lnd 0.18 and the problems started.
The node itself is running already over 3y (started on 2x rsp3)

I monitoring system resources manually but all seams fine
System has Free Memory and Low CPU

The bitcoind logs do not have any special errors/warning

from lnd.

Roasbeef avatar Roasbeef commented on July 21, 2024

As mentioned above, we won't shutdown based on this message. Instead, it's a warning.

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

I tested now with a manual disconnect of 65sec
outgoing peers decreased to 7 after the reconnect
no special log entries

I will monitor longer and write if I find the issue.

The node itself never manage to stay longer then 120h online after the last update.

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

... It already shutdown yesterday
the command docker logs lnd -f --tail 100 | grep "Health check"
didn't print a nextline+cd

2024-06-26 16:41:41.012 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 16:44:11.022 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 18:17:40.994 [INF] HLCK: Health check: chain backend, call: 1 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 18:20:10.996 [INF] HLCK: Health check: chain backend, call: 2 failed with: health check: chain backend timed out after: 30s, backing off for: 2m0s
2024-06-26 18:22:41.014 [CRT] SRVR: Health check: chain backend failed after 3 calls

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

I found the issue, a now chatty process storagenode from storj suddenly burst in CPU usage over minutes.

thx for help

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

I investigated now a bit more, cpu limited the storj process and increased the bitcoind dbcache

What I discovered is that bitcoind uses high amount of cpu on a low block download
at this period the $ time bitcoin-cli -netinfo is very lagy with probs: 1s, 11s, 29s

When the bitcoind does a tip verification the $ time bitcoin-cli -netinfo goes even over 1m with probs: 1m24s, 2m10s

@guggero
Can the getPeers API call be the reason for the health check: chain backend timed out after: 30s message on the lnd?

from lnd.

guggero avatar guggero commented on July 21, 2024

Can the getPeers API call be the reason for the health check: chain backend timed out after: 30s message on the lnd?

That's what I suspect too. But if that call is slow on bitcoind, maybe it's a sign that you shouldn't cram even more things onto a piece of hardware that's barely able to keep up with just Bitcoin stuff? Maybe StorJ is completely hogging the disk as well, which is slowing down block verification.

from lnd.

Zetanova avatar Zetanova commented on July 21, 2024

I found the real issue, its the fragmentation level of the hdd
seq read/write speed on the idle rsp4 are at ~95MB/s

bitcoind made 10MB/s read, but this was already the max in random access.

@guggero I know this is limited hardware, but is in the req. specs.

from lnd.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.