Comments (8)
It looks like snappy compression is using far more CPU in 0.33 for some reason.
Edit: It's because snappy compression for gRPC was added in 0.28. You could try 0.33 with `--receive.grpc-compression=none` and see if it helps. Another hint is that your bandwidth with 0.26 seems to be orders of magnitude higher, which also points at snappy compression.
See:
--receive.grpc-compression=snappy
    Compression algorithm to use for gRPC requests to other receivers. Must be one of: snappy, none
from thanos.
Confirming that setting the `--receive.grpc-compression=none` flag in version 0.33.0 was effective: resource utilization is now similar to that in version 0.26.0. Thanks @MichaHoffmann!
@hayk96 I don't think you need it there. The routers will take care of firing all the requests to achieve replication. The ingesters have nothing more to do than read the incoming non-compressed request and write the data.
I have mixed opinions regarding this. CPU is usually cheaper than network bandwidth, but ingest latency is super important too. If we have to pay in latency to save network bandwidth, this is a very sensitive decision. Latency with compression in @hayk96's environment was awful compared to no compression. And check that memory usage chart...
I uploaded the profiles to pprof.me:
Heap profiles:
- v0.33: https://pprof.me/3a91f164ab7ce91989b90bc866ea0ad2
- v0.26: https://pprof.me/be0808d689c3ce00fb2c655e7ab30955
CPU profiles:
- v0.33: https://pprof.me/3105bcf0b4bd4d9cbfda27640ba23851
- v0.26: https://pprof.me/74cdccfc18754305f15af03c37c6fc50
BTW, should I add that flag to the receive-ingesters as well? Does that make sense?
Maybe we add this in the Troubleshooting; Common cases? What do you think?
> Maybe we add this in the Troubleshooting; Common cases? What do you think?

I'm indifferent, I think. It's a bit of a tradeoff between network bandwidth and CPU, and CPU is usually cheaper, so I think defaulting to snappy here is the correct behavior. But then again, you hit the regression during an update and would have benefited from an FAQ article about it. I don't know what the best course of action is, to be honest!
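
The bandwidth-versus-CPU tradeoff discussed in this thread can be illustrated with a small sketch. This is not Thanos code. It assumes Python's standard-library `zlib` as a stand-in for snappy (which is not in the standard library), and the payload shape and the `build_payload` and `measure` helpers are hypothetical, invented for illustration. Absolute numbers will differ from what a real receiver sees; only the direction of the tradeoff carries over.

```python
import time
import zlib
from typing import Tuple


def build_payload(series: int = 2000) -> bytes:
    """Build a repetitive, label-heavy payload (hypothetical shape, not the real wire format)."""
    lines = [
        f'http_requests_total{{job="api",instance="host-{i % 50}",code="200"}} {i}\n'
        for i in range(series)
    ]
    return "".join(lines).encode()


def measure(payload: bytes, level: int) -> Tuple[int, float]:
    """Return (bytes that would go on the wire, CPU seconds spent compressing).

    level 0 means "no compression"; any other level is passed to zlib.
    """
    start = time.process_time()
    wire = zlib.compress(payload, level) if level > 0 else payload
    return len(wire), time.process_time() - start


payload = build_payload()
raw_size, raw_cpu = measure(payload, 0)
compressed_size, compressed_cpu = measure(payload, 6)

# Compression shrinks the bytes sent but costs CPU time on every request.
print(f"none:       {raw_size} bytes, {raw_cpu:.6f}s CPU")
print(f"compressed: {compressed_size} bytes, {compressed_cpu:.6f}s CPU")
```

Running this on a payload that resembles your own traffic gives a rough feel for how much bandwidth compression saves per request and how much CPU it costs, which is the decision the `--receive.grpc-compression` flag exposes.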