Comments (2)
642f9a1 slashes partition count estimates during repair by 10%, based on the assumption that in average, around 10% of the data is moved by repair. This number is pulled from thin air, and while I expect in most cases it will be fine, in certain cases it will be a gross under-estimation and can lead to bloom filters violating FP chance and generating more IO.
What's the interaction between estimation and RBNO, which moves all of the data?
After e.g. bootstrap, are the repair-written sstables used as they are (and in this case βΒ won't 642f9a1 make their filters very ineffective?), or are they guaranteed to go through some kind of compaction (reshape?) first, which will fix their filters?
from scylladb.
642f9a1 slashes partition count estimates during repair by 10%, based on the assumption that in average, around 10% of the data is moved by repair. This number is pulled from thin air, and while I expect in most cases it will be fine, in certain cases it will be a gross under-estimation and can lead to bloom filters violating FP chance and generating more IO.
What's the interaction between estimation and RBNO, which moves all of the data?
Most of the RBNO operations will move all of the data for a given range. Node ops like rebuild might move only few data in case the node already has some of the data.
After e.g. bootstrap, are the repair-written sstables used as they are (and in this case β won't 642f9a1 make their filters very ineffective?), or are they guaranteed to go through some kind of compaction (reshape?) first, which will fix their filters?
The sstables generated by RBNO will go through off strategy compaction which will integrate them to main dataset. This will fix the filters.
Also the new node is supposed to have a lower "heat" so that the heat_load_balance will route less traffic to it.
from scylladb.
Related Issues (20)
- ancient upgrade document in repo HOT 2
- Repair error: invalid partition-end, partition has an active range tombstone HOT 4
- Abnormal data volume in the commitlog directory HOT 1
- chunked_vector::reserve_partial does not work as documented if size > max_chunk_capacity()
- View update generation times out during index building HOT 1
- [x86_64, debug] auth_cluster/test_auth_v2_migration failed with InvalidRequest HOT 3
- [dtest-release, 6.0] alternator_tests.TesterAlternator.test_alternator_enforce_authorization failed with `StopIteration` HOT 11
- Include static service levels in service levels listing HOT 1
- docs: Issue on page Create a ScyllaDB Cluster - Multi Data Centers (DC)
- docs: Issue on page Replace a Dead Node in a ScyllaDB Cluster
- cql: fix LIMIT clause (epic)
- CDC with tablets: design
- Performance isolation of hints receiving: design
- Incorrect output of describe in full index on frozen collections HOT 1
- document "keyspace" and "table" arguments for "nodetool ring"
- FileNotFoundError: [Errno 2] No such file or directory: 'thrift' HOT 1
- Auth: optimize load on roles table during permission cache population
- RESTful API /storage_service/quiesce_topology may not wait for tablets to be rebalanced shortly after topology change
- tasks: abort method hits use after free
- [aarch64, debug] topology_experimental_raft/test_alternator failed with AssertionError
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scylladb.