Comments (2)
@HoustonPutman
The same issue is happening to me.
The HPA reported that it would scale to 14 pods, but 58 kept running.
In my case the leader election was successful, but I could see a lot of down replicas, which is causing query failures with errors saying that shards are down.
from solr-operator.
OK, so y'all's issues seem somewhat related.
I have seen problems with Solr failing to delete bad replicas during an unsuccessful migration, which is why you are seeing a large increase in the number of replicas.
So I suspect something is wrong with the scale down/up or the migration of the shards. Every pod gets restarted during the downgrade...
This is definitely a problem, and it is related to the fact that you are addressing your Solr nodes through the ingress. To keep Solr traffic from all being directed through the ingress (which would slow things down considerably), we basically use /etc/hosts on the pods to map each ingress address to the IP of the pod it belongs to. Since you are scaling down, some of the /etc/hosts entries are being removed, which requires full restarts every time.
An easy solution to this would be to only update the /etc/hosts if an IP is changed or added. It doesn't really matter if we have unused entries there.
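A minimal sketch of that idea, assuming the host entries can be modeled as a plain hostname-to-IP mapping. This is not the operator's actual code; the function name `merge_host_aliases` and the example hostnames and IPs are hypothetical. Stale entries are kept, and a restart is flagged only when an IP is added or changed:

```python
def merge_host_aliases(current, desired):
    """Merge desired host->IP mappings into the current ones.

    Entries missing from `desired` are kept (unused entries are harmless).
    Returns (merged, changed); `changed` is True only when an IP was
    added or changed, which is the only case that would need a restart.
    """
    merged = dict(current)
    changed = False
    for host, ip in desired.items():
        if merged.get(host) != ip:
            merged[host] = ip
            changed = True
    return merged, changed


# Hypothetical ingress hostnames and pod IPs for a 3-pod cloud.
current = {
    "solr-0.example.com": "10.0.0.1",
    "solr-1.example.com": "10.0.0.2",
    "solr-2.example.com": "10.0.0.3",
}

# Scale down from 3 pods to 2: solr-2 drops out of the desired set.
after_scale_down, restart_needed = merge_host_aliases(
    current,
    {"solr-0.example.com": "10.0.0.1", "solr-1.example.com": "10.0.0.2"},
)

# A pod comes back with a new IP: this does count as a change.
after_ip_change, restart_after_ip_change = merge_host_aliases(
    after_scale_down,
    {"solr-0.example.com": "10.0.0.1", "solr-1.example.com": "10.0.0.9"},
)
```

With this approach the scale-down leaves the mapping untouched, so no restart is triggered, while the changed IP still is detected.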
Anyways, we should definitely have an integration test that stresses the HPA with ingresses, because this seems like a very iffy edge case.
The same issue is happening to me
@sabaribose I think this is separate, because you are not using an ingress, but using the headless service.
I think yours comes from the BalanceReplicas command not being queued for a retry when it fails. But I will do more investigation here.
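To illustrate what "queueing for a retry" could look like in general, here is a small sketch of a requeue loop. It is only an illustration of the pattern, not how the operator handles BalanceReplicas; `run_with_requeue`, `flaky_execute`, and the `max_attempts` limit are all hypothetical names chosen for this example:

```python
from collections import deque


def run_with_requeue(operations, execute, max_attempts=3):
    """Run each operation, putting failed ones back on the queue.

    `execute(op)` returns True on success. An operation is retried until
    it succeeds or has been attempted `max_attempts` times.
    """
    queue = deque((op, 1) for op in operations)
    completed, abandoned = [], []
    while queue:
        op, attempt = queue.popleft()
        if execute(op):
            completed.append(op)
        elif attempt < max_attempts:
            # Requeue instead of silently dropping the failed operation.
            queue.append((op, attempt + 1))
        else:
            abandoned.append(op)
    return completed, abandoned


# Hypothetical executor that fails on the first call and succeeds afterwards.
calls = {}


def flaky_execute(op):
    calls[op] = calls.get(op, 0) + 1
    return calls[op] >= 2


completed, abandoned = run_with_requeue(["balance-replicas"], flaky_execute)
```

Without the requeue branch, a single failure would leave the replicas unbalanced with nothing scheduled to try again.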