Deploying Cass-operator and CassandraDataCenter on Power architecture #561
Comments
Kubernetes should give some indication of what caused the pod to terminate. We have no special handling for killing a pod (especially the one with index 0). The only reason we would kill a pod is if the Cassandra container itself becomes stuck (that is, it loses readiness). Other than that, it would take a rolling restart or a decommission for the operator to start deleting pods, and in those cases the order would be different. I recommend checking the container logs to identify whether there is a reason Cassandra is failing, or whether Kubernetes has some other reason to delete the pod (such as rescheduling).
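One place where Kubernetes records this is the last termination state of each container in the pod's status. The following is a minimal sketch, assuming the pod has been dumped with `kubectl get pod <pod-name> -o json` and parsed into a dict. The sample values are made up for illustration and do not come from this cluster.

```python
def last_terminations(pod: dict) -> dict:
    """Map container name -> (reason, exit code) of its previous termination.

    Reads status.containerStatuses[].lastState.terminated from a Pod object,
    as printed by `kubectl get pod <pod-name> -o json`.
    """
    result = {}
    for cs in pod.get("status", {}).get("containerStatuses", []):
        terminated = cs.get("lastState", {}).get("terminated")
        if terminated:  # only set once the container has been restarted in place
            result[cs["name"]] = (terminated.get("reason"), terminated.get("exitCode"))
    return result


# Hypothetical input: container names and values are illustrative only.
sample = {
    "status": {
        "containerStatuses": [
            {"name": "cassandra",
             "lastState": {"terminated": {"reason": "Error", "exitCode": 143}}},
            {"name": "server-system-logger", "lastState": {}},
        ]
    }
}

for name, (reason, code) in last_terminations(sample).items():
    print(f"{name}: reason={reason} exitCode={code}")
```

If the pod object itself is deleted and recreated rather than restarted in place, this status is lost with it, and the namespace events (`kubectl get events`) are the more likely place to find the cause.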
Thank you for the pointer! The Cassandra logs contain no error explaining why the container is failing. I am attaching the logs below for your reference.
The cassandra container's log would show whether the /drain endpoint was called (it is the shutdown hook for the pod) or whether the shutdown came from another source. If it was the shutdown hook, something should indicate why the pod was shut down. Did the cass-operator logs give any indication? If the operator killed the pod, it should log why.
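A quick way to check for the drain call is to filter the saved container log for lines that mention the endpoint. This is only a sketch: the thread does not show the log format, so the sample lines below are invented, and a plain substring match on `/drain` is an assumption about how the call would appear.

```python
from typing import List


def find_drain_calls(log_text: str) -> List[str]:
    """Return the log lines that mention the /drain endpoint."""
    return [line for line in log_text.splitlines() if "/drain" in line]


# Hypothetical log excerpt, not taken from the reporter's cluster.
sample_log = "\n".join([
    "INFO  node is up",
    "INFO  request received for /drain",
    "INFO  shutting down",
])

matches = find_drain_calls(sample_log)
print(f"{len(matches)} line(s) mention /drain")
for line in matches:
    print(line)
```

An empty result would suggest that the shutdown did not come through the hook, which points back at Kubernetes or the node rather than the operator.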
I checked the cass-operator logs and found this error for the 0th index pod:
There is the reason:
The management-api could not be contacted for some reason. The logs of the cassandra container might tell more, since they include the management-api logs; the server-system-logger container shows the output of Cassandra itself.
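To compare the two containers side by side, the logs can be pulled per container with `kubectl logs`. The sketch below only builds the command lines and does not run them. The pod name and namespace are hypothetical placeholders, and the container names `cassandra` and `server-system-logger` are assumed from the discussion above.

```python
import shlex
from typing import List


def log_commands(pod: str, namespace: str, containers: List[str]) -> List[List[str]]:
    """Build one `kubectl logs` command per container of the given pod."""
    commands = []
    for container in containers:
        commands.append([
            "kubectl", "logs", pod,
            "-n", namespace,
            "-c", container,
            "--previous",  # logs of the previous, terminated container instance
        ])
    return commands


# Hypothetical pod and namespace names, for illustration only.
for cmd in log_commands("example-pod-0", "example-namespace",
                        ["cassandra", "server-system-logger"]):
    print(shlex.join(cmd))
```

`--previous` only works when the container has been restarted inside the same pod; if the pod was recreated, drop the flag and capture the logs before the next termination.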
Hi Team, we are working on deploying cass-operator (v1.14.0) and CassandraDataCenter on Power architecture. We are able to deploy the operator successfully, but while deploying the CassandraDataCenter, the pod with index 0 terminates repeatedly, while the pods with index 1 and 2 run fine.
We have installed the operator using Helm chart.
Any pointers on this would be helpful. Thank you.
┆Issue is synchronized with this Jira Story by Unito
┆Issue Number: CASS-18