-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CPU spike fault not injecting #82
Comments
How much CPU spike you see after requesting 100%? How you are measuring the spike? |
HI @jv-frechstest can you please provide more details about it? |
Since there are no update closing the ticket, @jv-frechstest please re-open if required. |
Hi. Our team has adopted the mangle product in our organization. Its running as a docker image on the Kubernetes cluster. We have having similar issues with CPU_SPIKE & MEMORY_SPIKE. When we inject the 100% chaos on wither CPU or MEMORY spike, we only see about 60% injection. |
Hi @Anvesh42 , |
@rpraveen-vmware Thanks for the response. So let me sum up the challenges/issues our team is facing with mangle before getting into the execution details and further scenarios that were run.
How do we address the injection of CPU fault at the application level in this scenario? Our approach with all the faults has been to use the labels instead of a specific container so all the POD's matching that label are in play, instead of a specific container.
I would like to hear your suggestions on this? The spike is injected at the infra level with the infrastructure faults and at the specific JVM process with the application fault. It appears to me that application level CPU & MEMORY faults is more specific to a particular JVM process which is good. I mean, if there are, lets say, 5 different services running on a given node within the cluster, running infrastructure fault will impact all of them.
Your response is appreciated. Thanks! |
|
@rpraveen-vmware Thanks for your inputs Praveen. I am working running these scenarios based on the above pointers. Meanwhile, we are looking to upgrade mangle from 3.0 to 3.5. I have been told that 3.5 version has the Log4J issue remediation (an issue that happened very recently, few weeks ago) I did find the image but I do not see info on the changelog i.e. what has been changed from 3.0 to 3.5. I am not sure if Log4J remediation changes have been added in this version. Mind throwing some light? Or is there some other place where I could find the changelog? Thanks |
Hi @Anvesh42 , Mangle 3.5 has following changes:
Thanks, |
@ashrimalivmware @rpraveen-vmware Thanks Avinash. I did run the tests based on @rpraveen-vmware suggestion to use the
It would be nice to connect so we can work together and address/improve these issues. P.S. We use Microsoft Teams in our environment. So we can connect there depending on your availability. Appreciate your response! Anvesh |
@Anvesh42 Yeah it would be better if we can connect, MS teams is fine with us. Please feel free to schedule a call, preferable timings would be post 8:30 AM IST and before 9:30 PM IST. |
@ashrimalivmware @rpraveen-vmware Please find the attached OpenShift DC objects for sample namespaces, DEV03 & DEV70, that we used during the working session to test the CPU_FAULT spike scenarios. DEV03 image properties:- RHEL:7.7-openjdk:1.8.0.232 DEV70 image properties:- RHEL:7.9-openjdk:1.8.0.292 We tested the following scenarios by modifying the resources section in each DC (Deployment Config) object. NOTE:
Command used the measure the CPU spike in the microservice container:- Observations made:-
Please note that whatever fixes or enhancements required for CPU fault, if any, may most likely apply to MEMORY fault as well. |
@rpraveen-vmware @ashrimalivmware Has there been any update on this? I hope your team was able to replicate the scenarios that we went over during our meeting few weeks ago and also as depicted in detail above. |
Hi @Anvesh42 , @ashrimalivmware Limits:
|
Describe the bug
Mangle is deployed on OpenShift container. Trying to inject CPU spike fault to clusterK8 endpoint service.
We are not able to spike the CPU to 90%. The CPU spikes very little compared to target we want.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
CPU spiked to 100% on target instance.
Screenshots
If applicable, add screenshots to help explain your problem.
Logs
If applicable, add application logs/troubleshooting bundle to help in root cause analysis.
Configuration information:
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: