/site/images/avatar.png

Cloud Nuke

Managing resources and objects in your cloud account is an essential aspect of maintaining an efficient and cost-effective infrastructure. Occasionally, you may need to delete specific types of resources and objects to streamline your operations, optimize costs, or meet compliance requirements.

Container App startup trigger in k8s

Application start delay in k8s containers Stress/Load testing applicable. When you want to sping up a lot of pods; the application inside ech pod must start simultaneously with others. The main problem is cluster autoscaling is slow, it means additional nodes will start with delay. AWS specific case The idea here: prescale more nodes than required, so cluster autoscaling will not spend time to spin up new nodes Assume we have EKS with unmanaged Node Groups as ASG.

Handling Network Retries

In a distributed environment, network retries are an essential mechanism to handle temporary failures in communication between different components. When using Docker containers, network retries can help to ensure the reliability and availability of containerized applications.