"Scaling Made Simple: The Complete Guide to AWS Auto Scaling"

1-What is auto-scaling in AWS?

AWS Auto Scaling is a cloud computing service provided by Amazon Web Services (AWS) that automatically adjusts the number of computing resources (such as EC2 instances) allocated to an application or workload based on demand. It works by monitoring predefined metrics, such as CPU utilization or network traffic, and automatically increasing or decreasing the number of resources based on the current demand.

For example, if a sudden increase in user traffic causes a spike in demand for an application, AWS Auto Scaling can automatically provision additional EC2 instances to handle the increased load. Conversely, if demand decreases, AWS Auto Scaling can reduce the number of EC2 instances allocated to the application to save costs.

AWS Auto Scaling helps organizations optimize their resource utilization and maintain high availability and performance, without requiring manual intervention. In simple terms, it enables cloud resources to automatically scale up or down to match the needs of an application or workload, making it easier to manage and cost-efficient.

2-Create an Auto-Scaling Group in AWS:

Log in to the AWS Management Console and open the Amazon EC2 console.

In the navigation pane, choose "Auto Scaling Groups".

On the Auto Scaling Groups page, choose "Create Auto Scaling Group".

Select the launch template or launch configuration that you want to use for the auto-scaling group. A launch template is a newer option and can include additional options compared to a launch configuration

You can also specify the VPC and subnets for the instances.

In Advance Option- Load Balancing -Select "Attach to a new load balancer"

Next, Select "Application load balancer" and load balance scheme as "Internal"

Next, select your choosing subnet options. And as we are creating from the template the port is already defined.

Configure the Auto Scaling group details, such as the name, the minimum and maximum number of instances, and the desired capacity.

Configure the scaling policies that will be used to automatically scale the number of instances based on demand. You can choose from several scaling policies, such as target tracking or simple scaling. Here we choose - "Average CPU utilization - up to 50%"

Optionally, you can configure notifications to alert you when instances are launched or terminated.

Can add SNS (Simple Notification Service) but it's optional.

Select any "Alarm or Alert" you have created in Cloudwatch.

Next, Review

If All Good - Select - "Create Auto Scaling Group"

Now, choose an EC2 instance that will serve as the basis for your auto-scaling group. You can select an existing instance or create a new one.

Select - EC2 Instance - Action - Instance Settings - Attach to the auto-scaling group.

Select an auto-scaling group and Click "Attach".

That's it! With these simple steps, you can create an auto-scaling group in AWS and ensure that your applications are running efficiently and cost-effectively.

Thank you for taking the time to read my blog!

"Scaling Made Simple: The Complete Guide to AWS Auto Scaling"

Table of contents

1-What is auto-scaling in AWS?

2-Create an Auto-Scaling Group in AWS: