Understand the available compute configuration and features

PostedDecember 20, 2024

UpdatedMarch 21, 2025

ByKevin McCaffrey

Understanding the available configuration options and features for your compute service is pivotal to provisioning the right amount of resources. This knowledge ensures that your workload can effectively handle varying loads, ultimately optimizing performance efficiency and user experience.

Best Practices

Evaluate Compute Options Thoroughly

Analyze the compute requirements of your application, considering the workload types (e.g., batch processing, real-time processing).
Investigate various compute services offered by AWS (e.g., EC2, Lambda, Fargate) to match your performance needs.
Utilize AWS Compute Optimizer to receive recommendations for optimal instance types based on historical usage data.

Leverage Auto Scaling

Implement Auto Scaling to dynamically adjust capacity based on traffic patterns, ensuring your application remains performant under varying loads.
Set appropriate scaling policies based on metrics such as CPU utilization, request count, or custom CloudWatch metrics to optimize resource usage.
Regularly review and adjust scaling thresholds and policies to respond to changing workload patterns.

Monitor and Optimize Performance

Use AWS CloudWatch to monitor performance metrics and identify bottlenecks or resource inefficiencies in real-time.
Conduct load testing and performance testing to validate the compute choices and configurations under expected workload conditions.
Consider using AWS X-Ray to analyze and debug distributed applications, helping you understand how different components interact and optimize them accordingly.

Engage in Continuous Review and Iteration

Regularly assess your compute choices as application requirements evolve, particularly after major updates or changes in user traffic patterns.
Involve key stakeholders in periodic reviews to align business needs with technical capabilities, ensuring the compute architecture remains efficient.
Stay informed about new AWS offerings and features that may enhance performance, and evaluate them for potential integration into your architecture.

Questions to ask your team

Have you evaluated the specific compute requirements of your application and workload patterns?
Do you regularly review the performance metrics of your compute resources?
Are you familiar with the different configurations and capacity options for the compute services you are using?
Have you tested the performance of your application with various compute instance types or sizes to determine the most efficient option?
Do you assess the cost-versus-performance trade-offs when selecting compute resources for your workload?
Are you utilizing auto-scaling features to adjust the number of instances based on current demand?
Have you considered serverless options where applicable to optimize resource usage and performance?
Do you have a clear understanding of how instance types and configurations affect your application’s performance characteristics?

Who should be doing this?

Cloud Architect

Evaluate the workload requirements to determine appropriate compute resources.
Analyze different compute services and configurations available in AWS.
Design architectures that optimize performance based on usage patterns.
Collaborate with developers to understand application design impacts on performance.
Monitor resource utilization and performance metrics to identify inefficiencies.
Recommend changes to improve performance efficiency as needed.

DevOps Engineer

Implement infrastructure as code (IaC) to provision the required compute resources.
Continuously monitor performance of the deployed applications and compute resources.
Automate scaling and resource allocation based on demand and performance metrics.
Ensure best practices are followed in configuring compute resources.
Collaborate with the Cloud Architect to refine resource provisioning strategies.

Application Developer

Understand application performance requirements and communicate them effectively.
Optimize application code and design to leverage the selected compute resources.
Work with the architecture team to align application design with compute choices.
Test applications for performance, identifying bottlenecks related to compute resource usage.

What evidence shows this is happening in your organization?

Compute Resources Configuration Guide: A comprehensive guide outlining the various compute resource configurations available in AWS, including EC2 instance types, Auto Scaling options, and serverless architecture choices. This guide helps teams understand how to select the optimal compute resources based on workload requirements.
Performance Efficiency Checklist: A checklist to evaluate different compute resource options and their features for various applications. This tool assists architects in ensuring they have considered all necessary configurations and features to enhance performance efficiency.
Compute Resource Selection Matrix: A matrix that categorizes different compute services based on workload characteristics and usage patterns. This matrix provides visual guidance on selecting the most efficient compute resource for specific application needs.
Performance Optimization Playbook: A playbook detailing strategies for optimizing performance efficiency in workloads by selecting the right compute resources. It includes best practices, configuration tips, and case studies that illustrate effective resource utilization.
AWS Compute Services Dashboard: An interactive dashboard that displays real-time metrics on the performance of various AWS compute services being utilized by the organization. This tool allows teams to monitor efficiency and make data-driven decisions for resource adjustments.

Cloud Services

AWS

Amazon EC2: Provides resizable compute capacity in the cloud, allowing you to select from a variety of instance types and sizes based on your workload requirements.
AWS Lambda: Enables you to run code in response to events without provisioning or managing servers, allowing for efficient scaling based on demand.
Amazon ECS: A fully managed container orchestration service that allows you to deploy and manage containerized applications with the ability to optimize resource utilization.

Azure

Azure Virtual Machines: Provides on-demand scalable computing resources with various VM sizes and configurations to meet different application needs.
Azure Functions: A serverless compute service that automatically scales to meet demand, allowing you to run event-driven code without managing infrastructure.
Azure Kubernetes Service (AKS): Simplifies the deployment, management, and scaling of containerized applications using Kubernetes, optimizing resource efficiency.

Google Cloud Platform

Google Compute Engine: Offers scalable and flexible virtual machine instances that provide various machine types to optimize performance for different workloads.
Google Cloud Functions: A lightweight, serverless compute service that automatically scales in response to incoming requests, optimizing resource allocation.
Google Kubernetes Engine (GKE): A managed environment for deploying containerized applications, offering automated scaling and optimized resource management.

Question: How do you select and use compute resources in your workload?
Pillar: Performance Efficiency (Code: PERF)

Operational Excellence

Determine what your priorities are

Structure your organization to support your business outcomes

Organizational culture to support your business outcomes

Implement observability in your workload

Reduce defects, ease remediation, and improve flow into production

Mitigate deployment risks

Be ready to support a workload

Uilize workload observability

Understand the health of your operations

Manage workload and operations events

Evolve your operations

Security

Securely operate your workload

Manage identities for people and machines

Manage permissions for people and machines

Detect and investigate security events

Protect your network resources

Protect your compute resources

Classify your data

Protect your data at rest

Protect your data in transit

Anticipate, respond to, and recover from incidents

Incorporate and validate the security properties of applications throughout the design, development, and deployment lifecycle

Reliability

Manage service quotas and constraints

Plan your network topology

Design your workload service architecture

Design interactions in a distributed system to prevent failures

Design interactions in a distributed system to mitigate or withstand failures

Monitor workload resources

Design your workload to adapt to changes in demand

Implement change

Back up data

Fault isolation to protect your workload

Design your workload to withstand component failures

Test reliability

Plan for disaster recovery (DR)

Cost Optimization

Implement cloud financial management

Govern usage

Monitor your cost and usage

Decommission resources

Evaluate cost when you select services

Meet cost targets when you select resource type, size and number

Use pricing models to reduce cost

Plan for data transfer charges

Manage demand, and supply resources

Evaluate new services

Evaluate the cost of effort

Performance

Select the appropriate cloud resources and architecture patterns for your workload

Select and use compute resources in your workload

Store, manage, and access data in your workload

Select and configure networking resources in your workload

Support more performance efficiency for your workload

Sustainability

Select Regions for your workload

Align cloud resources to your demand

Take advantage of software and architecture patterns to support your sustainability goals

Take advantage of data management policies and patterns to support your sustainability goals

Select and use cloud hardware and services in your architecture to support your sustainability goals

Implement organizational processes support your sustainability goals