Search for Well Architected Advice
< All Topics
Print

Use automation to proactively remediate performance-related issues

Automation plays a critical role in maintaining the performance efficiency of cloud workloads. By utilizing automated systems to identify and rectify performance bottlenecks, organizations can optimize resource utilization and ensure that workloads remain responsive to user demands.

Best Practices

  • Implement Continuous Monitoring: Employ monitoring tools to continuously assess performance metrics, ensuring you can react before performance issues impact users. Use tools like Amazon CloudWatch to collect and analyze performance data effectively.
  • Establish Key Performance Indicators (KPIs): Define clear KPIs for your application to measure performance and set thresholds that trigger automated remediation actions, leveraging AWS Lambda or similar services to respond to alerts.
  • Utilize Auto Scaling: Employ auto-scaling strategies to adjust compute resources in response to demand changes. This ensures resources are optimized and performance is maintained without over-provisioning.

Supporting Questions

  • What KPIs have you established to monitor workload performance?
  • How often do you review the effectiveness of your automated remediation processes?
  • Are you utilizing tools that provide real-time performance insights?

Roles and Responsibilities

  • DevOps Engineer: Responsible for the implementation of monitoring, alerting, and automation tools that enforce performance efficiency across cloud workloads.
  • Cloud Architect: Designs systems that integrate performance monitoring and remediation strategies to optimize workload efficiency.

Artifacts

  • Performance Monitoring Dashboards: Custom dashboards that visualize key performance metrics and trends in real-time, enabling immediate identification of performance issues.
  • Automated Remediation Scripts: Scripts that automatically adjust resources or configurations in response to predetermined performance thresholds.

Cloud Services

AWS

  • Amazon CloudWatch: A monitoring service providing data and insights on resource utilization, application performance, and operational health, enabling automation in response to performance metrics.
  • AWS Lambda: Allows you to run code in response to triggers such as CloudWatch alarms, enabling automatic remediation actions without server management.
  • Amazon EC2 Auto Scaling: Automatically adjusts the number of EC2 instances in your application based on demand, ensuring optimal performance while minimizing cost.

Question: What process do you use to support more performance efficiency for your workload?
Pillar: Performance Efficiency (Code: PERF)

Table of Contents