Collect compute-related metrics

Understanding and optimizing your compute resources is crucial to achieving better performance efficiency in your workload. By collecting and analyzing compute-related metrics, organizations can make informed decisions about their compute choices and adapt them to meet the specific demands of their applications.

Best Practices

  • Monitor Resource Utilization: Regularly track metrics such as CPU usage, memory consumption, and disk I/O. Use monitoring tools to visualize performance trends over time so that you can adjust resource allocations before utilization becomes a problem.
  • Set Up Alerts for Anomalies: Configure alerts for unusual spikes or drops in performance metrics. Alerts help you identify bottlenecks and inefficiencies quickly, allowing for faster remediation.
  • Perform Regular Reviews: Conduct periodic assessments of your compute metrics to ensure that your resources align with application requirements and usage patterns. This iterative process enables continual optimization of performance.
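
The alerting practice above can be illustrated with a small, self-contained sketch. It flags a utilization sample as anomalous when it deviates strongly from a trailing baseline of earlier samples. The function name `find_anomalies`, the window size, the z-score threshold, and the sample data are all illustrative assumptions, not part of any AWS service or of the original guidance.

```python
from statistics import mean, stdev


def find_anomalies(samples, window=5, z_threshold=3.0):
    """Return (index, value) pairs that deviate strongly from the trailing window.

    Each sample is compared against the mean and standard deviation of the
    `window` samples immediately before it. All parameters are illustrative.
    """
    anomalies = []
    for i in range(window, len(samples)):
        baseline = samples[i - window:i]
        mu = mean(baseline)
        sigma = stdev(baseline)
        if sigma == 0:
            # A perfectly flat baseline gives no scale for a z-score; skip it.
            continue
        if abs(samples[i] - mu) / sigma > z_threshold:
            anomalies.append((i, samples[i]))
    return anomalies


# Hypothetical CPU utilization samples (percent), one per collection interval.
cpu_percent = [20, 22, 21, 23, 19, 22, 20, 21, 95, 22]
spikes = find_anomalies(cpu_percent)
```

A trailing window is used here so that a single spike does not inflate the baseline it is judged against. A managed monitoring service would normally replace this logic in production; the sketch only shows the idea behind an anomaly alert.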

Supporting Questions

  • Are there any performance bottlenecks indicated by the metrics?
  • How do the resource utilization trends compare to our expected patterns?

Roles and Responsibilities

  • DevOps Engineer: Responsible for implementing and maintaining monitoring tools, as well as analyzing the collected metrics to provide actionable insights.
  • Application Architect: Oversees the architecture’s design and usage patterns, ensuring that compute resources are aligned with application needs and performance objectives.

Artifacts

  • Monitoring Dashboard: An interactive dashboard that visualizes compute metrics, helping teams track performance in real time and diagnose issues more easily.

Cloud Services

AWS

  • Amazon CloudWatch: A monitoring and observability service that provides data and insights for AWS resources, enabling organizations to collect metrics and set alarms on resource utilization.
  • AWS Cost Explorer: Allows you to view and analyze your AWS costs and usage, helping to correlate compute performance with cost efficiency over time.
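
As a hedged sketch of how a CloudWatch alarm on compute utilization might be defined, the snippet below builds the parameters for the boto3 `put_metric_alarm` call against the standard `AWS/EC2` `CPUUtilization` metric. The helper names, the alarm name format, the 80% threshold, and the instance ID are hypothetical values chosen for illustration; adjust them to your workload.

```python
def build_cpu_alarm_params(instance_id, threshold_percent=80.0,
                           period_seconds=300, evaluation_periods=2):
    """Build keyword arguments for CloudWatch put_metric_alarm.

    The defaults (80% average CPU over two 5-minute periods) are assumptions,
    not recommendations from the original guidance.
    """
    return {
        "AlarmName": f"high-cpu-{instance_id}",  # hypothetical naming scheme
        "Namespace": "AWS/EC2",
        "MetricName": "CPUUtilization",
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Statistic": "Average",
        "Period": period_seconds,
        "EvaluationPeriods": evaluation_periods,
        "Threshold": threshold_percent,
        "ComparisonOperator": "GreaterThanThreshold",
    }


def create_cpu_alarm(instance_id):
    """Create the alarm; requires boto3 and configured AWS credentials."""
    import boto3  # imported lazily so the builder above works without the SDK

    cloudwatch = boto3.client("cloudwatch")
    cloudwatch.put_metric_alarm(**build_cpu_alarm_params(instance_id))


# Placeholder instance ID, for illustration only.
params = build_cpu_alarm_params("i-0123456789abcdef0")
```

Separating the parameter builder from the API call keeps the alarm definition easy to review and unit test without touching an AWS account.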

Question: How do you select and use compute resources in your workload?
Pillar: Performance Efficiency (Code: PERF)
