Search for Well Architected Advice
< All Topics
Print

Use runbooks for standard activities such as deployment

Controlled changes are essential for consistent deployment of new functionality and maintenance of workloads. By utilizing runbooks, teams can ensure that changes are executed in a predictable manner, minimizing risks associated with uncontrolled modifications.

Best Practices

  • Document Standard Operating Procedures: Create comprehensive runbooks that outline step-by-step processes for common tasks. This practice minimizes errors and accelerates deployments by providing clear guidance to the team, ensuring all changes are manageable and traceable.
  • Automate Runbook Execution: Where possible, automate runbooks to eliminate human error. This can be achieved through scripts or integrations with CI/CD tools, allowing for faster, more reliable deployments that comply with organizational standards.
  • Regularly Review and Update Runbooks: Runbooks should evolve with your environment and workflows. Regular reviews help ensure that the runbooks remain relevant, reflecting any changes in technology or business processes, thus maintaining their effectiveness.

Supporting Questions

  • Are all standard activities documented to ensure predictability?
  • How frequently are the runbooks reviewed and updated?
  • Is there a clear ownership for maintaining runbooks?

Roles and Responsibilities

  • DevOps Engineer: Responsible for creating, updating, and maintaining runbooks to automate standard activities and ensure operational consistency.
  • Quality Assurance Team: Ensures that runbooks are followed correctly during deployments and that they meet quality standards by performing regular audits.
  • Release Manager: Oversees the deployment of changes, ensuring that runbooks are executed correctly while managing the overall deployment process.

Artifacts

  • Runbook Template: A standardized format for documenting runbooks, ensuring consistency and clarity in the steps needed for various operations.
  • Change Log: A record of all changes made through runbooks, enabling tracking of modifications and their effects over time.

Cloud Services

AWS

  • AWS Systems Manager: Facilitates the creation and automation of runbooks to manage operational tasks across AWS environments, enhancing reliability and efficiency.
  • AWS CodeDeploy: Automates application deployments, integrating with runbooks to maintain a high degree of reliability during changes.
  • AWS Lambda: Enables the execution of scripts within runbooks without the need for server management, supporting seamless and consistent changes.

Question: How do you implement change?
Pillar: Reliability (Code: REL)

Table of Contents