Search for Well Architected Advice
< All Topics
Print

Minimize data movement across networks

Minimizing data movement is crucial for enhancing operational efficiency and reducing carbon footprints in cloud workloads. By limiting the amount of data transferred across networks, you can optimize resource usage and lower energy consumption.

Best Practices

Optimize Data Storage Access and Networking

  • Utilize shared file systems or object storage solutions to centralize access to common data, reducing the need for data duplication across multiple locations. This minimizes the volume of data movement and the associated energy consumption.
  • Implement caching strategies to reduce accessing remote data, thereby decreasing the frequency of data transfers over the network.
  • Evaluate and choose appropriate storage classes that align with access patterns and performance requirements, such as Amazon S3 for infrequent access or cold storage options.
  • Leverage tools such as AWS DataSync to automate the transfer of data between on-premises and cloud storage, optimizing network traffic and resource use.
  • Regularly audit your data access patterns and delete or archive unused data to ensure that only necessary data is still being accessed and moved.

Questions to ask your team

  • Have you identified areas within your data management where data movement can be minimized?
  • Are you currently using shared file systems or object storage solutions to centralize access to common data?
  • What strategies do you have in place to reduce unnecessary data transfers between regions or services?
  • How do you monitor and measure the network resources utilized for data movement in your workload?
  • Are you regularly reviewing and optimizing your network configurations to align with sustainability goals?
  • Do you have lifecycle policies to transition data to more efficient storage solutions based on usage patterns?

Who should be doing this?

Data Architect

  • Design and implement data management policies that align with sustainability goals.
  • Evaluate and select storage technologies that minimize environmental impact.
  • Monitor data usage patterns and recommend efficient storage configurations.

Cloud Engineer

  • Implement shared file systems or object storage solutions to reduce data movement.
  • Ensure proper configurations are in place to optimize network resource usage.
  • Collaborate with the Data Architect to integrate sustainability practices into infrastructure deployment.

Data Analyst

  • Analyze data usage to identify opportunities for lifecycle management and resource optimization.
  • Provide insights on data retention policies to minimize unnecessary storage.
  • Report on the sustainability impact of data management practices to stakeholders.

Compliance Officer

  • Ensure data management practices comply with sustainability regulations and standards.
  • Review policies for data lifecycle management to prevent retention of unnecessary data.
  • Conduct audits to assess the effectiveness of data management towards sustainability goals.

What evidence shows this is happening in your organization?

  • Network Usage Reduction Strategy: A structured approach detailing best practices and guidelines to minimize data movement across networks. This strategy outlines design patterns for using shared file systems or object storage instead of duplicating data, helping to optimize network resources and reduce environmental impact.
  • Data Transfer Checklist: A detailed checklist for administrators to evaluate and reduce unnecessary data migration activities. It covers identifying required data, consolidating data access patterns, and using common storage services, ensuring minimal network utilization.
  • Shared Storage Playbook: A practical guide that provides configuration steps and implementation details for setting up shared file systems or object storage solutions to centralize data access. This playbook advises on versioning, replication policies, and how to avoid redundant data transfers.

Cloud Services

AWS

  • Amazon S3: Amazon S3 provides scalable object storage that can help reduce data movement by enabling easy access to shared data while optimizing costs according to access frequency.
  • AWS DataSync: AWS DataSync automates data transfer between on-premises storage and AWS, helping to minimize data movement across the network efficiently.
  • Amazon FSx: Amazon FSx provides fully managed shared file systems, which decrease the need for repeated data movement by allowing direct access to common data.

Azure

  • Azure Blob Storage: Azure Blob Storage offers massively scalable object storage solutions, which help minimize data movement by providing easy access to shared data for applications.
  • Azure File Sync: Azure File Sync centralizes your file shares in Azure Files and enables flexible access, thus minimizing data movement across networks.

Google Cloud Platform

  • Google Cloud Storage: Google Cloud Storage provides unified object storage that supports data accessibility and uses less network bandwidth, promoting sustainability.
  • Filestore: Filestore is a managed file storage service that offers shared file systems for applications, reducing the need for data duplication and movement.
Table of Contents