Multi-Region Latency Reduction Strategies

Multi-region Atlas deployments can be used to enhance performance by reducing latency. The following sections list the factors and configuration choices you can make to reduce latency.

Physical Distance

Physical distance is the primary cause of latency. The distance between users and your application, between your application and your data, and between cluster nodes all impact system latency and application performance.

To reduce latency for read operations, it's crucial to place both your application and data geographically closer to users.

Replication Configuration

Replication is the copying of data from the primary node to secondary nodes. How you configure replication contributes to latency. Consider the following factors:

Write Concern Levels: There is a trade-off between write durability and write latency. The write concern level you configure (for example, w: "majority") defines replication across multiple data centers, potentially increasing latency for more durable writes. However, this ensures that all data is committed to secondaries and protects you from rolling back writes in the event of failover.
Order of Regions: The order of regions in your configuration can determine the priority for the primary node location, which can impact write latency depending on where your users are located. For example, if most of your users are in Asia, you should set the highest priority for the Asia region so that it is elected the primary.
Mirrored Reads: Mirrored reads reduce the impact of primary elections following an outage by pre-warming the caches on the secondary nodes. For more information, see Mirrored Reads.
Read Preference: By default, applications send read operations to the primary node. However, you can configure the read preference to send read operations to secondary nodes. By doing so, you ensure reads go to the geographically closest cluster.
Important
Keep in mind that there is the possibility of a secondary node returning stale data due to replication lag.
Data Distribution: Distributing data across regions by using replica sets or sharded clusters is an effective approach when your data is geographically oriented. For example, if you have data that is only read from the EU, and other data that is only read in North America, you can create shards that distribute that data appropriately.

Network Configuration

You can increase security and further reduce latency using the following network connectivity options:

Private Endpoints: Private endpoints establish direct and secure connections between your application's virtual network and your Atlas cluster, potentially reducing network hops and improving latency.
VPC Peering: Configure VPC peering in your replica sets to allow applications to connect to peered regions. In the event of a failover, VPC peering provides a way for an application server to detect a new primary node and route traffic accordingly.

Data Modeling and Query Optimization

The speed at which your application accesses data contributes to latency. Good data modeling and query optimization can improve data access speeds. For example, you can:

Reduce Document Size: Consider shortening field names and value lengths to decrease the amount of data transferred over the network.
Optimize Query Patterns: Use indexes effectively to minimize the amount of data that needs to be read across regions.

Monitoring and Testing Latency

Atlas provides the Real-Time Performance Panel (RTPP) to observe latency metrics for different regions. You can also implement application-level monitoring to track end-to-end latency to and from the application. Before final production deployment, we suggest conducting performance testing under the various multi-region scenarios to identify and address latency bottlenecks.

Back

Multi-Region

Multi-Cloud

Physical Distance

Replication Configuration

Important

Network Configuration

Data Modeling and Query Optimization

Monitoring and Testing Latency