Skip to main content

Build Balanced Zones (Spatial Statistics Tools)

Summary

Creates spatially contiguous zones in a study area using a genetic growth algorithm based on specified criteria.

You can create zones that contain an equal number of features, zones that are similar based on a set of attribute values, or both. You can also select zones with approximately equal areas, that are as compact as possible, and that maintain consistent summary statistics of other variables.

Learn more about how Build Balanced Zones works

Usage

  • To create zones, choose Attribute target, Number of zones and attribute target, or Defined number of zones for the Zone Creation Method parameter. When you choose Attribute target, the tool will create zones based on the target values of one or more specified variables. The Number of zones and attribute target option balances the attributes over the specified number of zones. The Defined number of zones option creates zones based on feature counts. The Custom seed locations option allows you to specify locations from which zones will grow.

  • For custom seed locations, you must provide a point feature class in the Seed Location Points parameter indicating the locations of the starting seeds. For example, to create emergency response zones from census tracts, you can provide the locations of hospitals or fire departments as the seed location points, and the census tract zones will grow around these locations. If the input features are polygons, then any polygon containing a seed location point will be used to grow a zone. For point input features, the closest input point to each seed location point will be used to grow zones. Usually, there will be one zone for each seed location point (for example, one emergency response zone per hospital). However, the zone count can be different than the number of seed location points if the same input point feature is closest to multiple seed location points.

  • This tool can take input points or polygons.

  • If multiple variables are specified for the Zone Building Criteria parameter, you can provide weights for each criteria. Weights are applied proportionally to the value specified in the weights field.

  • This tool supports parallel processing to create generations for the algorithm and uses 50 percent of available processors by default. The number of processors can be increased or decreased using the Parallel Processing Factor environment.

Parameters

Label Explanation Data type

Input Features

The feature class or feature layer that will be aggregated into zones.

Feature Layer

Output Features

The output feature class indicating which features are aggregated into each zone. The feature class will be symbolized by the ZONE_ID field and will contain fields displaying the values of each criteria that you specify.

Feature Class

Zone Creation Method

Specifies the method that will be used to create each zone. Zones grow until all specified criteria are satisfied.

  • Attribute targetZones will be created based on target values of one or multiple variables. The sum of each attribute must be specified in the Zone Building Criteria With Target parameter, and each zone will grow until the sum of the attributes exceeds these values. For example, you can use this option to create zones that each have at least 100,000 residents and 20,000 family homes.

  • Number of zones and attribute targetA specified number of zones will be created while keeping the sum of an attribute approximately equal within each zone. The number of zones must be specified in the Target Number of Zones parameter. The attribute sum within each zone is equal to the sum of the total attribute divided by the number of zones.

  • Defined number of zonesA specified number of zones will be created that are each composed of approximately the same number of input features. The number of zones must be specified in the Target Number of Zones parameter.

  • Custom seed locationsThe seed locations for each zone will be determined from a point feature class. Provide the feature class in the Seed Location Points parameter.

String

Target Number of Zones

(Optional)

The number of zones that will be created.

Long

Zone Building Criteria With Target

(Optional)

Specifies the variables that will be considered, as well as their target values and optional weights. The default weight is 1, and each variable contributes equally unless they are changed.

Value table columns:

  • VariableThe variable to consider when creating zones.

  • SumThe threshold for creating zones.

  • WeightThe weight of the variable specified. The weights will be applied proportional to the value specified. For example, if three variables are specified with weights of 10, 40, and 30, the importance of each variable will be 1/8, 1/2, and 3/8, respectively.

Value Table

Zone Building Criteria

(Optional)

Specifies the variables that will be considered and, optionally, their weights. The default weight is 1, and each variable contributes equally unless changed.

Value table columns:

  • VariableThe variable to consider when creating zones.

  • WeightThe weight of the variable specified. The weights will be applied proportional to the value specified. For example, if three variables are specified with weights of 10, 40, and 30, the importance of each variable will be 1/8, 1/2, and 3/8, respectively.

Value Table

Spatial Constraints

(Optional)

Specifies how neighbors will be defined while the zones grow. Zones can only grow into new features that are neighbors of at least one of the features already in the zone. If the input features are polygons, the default spatial constraint is Contiguity edges corners. If the input features are points, the default spatial constraint is Trimmed Delaunay triangulation.

  • Contiguity edges onlyFor zones containing contiguous polygon features, only polygons that share an edge will be part of the same zone.

  • Contiguity edges cornersFor zones containing contiguous polygon features, only polygons that share an edge or a vertex will be part of the same zone.

  • Trimmed Delaunay triangulationFeatures in the same zone will have at least one natural neighbor in common with another feature in the zone. Natural neighbor relationships are based on a trimmed Delaunay Triangulation. Conceptually, Delaunay Triangulation creates a nonoverlapping mesh of triangles from feature centroids. Each feature is a triangle node, and nodes that share edges are considered neighbors. These triangles are then clipped to a convex hull to ensure that features cannot be neighbors with any features outside of the convex hull. This is the default.

  • Get spatial weights from fileSpatial, and, optionally, temporal relationships will be defined by a specified spatial weights file (.swm). Create the spatial weights matrix using the Generate Spatial Weights Matrix tool or the Generate Network Spatial Weights tool. The path to the spatial weights file is specified by the Spatial Weights Matrix File parameter.

String

Spatial Weight Matrix File

(Optional)

The path to a file containing spatial weights that define spatial and, optionally, temporal relationships among features.

File

Zone Characteristics

(Optional)

Specifies the characteristics of the zones that will be created.

  • Equal areaZones with total area as similar as possible will be created.

  • CompactnessZones with more closely-packed (compact) features will be created.

  • Equal number of featuresZones with an equal number of features will be created.

String

Attribute to Consider

(Optional)

Specifies attributes and statistics to consider in the selection of final zones. You can homogenize attributes based on their sum, average, median, or variance. For example, if you are creating zones based on home values and want to balance the average total income within each zone, the solution with the most equal average income across zones will be preferred.

Value table columns:

  • VariableThe variable to consider when selecting the optimal zone solution.

  • FunctionThe mathematical way in which each attribute will be balanced within zones.

    • SumThe attribute will be balanced based on the sum.

    • AverageThe attribute will be balanced based on the average.

    • VarianceThe attribute will be balanced based on the variance.

    • MedianThe attribute will be balanced based on the median.

Value Table

Distance to Consider

(Optional)

The feature class that will be used to homogenize the total distance per zone. The distance is calculated from each of the input features to the closest feature provided in this parameter. This distance is then used as an additional attribute constraint when selecting the final zone solution. For example, you can create police patrol districts that are each approximately the same distance from the closest police station.

Feature Layer

Categorical Variable to Maintain Proportions

(Optional)

The categorical variable to be considered for zone proportions.

Field

Proportion Method

(Optional)

Specifies the type of proportion that will be maintained based on the chosen categorical variable.

  • Maintain within proportionEach zone will maintain the same proportions as the overall study area for the given categorical variable. For example, given a categorical variable that is 60% Type A and 40% Type B, this method will prefer zones that are composed of approximately 60% Type A features and 40% Type B features.

  • Maintain overall proportionZones will be created so that the overall proportions of category predominance by zone matches the proportions of the given categorical variable for the entire dataset. For example, given a categorical variable that is 60% Type A and 40% Type B, this method will prefer solutions where 60% of the zones are predominantly Type A features and 40% of the zones are predominantly Type B features.

String

Population Size

(Optional)

The number of randomly generated initial seeds. For larger datasets, increasing this number will increase the search space and the probability of finding a better solution. The default is 100.

Long

Number of Generations

(Optional)

The number of times the zone search process will be repeated. For larger datasets, increasing the number is recommended to find an optimal solution. The default is 50 generations.

Long

Mutation Factor

(Optional)

The probability that an individual's seed values will be mutated to a new set of seeds. Mutation increases the search space by introducing variability of the possible solutions in every generation and allows for faster convergence to an optimal solution. The default is 0.1.

Double

Output Convergence Table

(Optional)

The table containing the total fitness score for the best solution found in every generation as well as the fitness score for the individual zone constraints.

Table

Output Polygon Zones

(Optional)

Creates polygons around each zone by creating Thiessen polygons and combining all polygons in the same zone. This is useful when the input feature are points but you need to create polygon zones that completely cover an area. This parameter only applies when the input features and output zones are points.

Feature Class

Seed Location Points

(Optional)

The point features representing the seed locations that will be used to grow zones. For example, to create emergency response zones from census tracts, you can provide the locations of hospitals or fire departments as the seed location points, and the census tract zones will grow around these locations.

Feature Layer

Environments

Output Coordinate System, Parallel Processing Factor, Random number generator

Licensing information

  • Basic: Yes
  • Standard: Yes
  • Advanced: Yes