blvim: Boltzmann–Lotka–Volterra Interaction Model

blvim implements A. Wilson’s Boltzmann–Lotka–Volterra (BLV) interaction model. The model is described in Wilson, A. (2008), “Boltzmann, Lotka and Volterra and spatial structural evolution: an integrated methodology for some dynamical systems”, J. R. Soc. Interface, 5:865–871.

The package’s primary goal is to provide a fast implementation of the BLV model, complete with a collection of tools designed to explore the results through statistical summaries and graphical representations. The secondary goal is to facilitate the systematic assessment of how model parameters impact the results, again using summaries and graphical representations (see vignette("grid") for details on this aspect).

Installation

You can install blvim from CRAN with:

install.packages('blvim')

You can install the development version from R-universe with:

install.packages('blvim', repos = c('https://fabrice-rossi.r-universe.dev'))

Spatial interaction models

Spatial interaction models aim to estimate flows between locations, such as workers commuting from residential zones to employment zones. The blvim package focuses on the maximum entropy models developed by Alan Wilson. See vignette("theory") for the theoretical background.

In practice, if we have \(`n`\) origin locations and \(`p`\) destination locations, the goal is to compute a flow matrix \(`(Y_{ij})_{1\leq i\leq n, 1\leq j\leq p}`\), where \(`Y_{ij}`\) is the flow from origin \(`i`\) to destination \(`j`\). This computation relies on characteristics of the origin and destination locations, along with a matrix of exchange difficulties, known as a cost matrix, \(`(c_{ij})_{1\leq i\leq n, 1\leq j\leq p}`\). For example, \(`c_{ij}`\) can represent the distance between origin \(`i`\) and destination \(`j`\).

Usage

The package is loaded in a standard way.

library(blvim)

Input data

To compute a spatial interaction model with blvim, you need at least a cost matrix. The package comes with distance data for a selection of large French cities. We use the 30 largest ones as origin locations and the 20 smallest ones as destination locations. The cost matrix is the distance between the cities (in meters).

## 30 largest cities
origins <- french_cities[1:30, c("th_longitude", "th_latitude")]
## 20 smallest cities
destinations <- french_cities[
  (nrow(french_cities) - 19):nrow(french_cities),
  c("th_longitude", "th_latitude")
]
## cost matrix
cost_matrix <-
  french_cities_distances[
    1:30,
    (nrow(french_cities) - 19):nrow(french_cities)
  ]
rownames(cost_matrix) <- french_cities[1:30, "name"]
colnames(cost_matrix) <-
  french_cities[(nrow(french_cities) - 19):nrow(french_cities), "name"]

Additionally, since we focus on production-constrained models, we must specify the production for each origin location (a vector of positive values \(`(X_i)_{1\leq i\leq n}`\)). Here, we assume a common unitary production.

X <- rep(1, nrow(origins))

Finally, the simple static model requires an attractiveness value for each destination location, a vector of positive values \(`(Z_j)_{1\leq j\leq p}`\). We again assume a common unitary attractiveness.

Z <- rep(1, nrow(destinations))

We could use the population of the cities as production constraints for instance.

Static models

In Wilson’s production-constrained maximum entropy model, the flows are given by


Y_{ij} = \frac{X_iZ_j^{\alpha}\exp(-\beta c_{ij})}{\sum_{k=1}^pZ_k^{\alpha}\exp(-\beta c_{ik})},

where \(`\alpha`\) is a return-to-scale parameter and \(`\beta`\) is the inverse of a cost scale parameter. Note that the flow matrix is production-constrained, meaning that the total outgoing flow from any origin location is equals the production of that location:


\forall i,\quad X_i=\sum_{j=1}^{p}Y_{ij}.

The model is obtained using the static_blvim() function:

a_model <- static_blvim(cost_matrix, X, alpha = 1.1, beta = 1 / 500000, Z)
a_model
#> Spatial interaction model with 30 origin locations and 20 destination locations
#> • Model: Wilson's production constrained
#> • Parameters: return to scale (alpha) = 1.1 and inverse cost scale (beta) =
#> 2e-06

Several functions are provided to extract parts of the result. In particular flows() returns the flow matrix \(`Y`\).

a_model_flows <- flows(a_model)

which can be displayed using, for instance, the image() function.

par(mar = rep(0.1, 4))
image(t(a_model_flows)[, 30:1],
  col = gray.colors(20, start = 1, end = 0),
  axes = FALSE,
  frame = TRUE
)

A matrix representation of the flows. It shows no particular structure in the flow apart from a dominating flow from Nice to Cagnes-sur-Mer.

In this representation, each row shows the flows from one origin location to all destination locations. The package also provides a ggplot2::autoplot() function, which can be used as follows:

library(ggplot2)
autoplot(a_model, "full") +
  scale_fill_gradient(low = "white", high = "black") +
  coord_fixed()

A matrix representation of the flows. The color scheme is more contrasted than in the previous figure, but does not show much more strucutre. The dominating flow is Nice to Cagnes-sur-Mer, but the figure shows that there are additional important flows.

b_model <- static_blvim(cost_matrix, X, alpha = 1.1, beta = 1 / 100000, Z)
b_model
#> Spatial interaction model with 30 origin locations and 20 destination locations
#> • Model: Wilson's production constrained
#> • Parameters: return to scale (alpha) = 1.1 and inverse cost scale (beta) =
#> 1e-05

autoplot(b_model) +
  scale_fill_gradient(low = "white", high = "black") +
  coord_fixed()

A matrix representation of the flows. The flows are much more concentrated than with the previous choice of parameters. Many destination locations do not receive any flow. Most of the flow goes to Vaulx-en-Vellin, Arles, Albi and Cagnes-sur-mer.

As the two figures above exemplify, different values of the parameters \(`\alpha`\) and \(`\beta`\) result in more or less concentrated flows.

Dynamic models

A. Wilson’s Boltzmann–Lotka–Volterra (BLV) interaction model builds upon the production-constrained maximum entropy model. The core idea is to update the attractiveness of the destination locations based on their incoming flows.

Ideally, we aim for the following condition to hold in the limit:


Z_j =\sum_{i=1}^{n}Y_{ij},

where the flows are given by the equations above. The model is estimated using the blvim() function as follows.

a_blv_model <- blvim(cost_matrix, X, alpha = 1.1, beta = 1 / 500000, Z)
a_blv_model
#> Spatial interaction model with 30 origin locations and 20 destination locations
#> • Model: Wilson's production constrained
#> • Parameters: return to scale (alpha) = 1.1 and inverse cost scale (beta) =
#> 2e-06
#> ℹ The BLV model converged after 4800 iterations.

Notice that we start with some initial values of the attractiveness, but the final values are different. These final values can be obtained using the attractiveness() function (and visualised here using a bar plot).

par(mar = c(0.1, 4, 1, 0))
a_final_Z <- attractiveness(a_blv_model)
barplot(a_final_Z)

A bar plot representation of the attractivenesses of the destination locations. Only one bar is visible: Vaulx-en-Vellin dominates the destination locations. In this particular example, one destination location acts as a global attractor of all the flows. This pattern is also visible in the final flow matrix.

autoplot(a_blv_model) +
  scale_fill_gradient(low = "white", high = "black")

A matrix representation of the flows. The matrix has only one black column, showing that all the flow goes to Vaulx-en-Vellin.

The autoplot() function can also be used to show the destination flows or the attractivenesses values:

autoplot(a_blv_model, "attractiveness", with_names = TRUE) +
  coord_flip()

A bar plot representation of the attractivenesses of the destination locations. Only one bar is visible: Vaulx-en-Vellin dominates the destination locations.

Naturally, the results are strongly influenced by the parameters, as shown in this second example.

b_blv_model <- blvim(cost_matrix, X, alpha = 1.1, beta = 1 / 50000, Z)
b_blv_model
#> Spatial interaction model with 30 origin locations and 20 destination locations
#> • Model: Wilson's production constrained
#> • Parameters: return to scale (alpha) = 1.1 and inverse cost scale (beta) =
#> 2e-05
#> ℹ The BLV model converged after 5700 iterations.

autoplot(b_blv_model, "attractiveness", with_names = TRUE) +
  coord_flip()

autoplot(b_blv_model, with_names = TRUE) +
  scale_fill_gradient(low = "white", high = "black") +
  theme(axis.text.x = element_text(angle = 90))

A matrix representation of the flows. The figure shows essentially the same things as the previous bar plot. In most of the cases, the flow outgoing from a origin location goes to a unique destination location. The main exception is the pair Saint-Ouen-sur-Seine and Saint-Quentin who share incoming flows.

bvlim offers a collection of graphical representations that can leverage the data associated to the origin and destination locations. For instance, we can display the full flows using geographical coordinates of the cities.

origin_positions(b_blv_model) <- as.matrix(origins)
destination_positions(b_blv_model) <- as.matrix(destinations)
autoplot(b_blv_model,
  with_positions = TRUE,
  show_destination = TRUE,
  show_production = TRUE,
  cut_off = 0
) +
  scale_linewidth(range = c(0, 1.5)) +
  coord_sf(crs = "epsg:4326") +
  scale_color_discrete(type = c("darkorange", "blueviolet"))