The IDSA model is based on the SPADE model, which extends the power of spatial determinant of the SPADE model to multivariable interaction and takes spatial fuzzy overlay into account.
In this vignette, we used the same data as the SPADE vignette to demonstrate the use of the IDSA model.
library(sf)
library(tidyverse)
library(magrittr)
library(gdverse)
ushi = read_sf(system.file('extdata/USHI.gpkg',package = 'gdverse'))
ushi
## Simple feature collection with 1037 features and 19 fields
## Geometry type: POLYGON
## Dimension: XY
## Bounding box: xmin: 295713 ymin: 3784916 xmax: 324202.2 ymax: 3804669
## Projected CRS: WGS 84 / UTM zone 49N
## # A tibble: 1,037 × 20
## LPI LSI GAR DAR WAR TER NDVI NDWI DTH DTP DTR DTW BH BD
## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 57.2 10.5 0.630 0.273 0.00219 0.979 0.497 -0.356 5.66 1.08 0.934 1.16 3.16 0.635
## 2 85.7 2.35 0.905 0.0952 0 0.977 0.514 -0.288 5.66 1.41 0.0807 1.57 0.310 0.139
## 3 41.2 7.85 0.436 0.449 0.000672 0.972 0.365 -0.296 4.38 1.88 0.516 2.37 3.99 0.671
## 4 39.1 5.40 0.534 0.377 0 0.966 0.395 -0.299 4.25 1.97 0.442 2.81 4.63 0.634
## 5 32.0 8.04 0.405 0.437 0 0.968 0.354 -0.302 3.65 1.34 0.615 2.55 4.07 0.642
## 6 58.3 7.91 0.332 0.615 0.000377 0.966 0.312 -0.271 3.69 0.700 0.592 2.25 7.91 0.777
## 7 98.4 3.34 0.984 0.0161 0 0.948 0.451 -0.248 5.39 1.36 0.124 3.64 0.216 0.111
## 8 44.6 17.6 0.416 0.502 0.000672 0.949 0.365 -0.295 5.11 1.20 0.926 2.50 6.49 0.747
## 9 35.9 9.81 0.464 0.467 0.00134 0.936 0.379 -0.309 7.10 2.01 1.03 2.75 5.49 0.692
## 10 44.3 9.49 0.403 0.504 0.000547 0.926 0.310 -0.278 8.51 2.68 0.658 3.20 3.27 0.601
## # ℹ 1,027 more rows
## # ℹ 6 more variables: RL <dbl>, RFD <dbl>, SVF <dbl>, FAR <dbl>, SUHI <dbl>,
## # geom <POLYGON [m]>
There are originally 18 explanatory variables in the
ushi
tibble. If these 18 variables are used together, IDSA
needs to be calculated about 2.62143 × 105 times, which is
very time-consuming, and we need to reduce the dimension of the data or
select appropriate variables.
Here we combine the understanding of the data and the diffusion maps method to generate the data after dimensionality reduction. Specifically, we combine data of the same class into one class using diffusion maps based on the classification of explanatory variables.
In SPADE vignette we proved the invalidity of WAR
to
explain SUHI
, which is removed here.
landscape = c("LPI","LSI","GAR","DAR")
dist_cold_heat = c("DTH","DTP","DTR","DTW")
urbanbuilt_form = c("TER","NDVI","NDWI","BH","BD","RL","RFD","SVF","FAR")
dm_n21 = \(xvecname){
return(
ushi %>%
sf::st_drop_geometry() %>%
dplyr::select(dplyr::all_of(xvecname)) %>%
{Rdimtools::do.dm(as.data.frame(.),1)} %>%
{.$Y} %>%
{.[,1]} %>%
gdverse::rescale_vector(0,1)
)
}
newdata = tibble::tibble(
SUHI = ushi$SUHI,
landscape = dm_n21(landscape),
dist_cold_heat = dm_n21(dist_cold_heat),
urbanbuilt_form = dm_n21(urbanbuilt_form),
)
newdata
## # A tibble: 1,037 × 4
## SUHI landscape dist_cold_heat urbanbuilt_form
## <dbl> <dbl> <dbl> <dbl>
## 1 1.76 1.00 0.726 0.200
## 2 0.519 1.00 0.722 0.00319
## 3 3.04 1.00 0.392 0.272
## 4 3.22 1.00 0.344 0.345
## 5 3.47 1.00 0.216 0.273
## 6 4.71 1.00 0.225 0.644
## 7 1.24 1.00 0.508 0.00117
## 8 4.30 1 0.554 0.544
## 9 3.76 1.00 0.860 0.443
## 10 4.23 1.00 0.928 0.197
## # ℹ 1,027 more rows
We use the same spatial weight matrix as SPADE vignette.
idsa.model = idsa(SUHI ~ ., data = newdata, wt = wt, cores = 6)
idsa.model
## *** Interactive Detector For Spatial Associations
##
## | varibale | PID |
## |:--------------------------------------------:|:-----------:|
## | landscape ∩ dist_cold_heat ∩ urbanbuilt_form | 0.455535905 |
## | dist_cold_heat ∩ urbanbuilt_form | 0.408837104 |
## | urbanbuilt_form | 0.345728704 |
## | landscape ∩ urbanbuilt_form | 0.330273768 |
## | dist_cold_heat | 0.123441382 |
## | landscape ∩ dist_cold_heat | 0.071538071 |
## | landscape | 0.004183838 |
##
## --------- IDSA model performance evaluation: --------
## * Number of overlay zones : 11
## * Percentage of finely divided zones : 0.0009643202
## * Number of individual explanatory variables : 3
##
## ## Different of response variable between a pair of overlay zones:
##
## | zone1st | zone2nd | Risk |
## |:--------------------:|:--------------------:|:----:|
## | zonedist_cold_heat_2 | zonedist_cold_heat_3 | No |
## | zonedist_cold_heat_2 | zonedist_cold_heat_4 | No |
## | zonedist_cold_heat_2 | zonedist_cold_heat_8 | Yes |
## | zonedist_cold_heat_2 | zonedist_cold_heat_9 | No |
## | zonedist_cold_heat_2 | zonelandscape_1 | No |
##
## #### Only the first five pairs of overlay zones are displayed! ####
plot(idsa.model)
The risk detection outcomes, as determined by IDSA, can be graphically represented on a map.
idsa_risk = idsa.model$risk2
ushi$risk = idsa_risk
ggplot(data = ushi) +
geom_sf(aes(fill = risk),
alpha = .65) +
ggplot2::scale_fill_manual(values = c("No" = "#7fdbff",
"Yes" = "#ffa500")) +
theme_bw()
This vignette is a special case because the explanatory variables are dimensionally reduced in advance and is used as a coping strategy when IDSA is executed in the face of many explanatory variables.