SA-Encoder: A Learnt Spatial Autocorrelation Representation to Inform 3D Geospatial Object Detection

Tianyang Chen; Wenwu Tang; Shenen Chen; Craig Allan

doi:10.3390/rs17173124

SA-Encoder: A Learnt Spatial Autocorrelation Representation to Inform 3D Geospatial Object Detection — Tianyang Chen (2025) | RDL Network

Abstract

2 min read

Contextual features play a critical role in geospatial object detection by characterizing the surrounding environment of objects. In existing deep learning-based studies of 3D point cloud classification and segmentation, these features have been represented through geometric descriptors, semantic context (i.e., modeled by an attention-based mechanism), global-level context (i.e., through global aggregation), and textural representation (e.g., RGB, intensity, and other attributes). Even though contextual features have been widely explored, spatial contextual features that explicitly capture spatial autocorrelation and neighborhood dependency have received limited attention in object detection tasks. This gap is particularly relevant in the context of GeoAI, which calls for mutual benefits between artificial intelligence and geographic information science. To bridge this gap, this study presents a spatial autocorrelation encoder, namely SA-Encoder, designed to inform 3D geospatial object detection by capturing spatial autocorrelation representation as types of spatial contextual features. The study investigated the effectiveness of such spatial contextual features by estimating the performance of a model trained on them alone. The results suggested that the derived spatial autocorrelation information can help adequately identify some large objects in an urban-rural scene, such as buildings, terrain, and large trees. We further investigated how the spatial autocorrelation encoder can inform model performance in a geospatial object detection task. The results demonstrated significant improvements in detection accuracy across varied urban and rural environments when we compared the results to models without considering spatial autocorrelation as an ablation experiment. Moreover, the approach also outperformed the models trained by explicitly feeding traditional spatial autocorrelation measures (i.e., Matheron’s semivariance). This study showcases the advantage of the adaptiveness of the neural network-based encoder in deriving a spatial autocorrelation representation. This advancement bridges the gap between theoretical geospatial concepts and practical AI applications. Consequently, this study demonstrates the potential of integrating geographic theories with deep learning technologies to address challenges in 3D object detection, paving the way for further innovations in this field.

SA-Encoder: A Learnt Spatial Autocorrelation Representation to Inform 3D Geospatial Object Detection

Abstract

Discussion(0)

Related publications

Neural Network-Based Spatial Autocorrelation Representation for Deep Learning-Driven Geospatial Object Detection from 3D Point Clouds: A Case Study with Urban and Rural Scenes

Explicit Incorporation of Spatial Autocorrelation in 3D Deep Learning for Geospatial Object Detection

SMURF: Spatial Multi-Representation Fusion for 3D Object Detection with 4D Imaging Radar

SMURF: Spatial Multi-Representation Fusion for 3D Object Detection With 4D Imaging Radar

Exploring Person Context and Local Scene Context for Object Detection

Related publications

Preprint2025
Neural Network-Based Spatial Autocorrelation Representation for Deep Learning-Driven Geospatial Object Detection from 3D Point Clouds: A Case Study with Urban and Rural Scenes
Preprint2025

Article2024
Explicit Incorporation of Spatial Autocorrelation in 3D Deep Learning for Geospatial Object Detection
Article2024

Preprint2023
SMURF: Spatial Multi-Representation Fusion for 3D Object Detection with 4D Imaging Radar
Preprint2023

Article2023
SMURF: Spatial Multi-Representation Fusion for 3D Object Detection With 4D Imaging Radar
Article2023

Preprint2015
Exploring Person Context and Local Scene Context for Object Detection
Preprint2015