面向高分辨率遥感影像建筑物提取的SD-BASNet网络
|
朱娟娟(1996-),女,硕士,助理工程师,主要研究方向为遥感图像处理与分析。Email:20202201142@stu.kust.edu.cn。 |
Copy editor: 张仙
收稿日期: 2024-06-12
修回日期: 2024-10-07
网络出版日期: 2026-06-03
基金资助
国家自然科学基金项目“面向光学与SAR遥感图像语义变化检测的多任务学习方法研究”(42361054)
云南省基础研究计划项目“轻量级自适应尺度特征遥感影像非监督变化检测方法”(202201AT070164)
湖南省自然科学基金项目“基于边缘注意力网络的建筑物动态变化检测和提取”(2023JJ60561)
兴滇英才支持计划项目共同资助
SD-BASNet:a building extraction network for high-spatial-resolution remote sensing imagery
Received date: 2024-06-12
Revised date: 2024-10-07
Online published: 2026-06-03
针对网络模型参数量大、下采样过程丢失影像建筑物细节信息的问题,受轻量级网络的启发,设计了一种融入深度可分离残差块和空洞卷积的建筑物提取网络(SD-BASNet)。首先,在深度监督编码器预测模块中设计了一个深度可分离残差块,将深度可分离卷积引入主干网络ResNet中,避免卷积核过大,减少网络的参数量;其次,为防止网络轻量化带来的精度下降,将空洞卷积融入后处理优化模块的编码层,增大特征图的感受野,从而捕捉更广泛的上下文信息,提高建筑物特征提取的准确性。在WHU建筑物数据集上进行实验,在不同尺度建筑物提取中均表现较好,其平均交并比和平均像素精度分别为92.25%和96.59%,其召回率、精确率和F1指标分别达到96.50%,93.79%和92.61%。与PSPNet,SegNet,DeepLabV3,SE-UNet,UNet++等语义分割网络相比,SD-BASNet网络提取精度得到了显著提升,且提取的建筑物完整度更好;与基础网络BASNet相比,SD-BASNet网络的参数量与运行时间也有所减少,证实了该文提出的SD-BASNet网络的有效性。
朱娟娟 , 黄亮 , 朱莎莎 . 面向高分辨率遥感影像建筑物提取的SD-BASNet网络[J]. 自然资源遥感, 2025 , 37(5) : 122 -130 . DOI: 10.6046/zrzyyg.2024209
In response to the challenges posed by substantial parameters and the loss of building details during downsampling,this study,inspired by lightweight networks,designed a building extraction network (SD-BASNet) incorporating depthwise separable residual blocks and dilated convolution. First,a depthwise separable residual block was designed in the prediction module of the deep supervision encoder-decoder. Depthwise separable convolution was incorporated into the backbone ResNet to prevent oversized convolutional kernels and reduce the number of network parameters. Second,to mitigate the potential decline in accuracy due to network lightweighting,dilated convolution was integrated into the encoder layer of the post-processing optimization module. This strategy effectively expands the receptive field of feature maps,thereby capturing broader contextual information and enhancing the accuracy of building feature extraction. Experiments on the WHU building dataset showed that the proposed network achieved an mIoU of 92.25%,an mPA of 96.59%,a Recall of 96.50%,a Precision of 93.79%,and a F1-score of 92.61%. Compared with current semantic segmentation networks,including PSPNet,SegNet,DeepLabV3,SE-UNet,and UNet++,the SD-BASNet demonstrated significantly improved accuracy and better completeness of building extraction. Compared with the baseline BASNet,the SD-BASNet also exhibited reductions in both parameter count and runtime,demonstrating its effectiveness.
表1 不同网络的参数量与时间分析Tab.1 Analysis of parameter quantity in different networks |
| 序号 | 网络 | 参数量/106 | 训练时间/h |
|---|---|---|---|
| ① | BASNet | 87.06 | 11.27 |
| ② | BASNet+DSC | 61.54 | 10.79 |
| ③ | BASNet+DC | 87.06 | 11.77 |
| ④ | BASNet+DSC+DC(SD-BASNet) | 61.96 | 10.91 |
表2 不同网络的检测结果Tab.2 Detection results of different networks (%) |
| 网络 | mIoU | mPA | 召回率 | 精确率 | F1 |
|---|---|---|---|---|---|
| PSPNet | 73.74 | 80.43 | 89.62 | 62.86 | 73.89 |
| SegNet | 77.67 | 83.26 | 93.27 | 67.87 | 78.57 |
| DeepLabV3 | 82.30 | 87.07 | 94.34 | 75.39 | 83.80 |
| SE-UNet | 82.95 | 87.37 | 95.43 | 75.74 | 84.46 |
| UNet++ | 83.77 | 87.92 | 96.09 | 76.69 | 85.30 |
| BASNet | 90.10 | 93.88 | 98.40 | 87.44 | 89.92 |
| SD-BASNet | 92.25 | 96.59 | 96.50 | 93.79 | 92.61 |
表3 小尺度建筑物的检测结果Tab.3 Detection results of small-scale building (%) |
| 网络 | mIoU | mPA | 召回率 | 精确率 | F1 |
|---|---|---|---|---|---|
| PSPNet | 74.30 | 79.84 | 89.84 | 61.46 | 75.26 |
| SegNet | 74.64 | 82.24 | 96.84 | 65.48 | 78.13 |
| DeepLabV3 | 82.17 | 87.93 | 96.80 | 77.06 | 85.81 |
| SE-UNet | 83.59 | 88.81 | 97.87 | 78.43 | 87.08 |
| UNet++ | 84.77 | 89.92 | 97.09 | 79.69 | 87.30 |
| BASNet | 92.89 | 96.56 | 93.99 | 96.01 | 94.99 |
| SD-BASNet | 93.07 | 96.81 | 94.71 | 96.17 | 95.41 |
表4 多尺度建筑物的检测结果Tab.4 Detection results of multi-scale building (%) |
| 网络 | mIoU | mPA | 召回率 | 精确率 | F1 |
|---|---|---|---|---|---|
| PSPNet | 46.14 | 76.51 | 78.49 | 53.75 | 63.14 |
| SegNet | 81.04 | 87.69 | 95.42 | 77.37 | 85.45 |
| DeepLabV3 | 81.58 | 87.97 | 96.23 | 77.56 | 85.89 |
| SE-UNet | 83.09 | 88.80 | 97.99 | 78.46 | 87.15 |
| UNet++ | 86.86 | 84.79 | 97.14 | 77.14 | 85.02 |
| BASNet | 89.36 | 93.58 | 94.40 | 90.61 | 92.51 |
| SD-BASNet | 90.18 | 94.50 | 94.57 | 91.81 | 93.17 |
表5 大尺度建筑物的检测结果Tab.5 Detection results of large-scale building (%) |
| 网络 | mIoU | mPA | 召回率 | 精确率 | F1 |
|---|---|---|---|---|---|
| PSPNet | 39.58 | 40.72 | 40.72 | 58.62 | 56.71 |
| SegNet | 82.62 | 90.25 | 87.19 | 90.14 | 88.64 |
| DeepLabV3 | 84.50 | 91.90 | 92.57 | 88.09 | 90.28 |
| SE-UNet | 88.21 | 93.85 | 93.56 | 91.80 | 92.67 |
| UNet++ | 87.59 | 92.16 | 93.31 | 90.34 | 91.31 |
| BASNet | 91.76 | 95.39 | 92.70 | 97.12 | 94.86 |
| SD-BASNet | 92.34 | 95.95 | 92.49 | 97.59 | 95.50 |
表6 消融实验检测结果Tab.6 Detection results of ablation tests (%) |
| 网络 | mIoU | mPA | 召回率 | 精确率 | F1 |
|---|---|---|---|---|---|
| BASNet | 90.10 | 93.88 | 98.40 | 87.44 | 89.92 |
| BASNet+DSC | 86.39 | 94.80 | 91.39 | 93.05 | 92.70 |
| BASNet+DC | 92.56 | 95.01 | 96.16 | 93.24 | 92.12 |
| SD-BASNet | 92.25 | 96.59 | 96.50 | 93.79 | 92.61 |
| [1] |
张卓尔, 潘俊, 舒奇迪. 基于双路细节关注网络的遥感影像建筑物提取[J]. 武汉大学学报(信息科学版), 2024, 49(3):376-388.
|
| [2] |
李治, 隋正伟, 傅俏燕, 等. 基于形态学序列和多源先验信息的城市建筑物高分遥感提取[J]. 遥感学报, 2023, 27(4):998-1008.
|
| [3] |
张云佐, 郭威, 武存宇. 融合CNN和Transformer的遥感图像建筑物快速提取[J]. 光学精密工程, 2023, 31(11):1700-1709.
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
李星华, 白学辰, 李正军, 等. 面向高分影像建筑物提取的多层次特征融合网络[J]. 武汉大学学报(信息科学版), 2022, 47(8):1236-1244.
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
季顺平, 魏世清. 遥感影像建筑物提取的卷积神经元网络与开源数据集方法[J]. 测绘学报, 2019, 48(4):448-459.
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
刘浩, 骆剑承, 黄波, 等. 基于特征压缩激活Unet网络的建筑物提取[J]. 地球信息科学学报, 2019, 21(11):1779-1789.
|
| [30] |
|
/
| 〈 |
|
〉 |