The document explores different fusion strategies for combining RGB satellite imagery and digital surface models (DSM) in convolutional neural networks for satellite image segmentation. It finds that fusing the inputs at the encoder level achieved the highest intersection-over-union score of 0.87, slightly better than input fusion at 0.888. Fusing at the segmentation heads or decoder levels performed worse. Inputting the RGB and DSM as separate 4-channel data achieved the best results, with an IOU of 0.888.
Related topics: