|国家预印本平台
首页|DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications

DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications

DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications

来源:Arxiv_logoArxiv
英文摘要

Significant efforts have been directed towards adapting self-supervised multimodal learning for Earth observation applications. However, most current methods produce coarse patch-sized embeddings, limiting their effectiveness and integration with other modalities like LiDAR. To close this gap, we present DUNIA, an approach to learn pixel-sized embeddings through cross-modal alignment between images and full-waveform LiDAR data. As the model is trained in a contrastive manner, the embeddings can be directly leveraged in the context of a variety of environmental monitoring tasks in a zero-shot setting. In our experiments, we demonstrate the effectiveness of the embeddings for seven such tasks: canopy height mapping, fractional canopy cover, land cover mapping, tree species identification, plant area index, crop type classification, and per-pixel waveform-based vertical structure mapping. The results show that the embeddings, along with zero-shot classifiers, often outperform specialized supervised models, even in low-data regimes. In the fine-tuning setting, we show strong performances near or better than the state-of-the-art on five out of six tasks.

Sarah Brood、Ibrahim Fayad、Max Zimmer、Martin Schwartz、Fabian Gieseke、Philippe Ciais、Gabriel Belouze、Alexandre d'Aspremont、Aurelien De Truchis

遥感技术

Sarah Brood,Ibrahim Fayad,Max Zimmer,Martin Schwartz,Fabian Gieseke,Philippe Ciais,Gabriel Belouze,Alexandre d'Aspremont,Aurelien De Truchis.DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications[EB/OL].(2025-07-16)[2025-08-16].https://arxiv.org/abs/2502.17066.点此复制

评论