
Text-Guided Token Communication for Wireless Image Transmission


Source: arXiv
Abstract

With the emergence of 6G networks and the proliferation of visual applications, efficient image transmission under adverse channel conditions is critical. We present a text-guided token communication system that leverages pre-trained foundation models for low-bandwidth wireless image transmission. Our approach converts images to discrete tokens, applies 5G NR polar coding, and employs text-guided token prediction for reconstruction. Evaluations on ImageNet show that our method outperforms Deep Source Channel Coding with Attention Modules (ADJSCC) in perceptual quality and semantic preservation at Signal-to-Noise Ratios (SNRs) above 0 dB while mitigating the cliff effect at lower SNRs. Our system requires no scenario-specific retraining and exhibits superior cross-dataset generalization, establishing a new paradigm for efficient image transmission aligned with human perceptual priorities.

Bole Liu, Li Qiao, Ye Wang, Zhen Gao, Yu Ma, Keke Ying, Tong Qin

Wireless communication

Bole Liu, Li Qiao, Ye Wang, Zhen Gao, Yu Ma, Keke Ying, Tong Qin. Text-Guided Token Communication for Wireless Image Transmission [EB/OL]. (2025-07-08) [2025-07-20]. https://arxiv.org/abs/2507.05781.
