Text-Guided Token Communication for Wireless Image Transmission
Text-Guided Token Communication for Wireless Image Transmission
With the emergence of 6G networks and proliferation of visual applications, efficient image transmission under adverse channel conditions is critical. We present a text-guided token communication system leveraging pre-trained foundation models for wireless image transmission with low bandwidth. Our approach converts images to discrete tokens, applies 5G NR polar coding, and employs text-guided token prediction for reconstruction. Evaluations on ImageNet show our method outperforms Deep Source Channel Coding with Attention Modules (ADJSCC) in perceptual quality and semantic preservation at Signal-to-Noise Ratios (SNRs) above 0 dB while mitigating the cliff effect at lower SNRs. Our system requires no scenario-specific retraining and exhibits superior cross-dataset generalization, establishing a new paradigm for efficient image transmission aligned with human perceptual priorities.
Bole Liu、Li Qiao、Ye Wang、Zhen Gao、Yu Ma、Keke Ying、Tong Qin
无线通信
Bole Liu,Li Qiao,Ye Wang,Zhen Gao,Yu Ma,Keke Ying,Tong Qin.Text-Guided Token Communication for Wireless Image Transmission[EB/OL].(2025-07-08)[2025-07-20].https://arxiv.org/abs/2507.05781.点此复制
评论