|国家预印本平台
首页|无线VR场景下基于U-RIS的资源分配算法研究

无线VR场景下基于U-RIS的资源分配算法研究

Distributed Resource Allocation Algorithm Based on U-RIS in Wireless VR

中文摘要英文摘要

面向无线VR业务的移动边缘计算网络中面临着网络资源受限、传输不稳等问题,本文对多人VR(Virtual Reality,虚拟现实)游戏场景中资源管理进行了研究,并基于深度强化学习提出了一种分布式资源分配算法,优化了VR用户的平均时延、带宽与计算资源的利用效率。场景中部署的MEC服务器根据VR用户的实时虚拟坐标完成高分辨率、高动态的沉浸式VR游戏世界渲染,并通过太赫兹频段进行渲染数据的下行传输。在实际场景中,受用户自身移动和头部转动影响,用户与基站间的下行传输信道会因人体遮挡而转化为非视距传输信道,导致传输速率下降。为解决该问题,本文引入U-RIS(Unmanned Aerial Vehicle mounted with Reconfigurable Intelligent Surface,搭载智能反射面的无人机)技术,利用U-RIS兼备UAV(Unmanned Aerial Vehicle,无人机)灵活性和RIS信道重构能力的特点,能够建立基站与用户间的辅助传输链路,提升用户下行信道稳定性。然而,U-RIS对用户信道的实时优化效果同样受用户移动与头部转动、以及太赫兹接收天线定向增益等因素的影响。同时,考虑到用户移动随机性和VR业务时延敏感性,如何在用户位置信息不足的前提下优化三维坐标系中U-RIS的飞行轨迹成为待解决的难题。本文基于快速PPO(Proximal Policy Optimization,近端优化策略)与DDQN(Deep Dueling Q-network,双重深度Q网络)提出一个多级强化学习模型。首先,考虑到用户位置信息不足、三维空间复杂度高等限制条件,为了解决U-RIS飞行轨迹优化问题,模型第一级利用快速PPO建立用户历史移动信息与U-RIS实时飞行向量之间的联系,实现了不需要用户实时位置信息的U-RIS三维轨迹优化。然后,模型第二级基于DDQN综合分析U-RIS轨迹与基站当前剩余资源对用户时延的影响,给出用户在未来一段时间内带宽与计算资源的分配决策。仿真结果表明,本文所提算法显著提升了网络中带宽和计算资源利用率,同时降低了用户时延。

his article investigates resource management in a mobile edge computing (MEC) network for wireless virtual reality (VR) applications, which is faced with challenges such as limited network resources and unstable transmission. A distributed resource allocation algorithm based on deep reinforcement learning is proposed to optimize the average latency, bandwidth, and utilization efficiency of computational resources for multiple VR gamers. MEC servers deployed in the VR gaming scenario render high-resolution and high-dynamic immersive VR game worlds based on real-time virtual coordinates of the VR users, and transmit rendered data via terahertz (THz) frequencies. In practical scenarios, the downlink transmission channel between the base station and the VR user may be affected by human body blockage due to the user\'s movements and head rotation, resulting in a decrease in transmission rate. To address this problem, the use of unmanned aerial vehicle mounted with reconfigurable intelligent surfaces (U-RIS) technology is introduced, which combines the flexibility of unmanned aerial vehicle (UAV) and the channel reconstruction capability of reconfigurable intelligent surfaces (RIS) to establish an auxiliary transmission link between the base station and the user, thereby improving the stability of the downlink channel. However, the real-time optimization effect of U-RIS on the user\'s channel is also affected by factors such as user movements and head rotation, as well as the directional gain of the THz receive antenna. Furthermore, considering the randomness of user movements and the sensitivity of VR applications to latency, optimizing the flight trajectory of U-RIS in three-dimensional space in the absence of sufficient user location information is a challenging task. To address this issue, a multi-level reinforcement learning model based on fast proximal policy optimization (PPO) and deep dueling Q-network (DDQN) is proposed in this article. Firstly, to solve the problem of U-RIS flight trajectory optimization without real-time user location information, the first level of the proposed model establishes a connection between the user\'s historical movement information and the real-time flight vector of U-RIS using fast PPO. Secondly, the next level of the model comprehensively analyzes the impact of U-RIS trajectory and the current remaining resources of the base station on user latency using DDQN, and provides a bandwidth and computational resource allocation decision for the user. Simulation results demonstrate that the proposed algorithm significantly improves the utilization of bandwidth and computational resources in the network, while reducing user latency.

李曦、杨东鹏

无线通信通信电子技术应用

无线VR,无人机通信,智能反射面

Wireless Virtual Reality Unmanned Aerial Vechile Communication Reconfigurable Intelligent Surface

李曦,杨东鹏.无线VR场景下基于U-RIS的资源分配算法研究[EB/OL].(2023-03-28)[2025-07-21].http://www.paper.edu.cn/releasepaper/content/202303-314.点此复制

评论