Designing an efficient and equitable humanitarian supply chain dynamically via reinforcement learning
This study dynamically designs an efficient and equitable humanitarian supply chain using a reinforcement learning algorithm, Proximal Policy Optimization (PPO), and compares it with heuristic algorithms. The results show that the PPO model consistently treats the average satisfaction rate as its priority.
Weijia Jin
Computing technology, computer technology
Weijia Jin. Designing an efficient and equitable humanitarian supply chain dynamically via reinforcement learning [EB/OL]. (2025-05-22) [2025-07-22]. https://arxiv.org/abs/2505.17439.