首页|大语言模型旋转位置编码的简易推导

大语言模型旋转位置编码的简易推导

Easy Derivation Of Rotary Position Embeddings For Large Language Models

来源：

中文摘要

英文摘要

以 LLAMA 为代表的开源大语言模型广泛使用旋转位置编码，原始论文使用复函数推导。本文改用线性代数推导，期望更好地理解该编码方法；提出该方法的一个疑点并给出了改进建议。

he Rotary Position Embeddings(RoPE) is widely used in open-source large language models such as LLAMA. In the original paper, the formula derivation uses complex functions. In this Paper, I derive PoPE’s formulas again with linear algebra, hoping to better understand this method.

DOI：10.12074/202307.00071V1

学科分类：计算技术、计算机技术

中文关键词：大语言模型LLM旋转位置编码LLAMA

英文关键词：Large Language Model(LLM)Rotary Position Embeddings(RoPE)LLAMA

推荐引用：.大语言模型旋转位置编码的简易推导[EB/OL].(2023-07-10)[2025-08-02].https://chinaxiv.org/abs/202307.00071.点此复制

大语言模型旋转位置编码的简易推导

Easy Derivation Of Rotary Position Embeddings For Large Language Models

评论