|国家预印本平台
首页|Domain Pre-training Impact on Representations

Domain Pre-training Impact on Representations

Domain Pre-training Impact on Representations

来源:Arxiv_logoArxiv
英文摘要

This empirical study analyzes the effects of the pre-training corpus on the quality of learned transformer representations. We focus on the representation quality induced solely through pre-training. Our experiments show that pre-training on a small, specialized corpus can yield effective representations, and that the success of combining a generic and a specialized corpus depends on the distributional similarity between the target task and the specialized corpus.

Cesar Gonzalez-Gutierrez、Ariadna Quattoni

计算技术、计算机技术

Cesar Gonzalez-Gutierrez,Ariadna Quattoni.Domain Pre-training Impact on Representations[EB/OL].(2025-05-30)[2025-07-19].https://arxiv.org/abs/2505.24455.点此复制

评论