|国家预印本平台
首页|NeoN: A Tool for Automated Detection, Linguistic and LLM-Driven Analysis of Neologisms in Polish

NeoN: A Tool for Automated Detection, Linguistic and LLM-Driven Analysis of Neologisms in Polish

NeoN: A Tool for Automated Detection, Linguistic and LLM-Driven Analysis of Neologisms in Polish

来源:Arxiv_logoArxiv
英文摘要

NeoN, a tool for detecting and analyzing Polish neologisms. Unlike traditional dictionary-based methods requiring extensive manual review, NeoN combines reference corpora, Polish-specific linguistic filters, an LLM-driven precision-boosting filter, and daily RSS monitoring in a multi-layered pipeline. The system uses context-aware lemmatization, frequency analysis, and orthographic normalization to extract candidate neologisms while consolidating inflectional variants. Researchers can verify candidates through an intuitive interface with visualizations and filtering controls. An integrated LLM module automatically generates definitions and categorizes neologisms by domain and sentiment. Evaluations show NeoN maintains high accuracy while significantly reducing manual effort, providing an accessible solution for tracking lexical innovation in Polish.

Aleksandra Tomaszewska、Dariusz Czerski、Bartosz ?uk、Maciej Ogrodniczuk

印欧语系

Aleksandra Tomaszewska,Dariusz Czerski,Bartosz ?uk,Maciej Ogrodniczuk.NeoN: A Tool for Automated Detection, Linguistic and LLM-Driven Analysis of Neologisms in Polish[EB/OL].(2025-05-21)[2025-06-14].https://arxiv.org/abs/2505.15426.点此复制

评论