Swa-bhasha Resource Hub: Romanized Sinhala to Sinhala Transliteration Systems and Data Resources

Source: arXiv
Abstract

The Swa-bhasha Resource Hub provides a comprehensive collection of data resources and algorithms developed for Romanized Sinhala to Sinhala transliteration between 2020 and 2025. These resources have played a significant role in advancing research in Sinhala Natural Language Processing (NLP), particularly in training transliteration models and developing applications involving Romanized Sinhala. The openly accessible datasets and corresponding tools are available through this hub. This paper presents a detailed overview of the resources contributed by the authors and includes a comparative analysis of existing transliteration applications in the domain.

Deshan Sumanathilaka, Sameera Perera, Sachithya Dharmasiri, Maneesha Athukorala, Anuja Dilrukshi Herath, Rukshan Dias, Pasindu Gamage, Ruvan Weerasinghe, Y. H. P. P. Priyadarshana

Subjects: Austroasiatic languages; computing technology, computer technology

Deshan Sumanathilaka, Sameera Perera, Sachithya Dharmasiri, Maneesha Athukorala, Anuja Dilrukshi Herath, Rukshan Dias, Pasindu Gamage, Ruvan Weerasinghe, Y. H. P. P. Priyadarshana. Swa-bhasha Resource Hub: Romanized Sinhala to Sinhala Transliteration Systems and Data Resources [EB/OL]. (2025-07-12) [2025-08-02]. https://arxiv.org/abs/2507.09245.
