MasakhaNER: Named Entity Recognition for African Languages
MasakhaNER: Named Entity Recognition for African Languages
We take a step towards addressing the under-representation of the African continent in NLP research by creating the first large publicly available high-quality dataset for named entity recognition (NER) in ten African languages, bringing together a variety of stakeholders. We detail characteristics of the languages to help researchers understand the challenges that these languages pose for NER. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings. We release the data, code, and models in order to inspire future research on African NLP.
Blessing Sibanda、Graham Neubig、Salomey Osei、Tobius Saul Bateesa、Iroro Orife、Degaga Wolde、Jesujoba Alabi、Temilola Oloyede、Abdoulaye Diallo、Rubungo Andre Niyongabo、Dibora Gebreyohannes、Samba Ngom、Emmanuel Anebi、Deborah Nabagereka、Ignatius Ezeani、Gerald Muriuki、Yvonne Wambui、Perez Ogayo、Sebastian Ruder、Henok Tilaye、Kelechi Nwaike、Davis David、Tosin Adewumi、Shruti Rijhwani、Chris Chinenye Emezue、Orevaoghene Ahia、Chiamaka Chukwuneke、Adewale Akinfaderin、David Ifeoluwa Adelani、Stephen Mayhew、Thierno Ibrahima DIOP、Ayodele Awokoya、Catherine Gitau、Tendai Marengereke、Abdoulaye Faye、Paul Rayson、Shamsuddeen Muhammad、Happy Buzaaba、Bonaventure F. P. Dossou、Derguene Mbaye、Verrah Otiende、Mouhamadane MBOUP、Nkiruka Odu、Samuel Oyerinde、Israel Abebe Azime、Jade Abbott、Kelechi Ogueji、Mofetoluwa Adeyemi、Constantine Lignos、Eric Peter Wairagala、Clemencia Siro、Anuoluwapo Aremu、Victor Akinode、Tajuddeen Gwadabe、Jonathan Mukiibi、Julia Kreutzer、Joyce Nakatumba-Nabende、Seid Muhie Yimam、Daniel D'souza、Maurice Katusiime、Chester Palen-Michel
非洲诸语言语言学闪-含语系(阿非罗-亚细亚语系)
Blessing Sibanda,Graham Neubig,Salomey Osei,Tobius Saul Bateesa,Iroro Orife,Degaga Wolde,Jesujoba Alabi,Temilola Oloyede,Abdoulaye Diallo,Rubungo Andre Niyongabo,Dibora Gebreyohannes,Samba Ngom,Emmanuel Anebi,Deborah Nabagereka,Ignatius Ezeani,Gerald Muriuki,Yvonne Wambui,Perez Ogayo,Sebastian Ruder,Henok Tilaye,Kelechi Nwaike,Davis David,Tosin Adewumi,Shruti Rijhwani,Chris Chinenye Emezue,Orevaoghene Ahia,Chiamaka Chukwuneke,Adewale Akinfaderin,David Ifeoluwa Adelani,Stephen Mayhew,Thierno Ibrahima DIOP,Ayodele Awokoya,Catherine Gitau,Tendai Marengereke,Abdoulaye Faye,Paul Rayson,Shamsuddeen Muhammad,Happy Buzaaba,Bonaventure F. P. Dossou,Derguene Mbaye,Verrah Otiende,Mouhamadane MBOUP,Nkiruka Odu,Samuel Oyerinde,Israel Abebe Azime,Jade Abbott,Kelechi Ogueji,Mofetoluwa Adeyemi,Constantine Lignos,Eric Peter Wairagala,Clemencia Siro,Anuoluwapo Aremu,Victor Akinode,Tajuddeen Gwadabe,Jonathan Mukiibi,Julia Kreutzer,Joyce Nakatumba-Nabende,Seid Muhie Yimam,Daniel D'souza,Maurice Katusiime,Chester Palen-Michel.MasakhaNER: Named Entity Recognition for African Languages[EB/OL].(2021-03-22)[2025-05-18].https://arxiv.org/abs/2103.11811.点此复制
评论