MLlab4CS at SemEval-2023 Task 2: Named Entity Recognition in Low-resource Language Bangla Using Multilingual Language Models

Shrimon Mukherjee, Madhusudan Ghosh, Girish ., Partha Basuchowdhuri

The 17th International Workshop on Semantic Evaluation (SemEval-2023) Task 2: multiconer ii multilingual complex named entity recognition Paper

TLDR: Extracting of NERs from low-resource languages and recognizing their types is one of the important tasks in the entity extraction domain. Recently many studies have been conducted in this area of research. In our study, we introduce a system for identifying complex entities and recognizing their typ
You can open the #paper-SemEval_210 channel in a separate window.
Abstract: Extracting of NERs from low-resource languages and recognizing their types is one of the important tasks in the entity extraction domain. Recently many studies have been conducted in this area of research. In our study, we introduce a system for identifying complex entities and recognizing their types from low-resource language Bangla, which was published in SemEval Task 2 MulitCoNER II 2023. For this sequence labeling task, we use a pre-trained language model built on a natural language processing framework. Our team name in this competition is \textbackslash{}textbf\{MLlab4CS\}. Our model \textbackslash{}textbf\{\textbackslash{}Muril\} produces a macro average F-score of 76.27\textbackslash{}\%, which is a comparable result for this competition.