A Multilingual Evaluation of NER Robustness to Adversarial Inputs
Akshay Srinivasan, Sowmya Vajjala
The 8th Workshop on Representation Learning for NLP (RepL4NLP 2023)
TLDR:
Adversarial evaluations of language models typically focus on English alone. In this paper, we performed a multilingual evaluation of Named Entity Recognition (NER) in terms of its robustness to small perturbations in the input. Our results showed that the NER models we explored across three languages (English, German and Hindi) are not very robust to such changes.
Abstract:
Adversarial evaluations of language models typically focus on English alone. In this paper, we performed a multilingual evaluation of Named Entity Recognition (NER) in terms of its robustness to small perturbations in the input. Our results showed that the NER models we explored across three languages (English, German and Hindi) are not very robust to such changes, as indicated by fluctuations in the overall F1 score as well as in a more fine-grained evaluation. With that knowledge, we further explored whether it is possible to improve the existing NER models, using a part of the generated adversarial data sets either as augmented training data to train a new NER model or as fine-tuning data to adapt an existing NER model. Our results showed that both approaches improve performance on the original as well as the adversarial test sets. While there is no significant difference between the two approaches for English, re-training is significantly better than fine-tuning for German and Hindi.
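
To make the setup concrete, below is a minimal sketch of one kind of perturbation-based robustness check in the spirit of the abstract, assuming CoNLL-style (token, BIO-tag) data and the seqeval library for entity-level F1. The entity-swap perturbation, the swap_entities function, and the REPLACEMENTS pool are illustrative assumptions, not the paper's exact attack or data.

```python
# Illustrative sketch: swap entity surface forms to create small,
# label-preserving perturbations, then compare entity-level F1 on
# original vs. perturbed inputs. Assumes BIO-tagged data and seqeval.
import random

from seqeval.metrics import f1_score

# Hypothetical replacement pool per entity type (assumption, not the
# paper's resource).
REPLACEMENTS = {
    "PER": [["Anita", "Rao"], ["Jonas", "Weber"]],
    "LOC": [["Mumbai"], ["Bremen"]],
}

def swap_entities(tokens, tags, rng):
    """Replace each entity span with a random same-type surface form."""
    out_tokens, out_tags, i = [], [], 0
    while i < len(tags):
        if tags[i].startswith("B-") and tags[i][2:] in REPLACEMENTS:
            etype = tags[i][2:]
            # Find the end of the current entity span.
            j = i + 1
            while j < len(tags) and tags[j] == f"I-{etype}":
                j += 1
            new = rng.choice(REPLACEMENTS[etype])
            out_tokens += new
            out_tags += [f"B-{etype}"] + [f"I-{etype}"] * (len(new) - 1)
            i = j
        else:
            out_tokens.append(tokens[i])
            out_tags.append(tags[i])
            i += 1
    return out_tokens, out_tags

rng = random.Random(0)
tokens = ["Angela", "Merkel", "visited", "Delhi", "."]
tags = ["B-PER", "I-PER", "O", "B-LOC", "O"]
adv_tokens, adv_tags = swap_entities(tokens, tags, rng)
print(adv_tokens, adv_tags)

# With predictions from any NER model, compare entity-level F1 on the
# original and perturbed test sets; a large drop signals brittleness.
gold = [tags]
pred = [["B-PER", "I-PER", "O", "O", "O"]]  # toy model output
print("F1:", f1_score(gold, pred))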