Causal Intervention for Mitigating Name Bias in Machine Reading Comprehension

Jiazheng Zhu, Shaojuan Wu, Xiaowang Zhang, Yuexian Hou, Zhiyong Feng

Findings Paper: Ethics and NLP

Session 7: Ethics and NLP (Virtual Poster)
Conference Room: Pier 7&8
Conference Time: July 12, 11:00-12:30 (EDT) (America/Toronto)
Global Time: July 12, Session 7 (15:00-16:30 UTC)
Keywords: model bias/unfairness mitigation
TLDR: MRC models built on pre-trained LMs can overuse name information when predicting answers, making name representations non-interchangeable (name bias); we propose CI4MRC, a causal interventional paradigm that treats pre-trained knowledge about names as a confounder and constrains it via neuron-wise and token-wise adjustments.
Abstract: Machine Reading Comprehension (MRC) aims to answer questions based on a given passage and has made great progress using pre-trained Language Models (LMs). We study the robustness of MRC models to names, which are flexible and repeatable. MRC models based on LMs may overuse name information to make predictions, which causes the representations of names to be non-interchangeable, a phenomenon we call name bias. In this paper, we propose a novel Causal Interventional paradigm for MRC (CI4MRC) to mitigate name bias. Specifically, we uncover that the pre-trained knowledge concerning names is indeed a confounder by analyzing the causalities among the pre-trained knowledge, the context representation, and the answers based on a Structural Causal Model (SCM). We develop effective CI4MRC algorithmic implementations that constrain the confounder through neuron-wise and token-wise adjustments. Experiments demonstrate that CI4MRC effectively mitigates name bias and achieves competitive performance on the original SQuAD. Moreover, our method generalizes to various pre-trained LMs and performs robustly on adversarial datasets.
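The abstract describes the adjustment only at a high level. As a rough, hedged illustration (not the paper's actual CI4MRC algorithm), a backdoor-style intervention can be approximated by marginalizing the answer prediction over a dictionary of confounder representations, so that no single name-specific feature dominates the answer. All identifiers below (`backdoor_adjusted_logits`, `confounder_dict`, `answer_head`) are hypothetical and chosen for illustration.

```python
import torch
import torch.nn.functional as F

def backdoor_adjusted_logits(context_repr, confounder_dict, answer_head):
    """
    Minimal sketch of a backdoor-style adjustment:
    approximate P(A | do(Context)) ~ sum_k P(A | Context, k) * P(k)
    by marginalizing over a fixed set of confounder prototypes
    (e.g., clustered name-related features from the pre-trained LM).

    context_repr:    (batch, hidden)  contextual representation of passage/question
    confounder_dict: (K, hidden)      K prototype vectors for the confounder strata
    answer_head:     callable mapping (batch, hidden) features to answer logits
    """
    hidden = context_repr.shape[-1]

    # Soft "stratification": attention of the context over the confounder
    # prototypes approximates conditioning on each stratum k.
    attn = F.softmax(context_repr @ confounder_dict.t() / hidden ** 0.5, dim=-1)  # (batch, K)

    # Expected confounder feature, a linearized stand-in for the sum over strata
    # (as in NWGM-style approximations used for deconfounded prediction).
    expected_conf = attn @ confounder_dict  # (batch, hidden)

    # Fuse the context with the marginalized confounder and score answers.
    return answer_head(context_repr + expected_conf)

# Toy usage with random tensors and a linear answer head (for shape checking only).
if __name__ == "__main__":
    batch, hidden, K, num_answers = 4, 768, 16, 2
    context = torch.randn(batch, hidden)
    prototypes = torch.randn(K, hidden)
    head = torch.nn.Linear(hidden, num_answers)
    print(backdoor_adjusted_logits(context, prototypes, head).shape)  # torch.Size([4, 2])
```

This sketch assumes a uniform treatment of the confounder strata via attention; the paper's neuron-wise and token-wise adjustments operate on the model's internal representations and are not reproduced here.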