Understanding Demonstration-based Learning from a Causal Perspective

Ruiyi Zhang; Tong Yu

Understanding Demonstration-based Learning from a Causal Perspective

Ruiyi Zhang, Tong Yu

📝 Paper

Anthology

Underline 📺 Watch Video on Underline Add to Favorites

Main: Information Extraction Main-poster Paper

Session 1: Information Extraction (Virtual Poster)

Conference Room: Pier 7&8

Conference Time: July 10, 11:00-12:30 (EDT) (America/Toronto)

Global Time: July 10, Session 1 (15:00-16:30 UTC)

Keywords: named entity recognition and relation extraction

TLDR: Demonstration-based learning has shown impressive performance in exploiting pretrained language models under few-shot learning settings. It is interesting to see that demonstrations, even those composed of random tokens, can still improve performance. In this paper, we build a Structural Causal Mode...

You can open the #paper-P4490 channel in a separate window.

Abstract: Demonstration-based learning has shown impressive performance in exploiting pretrained language models under few-shot learning settings. It is interesting to see that demonstrations, even those composed of random tokens, can still improve performance. In this paper, we build a Structural Causal Model (SCM) to understand demonstration-based learning from causal perspectives and interpret random demonstrations as interventions on the demonstration variable within the causal model. We investigate the causal effects and find that the concurrence of specific words in the demonstration will induce bias, while randomly sampled tokens in the demonstration do not. Based on this finding, we further propose simple ways to construct random demonstrations, which even outperform hand-crafted, meaningful demonstrations on public sequence labeling benchmarks.