NewsMet : A 'do it all' Dataset of Contemporary Metaphors in News Headlines

Rohan Joseph, Timothy Liu, Aik Beng Ng, Simon See, Sunny Rai

Findings: Resources and Evaluation Findings Paper

Session 7: Resources and Evaluation (Virtual Poster)
Conference Room: Pier 7&8
Conference Time: July 12, 11:00-12:30 (EDT) (America/Toronto)
Global Time: July 12, Session 7 (15:00-16:30 UTC)
Keywords: nlp datasets, evaluation, metrics
TLDR: Metaphors are highly creative constructs of human language that grow old and eventually die. Popular datasets used for metaphor processing tasks were constructed from dated source texts. In this paper, we propose NewsMet, a large high-quality contemporary dataset of news headlines hand-annotated wit...
You can open the #paper-P3107 channel in a separate window.
Abstract: Metaphors are highly creative constructs of human language that grow old and eventually die. Popular datasets used for metaphor processing tasks were constructed from dated source texts. In this paper, we propose NewsMet, a large high-quality contemporary dataset of news headlines hand-annotated with metaphorical verbs. The dataset comprises headlines from various sources including political, satirical, reliable and fake. Our dataset serves the purpose of evaluation for the tasks of metaphor interpretation and generation. The experiments reveal several insights and limitations of using LLMs to automate metaphor processing tasks as frequently seen in the recent literature. The dataset is publicly available for research purposes{{https://github.com/AxleBlaze3/NewsMet\_Metaphor\_Dataset}}.