Morphological Inflection: A Reality Check

Jordan Kodner, Sarah Payne, Salam Khalifa, Zoey Liu

Main: Theme: Reality Check Main-poster Paper

Poster Session 6: Theme: Reality Check (Poster)
Conference Room: Frontenac Ballroom and Queen's Quay
Conference Time: July 12, 09:00-10:30 (EDT) (America/Toronto)
Global Time: July 12, Poster Session 6 (13:00-14:30 UTC)
Keywords: (non-)generalizability, evaluation, methodology
Languages: Arabic, German, Spanish, Swahili, Turkish
Abstract: Morphological inflection is a popular task in sub-word NLP with both practical and cognitive applications. For years now, state-of-the-art systems have reported high, but also highly variable, performance across data sets and languages. We investigate the causes of this high performance and high variability; we find several aspects of data set creation and evaluation which systematically inflate performance and obfuscate differences between languages. To improve generalizability and reliability of results, we propose new data sampling and evaluation strategies that better reflect likely use cases. Using these new strategies, we make new observations on the generalization abilities of current inflection systems.