[Demo] KWJA: A Unified Japanese Analyzer Based on Foundation Models

Nobuhiro Ueda; Kazumasa Omura; Takashi Kodama; Hirokazu Kiyomaru; Yugo Murawaki; Daisuke Kawahara; Sadao Kurohashi

[Demo] KWJA: A Unified Japanese Analyzer Based on Foundation Models

Nobuhiro Ueda, Kazumasa Omura, Takashi Kodama, Hirokazu Kiyomaru, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

📝 Paper

Anthology

Underline 🪧 Poster 📺 Watch Video on Underline Add to Favorites

Demo: Multilingualism and Cross-Lingual NLP (demo) Demo Paper

Demo Session 2: Multilingualism and Cross-Lingual NLP (demo) (Poster)

Conference Room: Frontenac Ballroom and Queen's Quay

Conference Time: July 10, 14:00-15:30 (EDT) (America/Toronto)

Global Time: July 10, Demo Session 2 (18:00-19:30 UTC)

TLDR: We present KWJA, a high-performance unified Japanese text analyzer based on foundation models. KWJA supports a wide range of tasks, including typo correction, word segmentation, word normalization, morphological analysis, named entity recognition, linguistic feature tagging, dependency parsing, PAS ...

You can open the #paper-D133 channel in a separate window.

Abstract: We present KWJA, a high-performance unified Japanese text analyzer based on foundation models. KWJA supports a wide range of tasks, including typo correction, word segmentation, word normalization, morphological analysis, named entity recognition, linguistic feature tagging, dependency parsing, PAS analysis, bridging reference resolution, coreference resolution, and discourse relation analysis, making it the most versatile among existing Japanese text analyzers. KWJA solves these tasks in a multi-task manner but still achieves competitive or better performance compared to existing analyzers specialized for each task. KWJA is publicly available under the MIT license at https://github.com/ku-nlp/kwja.