BREAK: Breaking the Dialogue State Tracking Barrier with Beam Search and Re-ranking

Seungpil Won, Heeyoung Kwak, Joongbo Shin, Janghoon Han, Kyomin Jung

Main: Dialogue and Interactive Systems Main-poster Paper

Poster Session 4: Dialogue and Interactive Systems (Poster)
Conference Room: Frontenac Ballroom and Queen's Quay
Conference Time: July 11, 11:00-12:30 (EDT) (America/Toronto)
Global Time: July 11, Poster Session 4 (15:00-16:30 UTC)
Keywords: dialogue state tracking
TLDR: Despite the recent advances in dialogue state tracking (DST), the joint goal accuracy (JGA) of the existing methods on MultiWOZ 2.1 still remains merely 60\%. In our preliminary error analysis, we find that beam search produces a pool of candidates that is likely to include the correct dialogue stat...
You can open the #paper-P5293 channel in a separate window.
Abstract: Despite the recent advances in dialogue state tracking (DST), the joint goal accuracy (JGA) of the existing methods on MultiWOZ 2.1 still remains merely 60\%. In our preliminary error analysis, we find that beam search produces a pool of candidates that is likely to include the correct dialogue state. Motivated by this observation, we introduce a novel framework, called BREAK (Beam search and RE-rAnKing), that achieves outstanding performance on DST. BREAK performs DST in two stages: (i) generating k-best dialogue state candidates with beam search and (ii) re-ranking the candidates to select the correct dialogue state. This simple yet powerful framework shows state-of-the-art performance on all versions of MultiWOZ and M2M datasets. Most notably, we push the joint goal accuracy to 80-90\% on MultiWOZ 2.1-2.4, which is an improvement of 23.6\%, 26.3\%, 21.7\%, and 10.8\% over the previous best-performing models, respectively. The data and code will be available at https://github.com/tony-won/DST-BREAK