[Industry] Improving Knowledge Production Efficiency With Question Answering on Conversation

Changlin Yang; Siye Liu; Sen Hu; Wangshu Zhang; Teng Xu; Jing Zheng

[Industry] Improving Knowledge Production Efficiency With Question Answering on Conversation

Changlin Yang, Siye Liu, Sen Hu, Wangshu Zhang, Teng Xu, Jing Zheng

📝 Paper

Anthology

Underline 📺 Watch Video on Underline Add to Favorites

Industry: Industry Industry Paper

Session 4: Industry (Virtual Poster)

Conference Room: Pier 7&8

Conference Time: July 11, 11:00-12:30 (EDT) (America/Toronto)

Global Time: July 11, Session 4 (15:00-16:30 UTC)

TLDR: Through an online customer service application, we have collected many conversations between customer service agents and customers. Building a knowledge production system can help reduce the labor cost of maintaining the FAQ database for the customer service chatbot, whose core module is question an...

You can open the #paper-I60 channel in a separate window.

Abstract: Through an online customer service application, we have collected many conversations between customer service agents and customers. Building a knowledge production system can help reduce the labor cost of maintaining the FAQ database for the customer service chatbot, whose core module is question answering (QA) on these conversations. However, most existing researches focus on document-based QA tasks, and there is a lack of researches on conversation-based QA and related datasets, especially in Chinese language. The challenges of conversation-based QA include: 1) answers may be scattered among multiple dialogue turns; 2) understanding complex dialogue contexts is more complicated than documents. To address these challenges, we propose a multi-span extraction model on this task and introduce continual pre-training and multi-task learning schemes to further improve model performance. To validate our approach, we construct two Chinese datasets using dialogues as the knowledge source, namely cs-qaconv and kd-qaconv, respectively. Experimental results demonstrate that the proposed model outperforms the baseline on both datasets. The online application also verifies the effectiveness of our method. The dataset kd-qaconv will be released publicly for research purposes.