Learning Multi-Step Reasoning by Solving Arithmetic Tasks

Tianduo Wang; Wei Lu

Main: NLP Applications Main-poster Paper

Poster Session 7: NLP Applications (Poster)

Conference Room: Frontenac Ballroom and Queen's Quay

Conference Time: July 12, 11:00-12:30 (EDT) (America/Toronto)

Global Time: July 12, Poster Session 7 (15:00-16:30 UTC)

Keywords: mathematical nlp

Abstract: Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. This success is attributed to their Chain-of-Thought (CoT) reasoning abilities, i.e., the ability to decompose complex questions into step-by-step reasoning chains, but such abilities seem to emerge only in models with abundant parameters. This work investigates how to equip relatively small LMs with multi-step reasoning capabilities. We propose to inject such abilities by continually pre-training LMs on a synthetic dataset, MsAT, which is composed of Multi-step Arithmetic Tasks. Our experiments on four math word problem datasets show the effectiveness of the proposed method in enhancing LMs' math reasoning abilities.
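
The abstract does not specify how MsAT examples are formatted, so the following is only a hypothetical sketch of what generating one synthetic multi-step arithmetic task could look like. The function name `make_task`, the variable-naming scheme, the operator set, and the output strings are all assumptions made for illustration, not the authors' actual data format.

```python
import random

# Assumed operator set; the real dataset may use a different one.
OPS = {"+": lambda a, b: a + b, "-": lambda a, b: a - b, "*": lambda a, b: a * b}


def fmt(value):
    """Wrap negative numbers in parentheses so chained expressions stay readable."""
    return f"({value})" if value < 0 else str(value)


def make_task(num_steps, rng):
    """Build one hypothetical task whose answer needs `num_steps` chained steps."""
    if not 1 <= num_steps <= 24:
        raise ValueError("num_steps must be between 1 and 24")
    names = iter("abcdefghijklmnopqrstuvwxyz")
    values = {}
    # Two known starting quantities form the task context.
    for _ in range(2):
        values[next(names)] = rng.randint(1, 20)
    context = [f"{name}={value}" for name, value in values.items()]

    steps = []
    for _ in range(num_steps):
        # Each step combines two already-known variables into a new one.
        left, right = rng.sample(sorted(values), 2)
        op = rng.choice(sorted(OPS))
        target = next(names)
        result = OPS[op](values[left], values[right])
        steps.append(
            f"{target}={left}{op}{right}"
            f"={fmt(values[left])}{op}{fmt(values[right])}={result}"
        )
        values[target] = result

    question = ", ".join(context) + f". {target}=?"
    return {"question": question, "steps": steps, "answer": result}


if __name__ == "__main__":
    task = make_task(3, random.Random(0))
    print(task["question"])
    for step in task["steps"]:
        print(step)
    print("answer:", task["answer"])
```

In a sketch like this, the `steps` list plays the role of the step-by-step reasoning chain a model would be trained to produce, and the final entry ends with the answer to the question.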