ACL2023: Sequence Reducible Holdout Loss for Language Model Pretraining

Sequence Reducible Holdout Loss for Language Model Pretraining

Raghuveer Thirukovalluru, Bhuwan Dhingra, Sam Wiseman

Add to Favorites

The Fourth Workshop on Simple and Efficient Natural Language Processing Long Paper

TLDR:

RocketChat
Abstract

You can open the #paper-SustaiNLP_36 channel in a separate window.

Abstract: