Sequence Reducible Holdout Loss for Language Model Pretraining

Raghuveer Thirukovalluru, Bhuwan Dhingra, Sam Wiseman

The Fourth Workshop on Simple and Efficient Natural Language Processing Long Paper

TLDR:
You can open the #paper-SustaiNLP_36 channel in a separate window.
Abstract: