Sequence Reducible Holdout Loss for Language Model Pretraining
Raghuveer Thirukovalluru, Bhuwan Dhingra, Sam Wiseman
The Fourth Workshop on Simple and Efficient Natural Language Processing Long Paper
TLDR:
You can open the
#paper-SustaiNLP_36
channel in a separate window.
Abstract: