Controllable Text Generation via Probability Density Estimation in the Latent Space

Yuxuan Gu; Xiaocheng Feng; Sicheng Ma; Lingyuan Lingyuan Zhang; Heng Gong; Weihong Zhong; Bing Qin

Controllable Text Generation via Probability Density Estimation in the Latent Space

Yuxuan Gu, Xiaocheng Feng, Sicheng Ma, Lingyuan Lingyuan Zhang, Heng Gong, Weihong Zhong, Bing Qin

📝 Paper

Anthology

Underline 🪧 Poster 📺 Watch Video on Underline Add to Favorites

Main: Generation Main-poster Paper

Session 1: Generation (Virtual Poster)

Conference Room: Pier 7&8

Conference Time: July 10, 11:00-12:30 (EDT) (America/Toronto)

Global Time: July 10, Session 1 (15:00-16:30 UTC)

Keywords: text-to-text generation

TLDR: Previous work on controllable text generation has explored the idea of control from the latent space, such as optimizing a representation with attribute-specific classifiers or sampling one from relevant discrete samples. However, they cannot effectively model a complex space with diverse attributes...

You can open the #paper-P1149 channel in a separate window.

Abstract: Previous work on controllable text generation has explored the idea of control from the latent space, such as optimizing a representation with attribute-specific classifiers or sampling one from relevant discrete samples. However, they cannot effectively model a complex space with diverse attributes, high dimensionality, and asymmetric structure, leaving subsequent controls unsatisfying. In this work, we propose a novel control framework using probability density estimation in the latent space. Our method utilizes an invertible transformation function, the Normalizing Flow, that maps the complex distributions in the latent space to simple Gaussian distributions in the prior space. Thus, we can perform sophisticated and flexible controls in the prior space and feed the control effects back into the latent space owing to the bijection property of invertible transformations. Experiments on single-attribute and multi-attribute control reveal that our method outperforms several strong baselines on attribute relevance and text quality, achieving a new SOTA. Further analysis of control strength adjustment demonstrates the flexibility of our control strategy.