CLIP-based image captioning via unsupervised cycle-consistency in the latent space
Romain Bielawski, Rufin VanRullen
The 8th Workshop on Representation Learning for NLP (RepL4NLP 2023) Long Paper
TLDR:
You can open the
#paper-ACL_3566
channel in a separate window.
Abstract: