[Industry] Weighted Contrastive Learning With False Negative Control to Help Long-tailed Product Classification

Tianqi Wang; Lei Chen; Xiaodan Zhu; Younghun Lee; Jing Gao

[Industry] Weighted Contrastive Learning With False Negative Control to Help Long-tailed Product Classification

Tianqi Wang, Lei Chen, Xiaodan Zhu, Younghun Lee, Jing Gao

📝 Paper

Anthology

Underline 📺 Watch Video on Underline Add to Favorites

Industry: Industry Industry Paper

Session 5: Industry (Poster)

Conference Room: Frontenac Ballroom and Queen's Quay

Conference Time: July 11, 16:15-17:45 (EDT) (America/Toronto)

Global Time: July 11, Session 5 (20:15-21:45 UTC)

TLDR: Item categorization (IC) aims to classify product descriptions into leaf nodes in a categorical taxonomy, which is a key technology used in a wide range of applications. Along with the fact that most datasets often has a long-tailed distribution, classification performances on tail labels tend to be...

You can open the #paper-I146 channel in a separate window.

Abstract: Item categorization (IC) aims to classify product descriptions into leaf nodes in a categorical taxonomy, which is a key technology used in a wide range of applications. Along with the fact that most datasets often has a long-tailed distribution, classification performances on tail labels tend to be poor due to scarce supervision, causing many issues in real-life applications. To address IC task's long-tail issue, K-positive contrastive loss (KCL) is proposed on image classification task and can be applied on the IC task when using text-based contrastive learning, e.g., SimCSE. However, one shortcoming of using KCL has been neglected in previous research: false negative (FN) instances may harm the KCL's representation learning. To address the FN issue in the KCL, we proposed to re-weight the positive pairs in the KCL loss with a regularization that the sum of weights should be constrained to K+1 as close as possible. After controlling FN instances with the proposed method, IC performance has been further improved and is superior to other LT-addressing methods.