[SRW] SWEET: Weakly Supervised Person Name Extraction for Fighting Human Trafficking

Javin Liu, Peter Yu, Vidya Sujaya, Pratheeksha Nair, Kellin Pelrine, Reihaneh Rabbany

Student Research Workshop Srw Paper

Session 7: Student Research Workshop (Poster)
Conference Room: Frontenac Ballroom and Queen's Quay
Conference Time: July 12, 11:00-12:30 (EDT) (America/Toronto)
Global Time: July 12, Session 7 (15:00-16:30 UTC)
TLDR: We propose a weak supervision pipeline SWEET for extracting person names, which combines simple antirules with multiple large language models fine-tuned on public NER benchmark datasets as well as a domain-specific dataset labelled with ChatGPT. The proposed training process addresses the major chal...
You can open the #paper-S122 channel in a separate window.
Abstract: We propose a weak supervision pipeline SWEET for extracting person names, which combines simple antirules with multiple large language models fine-tuned on public NER benchmark datasets as well as a domain-specific dataset labelled with ChatGPT. The proposed training process addresses the major challenge of lack of training data in this domain, by effectively aggregating multiple weak signals. The proposed method SWEET outperforms the previous supervised state-of-the-art method for this task by 10% F1 score and better generalizes to the benchmark datasets.