ROZAM at SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis

Mohammadmostafa Rostamkhani, Ghazal Zamaninejad, Sauleh Eetemadi

The 17th International Workshop on Semantic Evaluation (SemEval-2023) Task 9: multilingual tweet intimacy analysis Paper

TLDR: We build a model using large multilingual pretrained language model XLM-T for regression task and fine-tune it on the MINT (Multilingual INTmacy) analysis dataset which covers 6 languages for training and 4 languages for testing zero-shot performance of the model. The dataset was annotated and the a
You can open the #paper-SemEval_305 channel in a separate window.
Abstract: We build a model using large multilingual pretrained language model XLM-T for regression task and fine-tune it on the MINT (Multilingual INTmacy) analysis dataset which covers 6 languages for training and 4 languages for testing zero-shot performance of the model. The dataset was annotated and the annotations are intimacy scores. We experiment with several deep learning architectures to predict intimacy score. To achieve optimal performance we modify several model settings including loss function, number and type of layers. In total, we ran 16 end-to-end experiments. Our best system achieved a Pearson Correlation score of 0.52.