Back to top

Semantic Label Representations with Lbl2Vec - A Similarity-Based Approach for Unsupervised Text Classification

Last modified Jan 18, 2023

In this paper, we evaluate the Lbl2Vec approach for unsupervised text document classification. Lbl2Vec requires only a small number of keywords describing the respective classes to create semantic label representations. For classification, Lbl2Vec uses cosine similarities between label and document representations, but no annotation information. We show that Lbl2Vec significantly outperforms common unsupervised text classification approaches and a widely used zero-shot text classification approach. Furthermore, we show that using more precise keywords can significantly improve the classification results of similarity-based text classification approaches.

Files and Subpages

Name Type Size Last Modification Last Editor
Semantic Label Representations with Lbl2Vec.pdf 1,57 MB 18.01.2023