In this paper, we evaluate the Lbl2Vec approach for unsupervised text document classification. Lbl2Vec requires only a small number of keywords describing the respective classes to create semantic label representations. For classification, Lbl2Vec uses cosine similarities between label and document representations, but no annotation information. We show that Lbl2Vec significantly outperforms common unsupervised text classification approaches and a widely used zero-shot text classification approach. Furthermore, we show that using more precise keywords can significantly improve the classification results of similarity-based text classification approaches.
Conference (0)
Name | Type | Size | Last Modification | Last Editor |
---|---|---|---|---|
Semantic Label Representations with Lbl2Vec.pdf | 1,57 MB | 18.01.2023 |