Extracting Topics From a Tv Channel's Facebook Page Using Contextualized Document Embedding

Habbat, N.; Anoun, H.; Hassouni, L.

Published in

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, (XLVI-4/W5-2021), p. 245-249, 2021

DOI: 10.5194/isprs-archives-xlvi-4-w5-2021-245-2021

Tools

Export citation

Search in Google Scholar

Extracting Topics From a Tv Channel's Facebook Page Using Contextualized Document Embedding

Journal article published in 2021 by N. Habbat, H. Anoun, L. Hassouni

This paper was not found in any repository; the policy of its publisher is unknown or unclear.

Full text: Unavailable

Preprint: policy unknown

Upload

Postprint: policy unknown

Upload

Published version: policy unknown

Upload

Abstract

Topic models extract meaningful words from text collection, allowing for a better understanding of data. However, the results are often not coherent enough, and thus harder to interpret. Adding more contextual knowledge to the model can enhance coherence. In recent years, neural network-based topic models become available, and the development level of the neural model has developed thanks to BERT-based representation. In this study, we suggest a model extract news on the Aljazeera Facebook page. Our approach combines the neural model (ProdLDA) and the Arabic Pre-training BERT transformer model (AraBERT). Therefore, the proposed model produces more expressive and consistent topics than ELMO using different topic model algorithms (ProdLDA and LDA) with 0.883 in topic coherence.

Published in

Links

Tools

Extracting Topics From a Tv Channel's Facebook Page Using Contextualized Document Embedding

Abstract