Published in

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, (XLVI-4/W5-2021), p. 245-249, 2021

DOI: 10.5194/isprs-archives-xlvi-4-w5-2021-245-2021

Links

Tools

Export citation

Search in Google Scholar

Extracting Topics From a Tv Channel's Facebook Page Using Contextualized Document Embedding

Journal article published in 2021 by N. Habbat, H. Anoun, L. Hassouni
This paper was not found in any repository; the policy of its publisher is unknown or unclear.
This paper was not found in any repository; the policy of its publisher is unknown or unclear.

Full text: Unavailable

Question mark in circle
Preprint: policy unknown
Question mark in circle
Postprint: policy unknown
Question mark in circle
Published version: policy unknown

Abstract

Topic models extract meaningful words from text collection, allowing for a better understanding of data. However, the results are often not coherent enough, and thus harder to interpret. Adding more contextual knowledge to the model can enhance coherence. In recent years, neural network-based topic models become available, and the development level of the neural model has developed thanks to BERT-based representation. In this study, we suggest a model extract news on the Aljazeera Facebook page. Our approach combines the neural model (ProdLDA) and the Arabic Pre-training BERT transformer model (AraBERT). Therefore, the proposed model produces more expressive and consistent topics than ELMO using different topic model algorithms (ProdLDA and LDA) with 0.883 in topic coherence.

Beta version