|国家预印本平台
首页|TeleScope: A Longitudinal Dataset for Investigating Online Discourse and Information Interaction on Telegram

TeleScope: A Longitudinal Dataset for Investigating Online Discourse and Information Interaction on Telegram

TeleScope: A Longitudinal Dataset for Investigating Online Discourse and Information Interaction on Telegram

来源:Arxiv_logoArxiv
英文摘要

Telegram is a globally popular instant messaging platform known for its strong emphasis on security, privacy, and unique social networking features. It has recently emerged as the host for various cross-domain analysis and research works, such as social media influence, propaganda studies, and extremism. This paper introduces TeleScope, an extensive dataset suite that, to our knowledge, is the largest of its kind. It comprises metadata for about 500K Telegram channels and downloaded message metadata for about 71K public channels, accounting for around 120M crawled messages. We also release channel connections and user interaction data built using Telegram's message-forwarding feature to study multiple use cases, such as information spread and message forwarding patterns. In addition, we provide data enrichments, such as language detection, active message posting periods for each channel, and Telegram entities extracted from messages, that enable online discourse analysis beyond what is possible with the original data alone. The dataset is designed for diverse applications, independent of specific research objectives, and sufficiently versatile to facilitate the replication of social media studies comparable to those conducted on platforms like X (formerly Twitter)

Susmita Gangopadhyay、Danilo Dessi、Dimitar Dimitrov、Stefan Dietze

通信无线通信

Susmita Gangopadhyay,Danilo Dessi,Dimitar Dimitrov,Stefan Dietze.TeleScope: A Longitudinal Dataset for Investigating Online Discourse and Information Interaction on Telegram[EB/OL].(2025-04-28)[2025-06-22].https://arxiv.org/abs/2504.19536.点此复制

评论