Topological data mapping of online hate speech, misinformation, and general mental health: A large language model based study.

Journal: PLOS digital health

Published Date: Jul 29, 2025

Abstract

The advent of social media has led to an increased concern over its potential to propagate hate speech and misinformation, which, in addition to contributing to prejudice and discrimination, has been suspected of playing a role in increasing social violence and crimes in the United States. While literature has shown the existence of an association between posting hate speech and misinformation online and certain personality traits of posters, the general relationship and relevance of online hate speech/misinformation in the context of overall psychological wellbeing of posters remain elusive. One difficulty lies in finding data analytics tools capable of adequately analyzing the massive amount of social media posts to uncover the underlying hidden links. Machine learning and large language models such as ChatGPT make such an analysis possible. In this study, we collected thousands of posts from carefully selected communities on the social media site Reddit. We then utilized OpenAI's GPT3 to derive embeddings of these posts, which are high-dimensional real-numbered vectors that presumably represent the hidden semantics of posts. We then performed various machine-learning classifications based on these embeddings in order to identify potential similarities between hate speech/misinformation speech patterns and those of various communities. Finally, a topological data analysis (TDA) was applied to the embeddings to obtain a visual map connecting online hate speech, misinformation, various psychiatric disorders, and general mental health.

Authors

Andrew William Alexander

College of Medicine, Texas A&M University, Houston, Texas, United States of America.
Hongbin Wang

College of Computer and Cyber Security, Hebei Normal University, Shijiazhuang, China.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40729350)

Topological data mapping of online hate speech, misinformation, and general mental health: A large language model based study.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Topological data mapping of online hate speech, misinformation, and general mental health: A large language model based study.

Abstract

Authors

Keywords

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals