Identifying self-disclosed anxiety on Twitter: A natural language processing approach

Zarate, Daniel; Ball, Michelle; Prokofieva, Maria; Kostakos, Vassilis; Stavropoulos, Vasileios

Identifying self-disclosed anxiety on Twitter: A natural language processing approach

Download

[thumbnail of 1-s2.0-S0165178123005292-main.pdf]

Preview

1-s2.0-S0165178123005292-main.pdf - Published Version (1MB) | Preview

Available under license: Creative Commons Attribution

Export

Zarate, Daniel ORCID: https://orcid.org/0000-0002-1508-8637, Ball, Michelle ORCID: https://orcid.org/0000-0002-4056-6178, Prokofieva, Maria ORCID: https://orcid.org/0000-0003-1974-3827, Kostakos, Vassilis and Stavropoulos, Vasileios ORCID: https://orcid.org/0000-0001-6964-4662 (2023) Identifying self-disclosed anxiety on Twitter: A natural language processing approach. Psychiatry Research, 330. ISSN 0165-1781

Abstract

Background: Text analyses of social media posts are a promising source of mental health information. This study used natural language processing to explore distinct language patterns on Twitter related to self-reported anxiety diagnosis. Methods: A total of 233.000 tweets made by 605 users (300 reporting anxiety diagnosis and 305 not) over six months were comparatively analysed, considering user behavior, Linguistic Inquiry Word Count (LIWC), and sentiment analysis. Twitter users with a self-disclosed diagnosis of anxiety were classified as ‘anxious’ to facilitate group comparisons. Results: Supervised machine learning models showed a high prediction accuracy (Naïve Bayes 81.1 %, Random Forests 79.8 %, and LASSO-regression 79.4 %) in identifying Twitter users’ self-disclosed diagnosis of anxiety. Additionally, a Latent Profile Analysis (LPA) identified four profiles characterized by high sentiment (31 % anxious participants), low sentiment (68 % anxious), self-immersed (80 % anxious), and normative behavior (38 % anxious). Conclusion: The digital footprint of self-disclosed anxiety on Twitter posts presented a high frequency of words conveying either negative sentiment, a low frequency of positive sentiment, a reduced frequency of posting, and lengthier texts. These distinct patterns enabled highly accurate prediction of anxiety diagnosis. On this basis, appropriately resourced, awareness raising, online mental health campaigns are advocated.

Dimensions Badge

Altmetric Badge

View Altmetric information about this item.

Item type	Article
URI	https://vuir.vu.edu.au/id/eprint/47474
DOI	10.1016/j.psychres.2023.115579
Official URL	https://www.sciencedirect.com/science/article/pii/...
Subjects	Current > FOR (2020) Classification > 4611 Machine learning Current > FOR (2020) Classification > 5203 Clinical and health psychology Current > Division/Research > Institute for Health and Sport
Keywords	anxiety; cyber-phenotype; digital footprint; natural language processing; sentiment analysis; Twitter
Download/View statistics	View download statistics for this item

Search Google Scholar

Repository staff login

CORE (COnnecting REpositories)

VU Institutional Repository

Find repository resources

Identifying self-disclosed anxiety on Twitter: A natural language processing approach

Abstract

Dimensions Badge

Altmetric Badge

Useful links

Indexed by

Acknowledgement of Country

VU Institutional Repository

Find repository resources

Search the VU Institutional Repository

Identifying self-disclosed anxiety on Twitter: A natural language processing approach

Abstract

Dimensions Badge

Altmetric Badge