Text-Based Detection of the Risk of Depression

Authors	HAVIGEROVÁ Jana Marie HAVIGER Jiří KUČERA Dalibor HOFFMANNOVÁ Petra
Year of publication	2019
Type	Article in Periodical
Magazine / Source	Frontiers in Psychology
MU Faculty or unit	Faculty of Arts
Citation
web	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6431661/
Doi	http://dx.doi.org/10.3389/fpsyg.2019.00513
Keywords	depression; genre; morphology; quantitative linguistics; predictive model
Description	This study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents' linguistic fingerprints were traced in four texts of different genres. Depressivity was measured using the Depression, Anxiety and Stress Scale (DASS-21). The research sample (N = 172, 83 men, 89 women) was created by quota sampling an adult Czech population. Morphological variables of the texts showing differences (M-W test) between the non-depressive and depressive groups were incorporated into predictive models. Results: Across all participants, the data best fit predictive models of depressivity using morphological characteristics from the informal text "letter from holidays" (Nagelkerke r(2) = 0.526 for men and 0.670 for women). For men, models for the formal texts "cover letter" and "complaint" showed moderate fit with the data (r(2) = 0.479 and 0.435). The constructed models show weak to substantial recall (0.235 - 0.800) and moderate to substantial precision (0.571 - 0.889). Morphological variables appearing in the final models vary. There are no key morphological characteristics suitable for all models or for all genres. The resulting models' properties demonstrate that they should be suitable for screening individuals at risk of depression and the most suitable genre is informal text ("letter from holidays").