Bozorgmehr, Arezoo: From Limited Data to Meaningful Insights : Two studies on chronic stress prediction using machine learning. - Bonn, 2024. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn.
Online-Ausgabe in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5-74487
@phdthesis{handle:20.500.11811/11291,
urn: https://nbn-resolving.org/urn:nbn:de:hbz:5-74487,
doi: https://doi.org/10.48565/bonndoc-214,
author = {{Arezoo Bozorgmehr}},
title = {From Limited Data to Meaningful Insights : Two studies on chronic stress prediction using machine learning},
school = {Rheinische Friedrich-Wilhelms-Universität Bonn},
year = 2024,
month = feb,

note = {Background: Chronic stress is widespread and adversely affects mental and physical health. The two studies in this dissertation used machine learning to predict chronic stress and identify its risk as well as protective factors. The first study examined workplace factors in German general practice assistants, while the second developed an interpretable model using a national dataset from the Robert Koch Institute to identify protective factors for improved well-being.
Methods: The first study analyzed 550 general practice assistants comparing 4 machine learning classifiers (random forest, support vector machine, K-nearest neighbors, and artificial neural network) and logistic regression to predict chronic stress. The model performance was evaluated using metrics such as the area under the curve (AUC), sensitivity, and positive predictive value. The second study investigated chronic stress in the German population using data from 5,801 in the representative DEGS1 study. Multiclass classification with extreme gradient boosting (XGBoost) and Shapley additive explanations (SHAP) was employed to predict stress levels and especially identify protective factors. The evaluation metrics included the area under the curve (AUC), precision, recall, and F1-score, which were averaged using the macro average.
Results: In the first study, machine learning models outperformed logistic regression in predicting chronic stress among practice assistants. The random forest model achieved the highest AUC of 84 %. Work characteristics were identified as relevant factors. In the second study, the XGBoost model obtained an AUC of 81 %, a precision of 63 %, a recall of 52 %, a specificity of 78 %, and a F1-score of 54 %. The study identified several protective factors against chronic stress like satisfaction with living space, strong social support, being male, and age = 42 years.
Conclusion: Machine learning classifiers, specifically ensemble methods, demonstrated superior performance compared to logistic regression and other ML approaches in predicting chronic stress even with small data sets. Aiming at reliable feature importance detection, the SHAP technique identified significant protective factors that should be taken into account when designing interventions to alleviate chronic stress.},

url = {https://hdl.handle.net/20.500.11811/11291}
}

The following license files are associated with this item:

InCopyright