Popis: |
The aim of this study is to analyse the determinants of women's vaginal dryness using machine learning. Data came from Korea University Anam Hospital in Seoul, Republic of Korea, with 3298 women, aged 40-80 years, who attended their general health check from January 2010 to December 2012. Five machine learning methods were applied and compared for the prediction of vaginal dryness, measured by a Menopause Rating Scale. Random forest variable importance, a performance gap between a complete model and a model excluding a certain variable, was adopted for identifying major determinants of vaginal dryness. In terms of the mean squared error, the random forest (1.0597) was much better than linear regression (17.9043) and artificial neural networks with one, two and three hidden layers (1.7452, 1.7148 and 1.7736, respectively). Based on random forest variable importance, the top-10 determinants of vaginal dryness were menopause age, age, menopause, height, thyroid stimulating hormone, neutrophils, years since menopause, lymphocytes, alkaline phosphatase and blood urea nitrogen. In addition, its top-20 determinants were peak expiratory flow rate, low-density lipoprotein cholesterol, white blood cells, monocytes, cancer antigen 19-9, creatinine, eosinophils, total cholesterol, triglyceride and amylase. Machine learning presents a great decision support system for the prediction of vaginal dryness. For preventing vaginal dryness, preventive measures would be needed regarding early menopause, the thyroid function and systematic inflammation.Impact Statement |