Methods: The study uses data collected from the Homelessness Screening Clinical Reminder (HSCR), a 2-question screener to identify homelessness and imminent risk of homelessness among all Veterans accessing Veterans’ Health Administration outpatient services. Among a sample of 104,312 individuals who screened positive for homelessness risk in 2012, we use prior homelessness and housing status, medical and behavioral health records, VA benefits eligibility, and demographic data to predict homelessness status at subsequent rescreening. Forecasts are made using two methods – logistic regression and random forest, a machine learning classification and forecasting algorithm – and compared.
Results: The random forest algorithm forecasts homelessness with significantly greater accuracy than logistic regression: the algorithm produces a higher rate of true positives and a lower rate of false positives. Findings have broad implications for homelessness prevention at the Department of Veterans Affairs and beyond.
Conclusions and Implications: By using machine learning forecasts to allocate resources, the VA and community-based providers can substantially improve their targeting, increasing shelter savings while reducing costs for false positives. Through more efficient allocation, programs can make better use of existing resources and have a stronger argument on which to advocate for additional funds.