Methods: The current study used a mixed methods approach to develop and test a set of computer models to automate the coding of investigation summaries for our target construct. To start, four expert human coders labeled a collection of child welfare investigation summaries (N = 1,402) for the presence (DV+) or absence (DV-) of an active domestic violence service need. These labeled documents were then used to develop a set of text mining and machine learning models, and to test their accuracy and reliability. We first developed a rules-based text mining model that relied on the use of an expert dictionary and sentiment analysis to classify documents as DV+ or DV-. We then develop more advanced machine learning models that used a k-nearest neighbor algorithm to perform the coding task. Accuracy and reliability for all models were determined by comparing computer classifications to those of expert human coders.
Results: The machine learning models achieved greater than 90% accuracy in the classification of documents when compared to the classification decisions of expert human coders. Fleiss kappa estimates of coding reliability between the top-performing model and expert human coders exceeded .80, suggesting that system administrators, researchers and evaluators could confidently deploy our model to bring this task to scale, rapidly classifying their entire population of documents for the presence or absence of a domestic violence service need.
Conclusions and Implications: The results provide strong evidence that text mining and machine learning procedures can be a cost-effective solution for extracting meaningful insights from unstructured text data. While not suitable for case-level predictive analytics, the insights derived from these procedures can be particularly useful for investigating the prevalence, temporal trends and geographic distribution of domestic violence-related needs in the child welfare system. These methods have the potential to substantially enhance the use of unstructured text data in social work research and evaluation.