Methods: Data were from the National Survey of Child and Adolescent Wellbeing-Third Cohort (NSCAW-III), a nationally representative dataset of children and families impacted by the child welfare system. The sample included cases with valid birth father-child contact data (N = 2,380). The outcome was father-child contact in the past 12 months. A total of 124 predictors at multiple system levels—individual (e.g., father, child, caseworker), family, neighborhood, child welfare system—were included. Random forest—a machine learning approach—was employed for the main analysis. Correlations were conducted to explore directionality between predictors and the outcome.
Results: Fathers in the sample were socioeconomically disadvantaged. Most had a high school education or less (88%), were unemployed (67%), and were non-resident with their children (92%). They were racially and ethnically diverse (36% Black, 33% White, 25% Hispanic, 5% other). The mean age of the children was 6.55 years (SD = 5.91). Most families (70%) reported father-child contact in the past 12 months. Random forest results showed that among the top 20 predictors, fathers’ sociodemographic characteristics were particularly important. For example, missing data on fathers’ race/ethnicity (importance score = 1.00), fathers being White (importance score = 0.84), and fathers having an associate’s degree (importance score = 0.57) were some of the most important predictors of father-child contact. Additional key predictors included caseworkers’ ability to speak another language (importance score = 0.66), child welfare systems’ referral of families to health services (importance score = 0.61), child being a girl (importance score = 0.49), and families' access to food (importance score = 0.45). Top predictors generally showed weak negative correlations with father-child contact.
Conclusion and Implications: Factors at different system levels—individual, family, and child welfare system—interact in complex ways to predict father-child contact, suggesting the need for multisystemic interventions to connect fathers with their children. The prominence of certain predictors, including missing race/ethnicity data, also highlights underlying reporting challenges or measurement gaps in child welfare data. Applying machine learning to child welfare data has advantages over traditional methods, including the ability to handle large datasets and model complex, interactive, and non-linear relationships. These capabilities can help uncover hidden patterns that may better inform child welfare practice and policy.
![[ Visit Client Website ]](images/banner.gif)