To accurately determine the amount the federal government spends addressing homelessness, we reviewed federal program descriptions using the CFDA, looking for any mention of keywords related to homelessness.
We developed a ranking system to assess whether individuals experiencing homelessness would directly benefit from these programs, and categorized these programs by the type of service they provide. To learn more about the development of our ranking and categorization systems visit our Github page.
For example, the Homeless Veterans Reintegration Program provides services to homeless veterans looking for jobs, which we classified as employment related. We presented these programs to the U.S. Interagency Council on Homelessness, who recommended that we remove several programs where the majority of funds went to beneficiaries other than individuals experiencing homelessness.
We used this dataset to identify all grant spending from USAspending.gov for the programs we determined to have a direct relation to homelessness. As part of our analysis, we identified three categories of grant awards in USAspending.gov data that are relevant to our work:
To create our visualization, we mapped funding amounts for the programs that fell within a Continuum of Care area using mapping files provided by HUD. The data used in this story was updated as of November 2019.
Most CoC areas are based on the geographic boundaries of one or more counties. Where that is true for counties, the data for counties in the CoC areas were added together to get the total for the CoC area. In instances where boundaries were determined by cities or cities and counties, the data for the city was subtracted from the surrounding county and the county/city data was allocated to the correct CoC area.
The lowest geographic area for income was county. For this reason both counties and cities were linked to the data for their county. Using the population data previously described, we used a weighted average based on median lowest quintile income of the geographic area that make up each CoC area, based on the population of each area.
The data for these metrics are based upon geographic areas created by HHS. Like CoC areas, these areas are largely based upon counties. We linked these geographic areas to counties and created a weighted average for each CoC area based upon population.
This data was linked to the HUD PIT Count and Housing Inventory Count, as well as the total funding data.
To find the optimal number of clusters to target for this analysis we used a method that looks at the percentage of variance explained as a function of the number of clusters. The optimal value comes when the marginal gain for adding more clusters begins to level off. We chose five clusters as our target.
We tested several unsupervised clustering algorithms, the method we employed was Fuzzy C Means, a data clustering technique which allows one piece of data to belong to two or more clusters.
The clusters were then further segmented by population density, in an attempt to manage the size of the final clusters, but also to visualize areas with similar behavior and conditions for comparison.
Area: Measured in square miles of land area (not including water area); data from 2010 and CoC totals have been updated to reflect 2018 CoC area alignment.
Population: Estimated 2018 data
Median Rent: Estimated median gross rent per month
Income: 2017 annual income estimates by quintile; we used the lowest quintile
Drug Use: Illicit drug use in the past month among individuals 12 or over, percentages; data from 2016
Illicit Drugs include, marijuana/hashish, cocaine (including crack), heroin, hallucinogens, inhalants, or prescription-type psychotherapeutics used non-medically, including data from original methamphetamine questions, but not including new methamphetamine items added in 2005 and 2006.
Alcohol Dependence: Alcohol dependence or abuse in the past year among individuals age 12 or older, percentages; data from 2016
Alcohol dependence or abuse is based on definitions found in the fourth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV).
Mental Illness: Serious mental illness in the past year among adults aged 18 or older, percentages; data from 2016
Serious mental illness (SMI) is defined as having a diagnosable mental, behavioral, or emotional disorder, other than a developmental or substance use disorder, as assessed by the Mental Health Surveillance Study (MHSS) Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders-Fourth Edition-Research Version-Axis I Disorders (MHSS-SCID), which is based on the DSM-IV. SMI included individuals with diagnoses resulting in serious functional impairment. For details, see Section B in the "2011-2012 National Survey on Drug Use and Health: Guide to State Tables and Summary of Small Area Estimation Methodology" at http://www.samhsa.gov/data/.
Notes on the Maps: Since the shape file used in the visualization was created, several Continuum of Care's have merged with the balance of state Continuum of Care. These Continuum of Care's are: WA-507, which is now a part of WA-501, and LA-508, which is now a part of LA-509. We edited the shape file so that the hover over and color for these Continuum of Care's reflect the new information, but the boundaries for the old Continuum of Care are still included in the map.