The California Department of Public Health detected a novel infectious respiratory disease outbreak in California between May to December 2023, and collected information about the number of cases and case severity, along with demographic information on infected individuals. This report aims to examine the course of this outbreak and understand if it disproportionately affected certain demographic or geographic populations. In particular, race/ethnicity, county, and age factors are examined to identify populations who may benefit most from prevention and treatment resources.
This data source is simulated data of weekly infectious disease cases for each county in California reported from public health agencies and organizations such as county health departments during 2023, beginning in late May 2023 until the end of December 2023. The infection data are linked with demographic information such as age group, binary gender, race and ethnicity.
This is a simulated dataset containing reported weekly cases of a disease categorized by date of diagnoses, patient demographics, and cumulative totals for infected, unrecovered, and severe cases, for the county of Los Angeles. Such data would have been collected by public health agencies from around LA county. Data was collected from late May 2023 until the end of December 2023. The data provides patient demographics of age group, binary gender, race and ethnicity which will help to answer if there are disparities in the rate of infection among different populations.
This dataset is simulated, but approximates what population data from the State of California might look like. It includes population estimates by the CA Dept of Finance for 2023 by CA county and demographic categories (age, race, and sex).
First, the two disease infection datasets were joined together (Source 1 & 2) simply by binding the rows (added the rows from both datasets together). This generated a joined list of infection data for all counties in California.
Next, strata of interest were identified. Since we’re interested in the distribution of infections across race and geographic categories, the rows were grouped by county and then by race/ethnicity. Counts of the infections in each stratum were then summed up to give a cumulative case count over the entire period of data collection (May to Dec 2023). Since both datasets have information on both 1. new infections and 2. new severe infections in separate columns, a sum for each of these two columns was obtained per stratum.
This stratified summary of cumulative new and severe case counts was left joined with the California population dataset, using county and race_ethnicity categories as keys. The resulting table shows the counts of cumulative new infections, new severe infections, and total population count for each stratum.
To calculate the cumulative incidence of new and severe cases per 100 individuals in each stratified demographic group, the count new or severe cases was divided by the population count for that group, and multiplied by 100.
To obtain weekly incidence rate information, we went back to the joined infection data (Source 1 & 2) and then grouped by age category and diagnosis dates (diagnosis dates were measured once a week). The weekly new case and severe case counts were then summed up to give a total count for each week per age group. The population dataset (Source 3) was likewise grouped by age category, and then included in a left join with the stratified weekly case data. Weekly cases per 1000 individuals in each age group was calculated by dividing the weekly case count of each age group by the population of that age group, and multiplied by 1000.
Cumulative incidence of new cases, ignoring ethnicity differences in each county, was mapped. The map shows that Imperial County has the highest cumulative incidence of all the counties. Examining the granular data for Imperial county in the table below, the highest incidence was seen in the Black, Non-Hispanic population in this county, with a cumulative incidence of 66.64 cases/100 individuals. Cumulative incidence among White, Non-Hispanic population in this county was also close behind, with 65.93 cases/100 individuals. In contrast, the coastal counties generally have lower cumulative incidence of new infections.
American Indian or Alaska Native (Non-Hispanic) and White (Non-Hispanic) racial and ethnic groups have the highest cumulative incidence rates of new infections (newly infected individuals over the population of that racial/ethnic category in each county) in all of California during the outbreak from May to December of 2023. Specifically, the highest per capita rates of new infections for these racial and ethnic groups are in Imperial County.
Observing the overall trends for weekly infections, it is evident that weekly new infections as well as severe infections seem to follow similar patterns. New infections begin to take off after the week of Jun 12, 2023, and rise steeply until the week of Aug 28, and remain high until week of Oct 18, before steeply dropping off and returning to baseline by week of Nov 20. Weekly severe infections follow the same shape, but trail behind new infections by 2 weeks, so severe case rates do not drop off until Oct 30, 2023 and only reach baseline the week of Dec 4.
When separated by age group, the rate of weekly new infections follow the same shape for all age groups, but are the highest among those aged 65+, followed by those aged 18-49. The weekly severe infections, however, show that the weekly incidence of severe cases is extremely high in the older population, when compared with all other age groups.
The collected infection data provide insight into the populations most at risk from a novel respiratory infection and most likely to benefit from prevention and mitigation strategies.
In particular, Imperial County appears to be most impacted by the infection, as it demonstrated the highest cumulative incidence among all counties. And among the various populations in Imperial County, the Black, Non-Hispanic population experienced the highest incidence rate of new infections, highlighting this vulnerable population for intervention.
More widely across the entire state, however, American Indian or Alaska Native (Non-Hispanic) and White (Non-Hispanic) racial and ethnic groups are also heavily impacted by the outbreak, indicating that resources should be invested to prevent and treat disease in these populations.
Lastly, the results show that severe disease disproportionately occurs in older adults aged 65+. Given the burden of severe disease is likely to be linked with mortality from infection, investment in protecting this group may be more significant in reducing deaths overall.