ARCHIVED: COVID-19 Cases by Population Characteristics Over Time

data.sfgov.org | Last Updated 22 Jan 2024

<strong>A. SUMMARY</strong> This archived dataset includes data for population characteristics that are no longer being reported publicly. The date on which each population characteristic type was archived can be found in the field “data_loaded_at”. <strong>B. HOW THE DATASET IS CREATED</strong> Data on the population characteristics of COVID-19 cases are from:  * Case interviews  * Laboratories  * Medical providers    These multiple streams of data are merged, deduplicated, and undergo data verification processes.   <u> Race/ethnicity</u> * We include all race/ethnicity categories that are collected for COVID-19 cases. * The population estimates for the "Other" or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups. <u> Gender</u> * The City collects information on gender identity using <a href="https://www.sfdph.org/dph/files/PoliciesProcedures/COM5_SexGenderGuidelines.pdf">these guidelines</a>. <u>Skilled Nursing Facility (SNF) occupancy</u> * A Skilled Nursing Facility (SNF) is a type of long-term care facility that provides care to individuals, generally in their 60s and older, who need functional assistance in their daily lives.  * This dataset includes data for COVID-19 cases reported in Skilled Nursing Facilities (SNFs) through 12/31/2022, archived on 1/5/2023. These data were identified where “Characteristic_Type” = ‘Skilled Nursing Facility Occupancy’. <u>Sexual orientation</u> * The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable. * The City doesn’t collect or report information about sexual orientation for persons under 12 years of age. * Case investigation interviews transitioned to the California Department of Public Health, Virtual Assistant information gathering beginning December 2021. The Virtual Assistant is only sent to adults who are 18+ years old. <a href = "https://www.sfdph.org/dph/files/PoliciesProcedures/COM9_SexualOrientationGuidelines.pdf">Learn more about our data collection guidelines pertaining to sexual orientation</a>. <u>Comorbidities</u> * Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death. <u>Homelessness</u> Persons are identified as homeless based on several data sources: * self-reported living situation * the location at the time of testing * Department of Public Health homelessness and health databases * Residents in Single-Room Occupancy hotels are not included in these figures. These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions. <u>Single Room Occupancy (SRO) tenancy</u> * SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces. * The details of a person's living arrangements are verified during case interviews. <u>Transmission Type</u> * Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown. <strong>C. UPDATE PROCESS</strong> This dataset has been archived and will no longer update as of 9/11/2023. <strong>D. HOW TO USE THIS DATASET</strong> Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a <a href="https://data.sfgov.org/d/cedd-86uf">view based on the San Francisco Population and Demographic Census dataset</a>. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cases on each date. New cases are the count of cases within that characteristic group where the positive tests were collected on that specific specimen collection date. Cumulative cases are the running total of all San Francisco cases in that characteristic group up to the specimen collection date listed. This data may not be immediately available for recently reported cases. Data updates as more information becomes available. To explore data on the total number of cases, use <u><a href="https://data.sfgov.org/COVID-19/COVID-19-Cases-Over-Time/gyr2-k29z">the ARCHIVED: COVID-19 Cases Over Time dataset</a></u>. <strong>E. CHANGE LOG</strong> <UL><LI>9/11/2023 - data on COVID-19 cases by population characteristics over time are no longer being updated. The date on which each population characteristic type was archived can be found in the field “data_loaded_at”. <LI>6/6/2023 - data on cases by transmission type have been removed. See section ARCHIVED DATA for more detail. <LI>5/16/2023 - data on cases by sexual orientation, comorbidities, homelessness, and single room occupancy have been removed. See section ARCHIVED DATA for more detail. <LI>4/6/2023 - the State implemented system updates to improve the integrity of historical data. <LI>2/21/2023 - system updates to improve reliability and accuracy of cases data were implemented. <LI>1/31/2023 - updated “population_estimate” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates. <LI>1/5/2023 - data on SNF cases removed. See section ARCHIVED DATA for more detail. <LI>3/23/2022 - ‘Native American’ changed to ‘American Indian or Alaska Native’ to align with the census. <LI>1/22/2022 - system updates to improve timeliness and accuracy of cases and deaths data were implemented. <LI>7/15/2022 - reinfections added to cases dataset. See section SUMMARY for more information on how reinfections are identified.</UL>

This dataset has the following 9 columns:

Column NameAPI Column NameData TypeDescriptionSample Values
specimen_collection_datespecimen_collection_datecalendar_dateThe date the positive test was collected (the date the person was tested)
characteristic_typecharacteristic_typetextOverall topic area for a given population characteristic. These are subgroups or demographic cross-sections, like age
characteristic_groupcharacteristic_grouptextEach group or category within a characteristic type or topic area. ex 0-4 yrs, 5-10 yrs
characteristic_group_sort_ordercharacteristic_group_sort_ordernumberSort order of characteristic group to aid in visualizing data
population_estimatepopulation_estimatenumber2016-2020 5-year American Community Survey (ACS) population estimate for a given characteristic type and characteristic group
new_casesnew_casesnumberCases are counted as confirmed on the date of specimen collection after a positive lab test result
cumulative_casescumulative_casesnumberCumulative cases as of the specified date
data_as_ofdata_as_ofcalendar_dateTimestamp for last update date in source system
data_loaded_atdata_loaded_atcalendar_dateTimestamp when data entered the Socrata system