COVID-19 Cases by Population Characteristics Over Time

data.sfgov.org | Last Updated 8 Aug 2022

Note: On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021. Some fluctuation in historic data may result when this change is implemented on July 15, 2022. Note: As of March 2022, the race/ethnicity label changed from Native American to American Indian or Alaska Native to align with the Census. Note: On January 22, 2022, system updates to improve the timeliness and accuracy of San Francisco COVID-19 cases and deaths data were implemented. You might see some fluctuations in historic data as a result of this change. Due to the changes, starting on January 22, 2022, the number of new cases reported daily will be higher than under the old system as cases that would have taken longer to process will be reported earlier. <strong>A. SUMMARY</strong> This dataset shows San Francisco COVID-19 cases by population characteristics and by specimen collection date. Cases are included on the date the positive test was collected. Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how cases have been distributed among different subgroups. This information can reveal trends and disparities among groups. Data is lagged by five days, meaning the most recent specimen collection date included is 5 days prior to today. Tests take time to process and report, so more recent data is less reliable. <strong>B. HOW THE DATASET IS CREATED</strong> Data on the population characteristics of COVID-19 cases and deaths are from: * Case interviews * Laboratories * Medical providers These multiple streams of data are merged, deduplicated, and undergo data verification processes. This data may not be immediately available for recently reported cases because of the time needed to process tests and validate cases. Daily case totals on previous days may increase or decrease. <u><a href="https://data.sfgov.org/s/nudz-9tg2/">Learn more</a></u>. Data are continually updated to maximize completeness of information and reporting on San Francisco residents with COVID-19. Data notes on each population characteristic type is listed below. <u> Race/ethnicity</u> * We include all race/ethnicity categories that are collected for COVID-19 cases. * The population estimates for the "Other" or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups. <u> Sexual orientation</u> * The City began asking adults 18 years old or older for their sexual orientation identification during case interviews as of April 28, 2020. Sexual orientation data prior to this date is unavailable. * The City doesn’t collect or report information about sexual orientation for persons under 12 years of age. * Case investigation interviews transitioned to the California Department of Public Health, Virtual Assistant information gathering beginning December 2021. The Virtual Assistant is only sent to adults who are 18+ years old.<u><a href="https://www.sfdph.org/dph/files/PoliciesProcedures/COM9_SexualOrientationGuidelines.pdf"> Learn more about our data collection guidelines pertaining to sexual orientation</a></u>. <u> Gender</u> * The City collects information on gender identity using <a href="https://www.sfdph.org/dph/files/PoliciesProcedures/COM5_SexGenderGuidelines.pdf">these guidelines</a>. <u> Comorbidities</u> * Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death. <u> Transmission type</u> * Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown. <u>Homelessness</u> Persons are identified as homeless based on several data sources: * self-reported living situation * the location at the time of testing * Department of Public Health homelessness and health databases * Residents in Single-Room Occupancy hotels are not included in these figures. These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions. <u> Skilled Nursing Facility (SNF) occupancy</u> * A Skilled Nursing Facility (SNF) is a type of long-term care facility that provides care to individuals, generally in their 60s and older, who need functional assistance in their daily lives. * Facilities are mandated to report COVID-19 cases or deaths among their residents. The City follows up with these facilities to confirm. * There may be differences between the City’s SNF data and the California Department of Public Health (CDPH) dashboard. The difference may be because the City and the State use different methods to collect and validate data. <u> Single Room Occupancy (SRO) tenancy</u> * SRO buildings are defined by the San Francisco Housing Code as having six or more "residential guest rooms" which may be attached to shared bathrooms, kitchens, and living spaces. * The details of a person's living arrangements are verified during case interviews. <strong>C. UPDATE PROCESS</strong> Updates automatically at 05:00 AM Pacific Time each day. Redundant runs are scheduled at 07:00 AM and 09:00 AM in case of pipeline failure. Dataset will not update on the business day following any federal holiday. <strong>D. HOW TO USE THIS DATASET</strong> This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cases on each date. New cases are the count of cases within that characteristic group where the positive tests were collected on that specific specimen collection date. Cumulative cases are the running total of all San Francisco cases in that characteristic group up to the specimen collection date listed. Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity groups are from <u><a href="https://data.census.gov/cedsci/table?q=B03002&g=0500000US06075&tid=ACSDT5Y2019.B03002&hidePreview=false">the 2019 5-year American Community Survey</a></u>. San Francisco population estimates for age groups are from <a href="https://www.cdc.gov/nchs/nvss/bridged_race/data_documentation.htm#vintage2019">the National Center for Health Statistics</a>. This data may not be immediately available for recently reported cases. Data updates as more information becomes available. To explore data on the total number of cases, use <u><a href="https://data.sfgov.org/d/g2di-xufg">the COVID-19 Cases Over Time dataset</a></u>.

This dataset has the following 9 columns:

Column NameAPI Column NameData TypeDescriptionSample Values
Specimen Collection Datespecimen_collection_datecalendar_dateThe date the positive test was collected (the date the person was tested)
Characteristic Typecharacteristic_typetextOverall topic area for a given population characteristic. These are subgroups or demographic cross-sections, like age
Characteristic Groupcharacteristic_grouptextEach group or category within a characteristic type or topic area. ex 0-4 yrs, 5-10 yrs
Characteristic Group Sort Ordercharacteristic_group_sort_ordernumberSort order of characteristic group to aid in visualizing data
New Casesnew_casesnumberCases are counted as confirmed on the date of specimen collection after a positive lab test result
Cumulative Casescumulative_casesnumberCumulative Cases
Population Estimatepopulation_estimatenumberPopulation estimate for a given characteristic type and characteristic group
Data As Ofdata_as_ofcalendar_dateTimestamp for last update date in source system
Data Loaded Atdata_loaded_atcalendar_dateTimestamp when data entered the socrata system