Skip to content

Commit 6d8f160

Browse files
authored
Clarify NSSP documentation, esp 'HSA' semantics (#1634)
1 parent 4a462d4 commit 6d8f160

File tree

1 file changed

+7
-1
lines changed

1 file changed

+7
-1
lines changed

docs/api/covidcast-signals/nssp.md

+7-1
Original file line numberDiff line numberDiff line change
@@ -43,6 +43,7 @@ As of May 2024, NSSP received data from 78% of US EDs.
4343

4444
The percent visits signals are calculated as a fraction of visits at facilities reporting to NSSP, rather than all facilities in the area.
4545
`county`, `state` and `nation` level data is reported as-is from NSSP, without modification, while `hhs`, `hrr` and `msa` are estimated by Delphi.
46+
State and HSA-level values are calculated and published by NSSP; County level values are not published individually, but are approximations copied from the HSA the county is in (every county in an HSA will have identical values).
4647

4748
### Geographic weighting
4849
As the original data is a percentage and raw case counts are not available, `hrr`,`msa`, and `hhs` values are computed from county-level data using a weighted mean. Each county is assigned a weight equal to its population in the last census (2020). Unreported counties are implicitly treated as having a weight of 0 or a value equal to the group mean.
@@ -69,6 +70,8 @@ The following states report no data through NSSP at the county level: CA, WA, AK
6970

7071
South Dakota, Missouri, and territories report no data through NSSP at the state level.
7172

73+
The only completely non-reporting state is Missouri.
74+
7275

7376
## Lag and Backfill
7477

@@ -84,6 +87,9 @@ Counties with `NA` values are as originally reported in the dataset from which t
8487

8588
## Limitations
8689

90+
As noted above, only state and HSA-level values are calculated and published by NSSP; County level values are not published individually, but are approximations copied from the HSA the county is in (every county in an HSA will have identical values).
91+
The HSA (Health Service Area) definitions used are known as ["NCI Modified"](https://seer.cancer.gov/seerstat/variables/countyattribs/hsa.html).
92+
8793
There is substantial missingness at the county level. This tends to impact more rural and lower population locations. See the [missingness section](#missingness) for more information.
8894

8995
Not all counties contain reporting EDs, including in states where NSSP reports state-level data.
@@ -104,4 +110,4 @@ Some low population counties occasionally report outliers, e.g. 33.33%, 50%, 100
104110
This source is derived from the CDC's [Respiratory Virus Response NSSP Emergency Department Visit Trajectories dataset](https://data.cdc.gov/Public-Health-Surveillance/2023-Respiratory-Virus-Response-NSSP-Emergency-Dep/rdmq-nq56/about_data).
105111
There is another version of the dataset that includes [state data only](https://data.cdc.gov/Public-Health-Surveillance/2023-Respiratory-Virus-Response-NSSP-Emergency-Dep/7mra-9cq9/about_data).
106112

107-
This data was originally published by the CDC, and is made available here as a convenience to the forecasting community under the terms of the original license, which is [U.S. Government Public Domain](https://www.usa.gov/government-copyright).
113+
This data was originally published by the CDC, and is made available here as a convenience to the forecasting community under the terms of the original license, which is [U.S. Government Public Domain](https://www.usa.gov/government-copyright).

0 commit comments

Comments
 (0)