covid act now links and context around deactivation
nmdefries committed Jul 10, 2024
1 parent 5f23936 commit e26f2c9
Showing 1 changed file with 21 additions and 16 deletions.
37 changes: 21 additions & 16 deletions docs/api/covidcast-signals/covid-act-now.md
@@ -15,11 +15,13 @@ grand_parent: COVIDcast Main Endpoint
* **Time type:** day (see [date format docs](../covidcast_times.md))
* **License:** [CC BY-NC](../covidcast_licensing.md#creative-commons-attribution-noncommercial)

The COVID Act Now (CAN) data source provides COVID-19 testing statistics, such as positivity rates and total tests performed.
The county-level positivity rates and test totals are pulled directly from CAN.
While CAN provides this data potentially from multiple sources, we only use data sourced from the
The [COVID Act Now (CAN)](https://covidactnow.org/) data source provides COVID-19 testing statistics, such as positivity rates and total tests performed.
The county-level positivity rates and test totals are pulled directly from CAN using [their API](https://covidactnow.org/data-api).
While CAN provides this data potentially from multiple sources, we only use data that CAN sources from the
[CDC's COVID-19 Integrated County View](https://covid.cdc.gov/covid-data-tracker/#county-view).

Delphi's mirror of the CAN data was deactivated in December 2021 (last issue 2021-12-10) in favor of the [DSEW CPR data](./dsew-cpr.md), which reports the same information under the `covid_naat_pct_positive_7dav` signal.
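
As a rough illustration of how a reader might look up the replacement signal, the sketch below only builds a query URL and makes no network request. The base URL, the query parameter names, and the `dsew-cpr` source identifier are assumptions about Delphi's Epidata COVIDcast endpoint and should be checked against the API documentation. The county code and date are arbitrary placeholders, and `build_covidcast_url` is a hypothetical helper.

```python
from urllib.parse import urlencode

# ASSUMPTION: base URL of Delphi's Epidata COVIDcast endpoint.
BASE_URL = "https://api.delphi.cmu.edu/epidata/covidcast/"


def build_covidcast_url(data_source, signal, geo_type, geo_value, time_values):
    """Build a COVIDcast query URL (hypothetical helper; no request is sent)."""
    # ASSUMPTION: these parameter names match the endpoint's query interface.
    params = {
        "data_source": data_source,
        "signal": signal,
        "time_type": "day",
        "geo_type": geo_type,
        "geo_value": geo_value,
        "time_values": time_values,
    }
    return BASE_URL + "?" + urlencode(params)


# Placeholder county FIPS code and date, chosen only for illustration.
url = build_covidcast_url(
    data_source="dsew-cpr",
    signal="covid_naat_pct_positive_7dav",
    geo_type="county",
    geo_value="42003",
    time_values="20211215",
)
print(url)
```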


| Signal | Description |
|--------------------------------|----------------------------------------------------------------|
@@ -34,9 +36,9 @@ While CAN provides this data potentially from multiple sources, we only use data

## Estimation

The quantities received from CAN / CDC are the county-level positivity rate and total tests,
which are based on the counts of PCR specimens tested.
In particular, they are also already smoothed with a 7-day-average.
We receive county-level positivity rates and total tests from CAN, originating from the CDC.
These quantities are based on the counts of PCR specimens tested.
They are also already smoothed with a 7-day average.

For a fixed location $$i$$ and time $$t$$, let $$Y_{it}$$ denote the number of PCR specimens
tested that have a positive result. Let $$N_{it}$$ denote the total number of PCR specimens tested.
@@ -79,38 +81,41 @@ $$

### Smoothing

No additional smoothing is done to avoid double-smoothing, since the data pulled from CAN / CDC
To avoid double-smoothing, no additional smoothing is done, since the CAN data
is already smoothed with a 7-day-average.

## Limitations

Estimates for geographical levels beyond counties may be inaccurate due to how aggregations
are done on smoothed values instead of the raw values. Ideally we would aggregate raw values
Estimates for geographical levels beyond counties may be inaccurate because our aggregations
are performed on smoothed values instead of the raw values.
Ideally we would aggregate raw values
then smooth, but the raw values are not accessible in this case.
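
A toy sketch of why the order of operations can matter. It assumes, purely for illustration, that a smoothed rate is the mean of daily rates and that aggregation weights each county's smoothed rate by its smoothed test count. Neither assumption is confirmed by the source, the counts are made up, and a 3-day window stands in for the 7-day average to keep the example short.

```python
# Made-up raw counts for two hypothetical counties over a 3-day window.
raw = {
    "county_a": {"positive": [1, 10, 1], "tests": [10, 20, 10]},
    "county_b": {"positive": [5, 5, 50], "tests": [100, 100, 200]},
}
n_days = 3


def mean(values):
    return sum(values) / len(values)


def smoothed_rate(county):
    # ASSUMPTION: the smoothed rate is the mean of the daily rates.
    return mean([y / n for y, n in zip(county["positive"], county["tests"])])


def smoothed_tests(county):
    return mean(county["tests"])


# Approach 1: aggregate the already-smoothed county values,
# weighting each smoothed rate by the county's smoothed test count.
weighted_sum = sum(smoothed_rate(c) * smoothed_tests(c) for c in raw.values())
agg_of_smoothed = weighted_sum / sum(smoothed_tests(c) for c in raw.values())

# Approach 2: pool the raw counts across counties per day, then smooth.
pooled_daily_rates = [
    sum(c["positive"][d] for c in raw.values())
    / sum(c["tests"][d] for c in raw.values())
    for d in range(n_days)
]
smoothed_of_agg = mean(pooled_daily_rates)

print(f"aggregate of smoothed values: {agg_of_smoothed:.4f}")
print(f"smoothed aggregate of raw values: {smoothed_of_agg:.4f}")
```

Under these assumptions the two results differ, which is the kind of discrepancy the limitation describes. With a different smoothing definition the gap could be smaller or vanish.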

The positivity rate here should not be interpreted as the population positivity rate as
The reported test positivity rate should not be interpreted as the population positivity rate as
the tests performed are typically not randomly sampled, especially for early data
with lower testing volumes.

A few counties, most notably in California, are also not covered by this data source.

Entries with zero total tests performed are also suppressed, even if it was actually the case that
Entries with zero total tests performed are suppressed, even if it was actually the case that
no tests were performed for the day.

## Lag and Backfill

The lag for these signals varies depending on the reporting patterns of individual counties.
Most counties have their latest data report with a lag of 2 days, while others can take 9 days
or more in the case of California counties.
or more, as is the case with California counties.

These signals are also backfilled as backlogged test results could get assigned to older 7-day timeframes.
Most recent test positivity rates do not change substantially with backfill (having a median delta of close to 0).
However, most recent total tests performed is expected to increase in later data revisions (having a median increase of 7%).
Revisions are sometimes made to the data. For example, backlogged test results can get assigned to past dates.
The majority of recent test positivity rates do not change substantially with backfill (having a median delta of close to 0).
However, the majority of recent values for total tests performed are expected to increase in later data revisions (having a median increase of 7%).
Values more than 5 days in the past are expected to remain fairly static (with total tests performed
having a median increase of 1% or less), as most major revisions have already occurred.
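
A minimal sketch of how a median revision size like the ones quoted above could be computed by comparing the first-reported value for each date with a later revision. The numbers are invented for illustration and are not CAN data. `median_relative_change` is a hypothetical helper, not part of any Delphi tooling.

```python
from statistics import median


def median_relative_change(first_reported, latest_reported):
    """Median of (latest - first) / first, skipping zero first reports."""
    changes = [
        (latest - first) / first
        for first, latest in zip(first_reported, latest_reported)
        if first != 0
    ]
    return median(changes)


# Invented total-test counts for four dates: as first issued, then as revised.
first_issue = [100, 200, 400, 50]
later_issue = [107, 214, 420, 55]

backfill = median_relative_change(first_issue, later_issue)
print(f"median relative increase: {backfill:.0%}")
```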

## Source and Licensing

County-level testing data is scraped by CAN from the
County-level testing data is scraped by [CAN](https://covidactnow.org/) from the
[CDC's COVID-19 Integrated County View](https://covid.cdc.gov/covid-data-tracker/#county-view),
and made available through [CAN's API](https://covidactnow.org/tools).

The data is made available under a [CC BY-NC](../covidcast_licensing.md#creative-commons-attribution-noncommercial) license.
