Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Synthesize "national" data for nchs-mortality:deaths_covid_incidence_prop #1906

Closed
melange396 opened this issue Oct 31, 2023 · 2 comments · Fixed by #1912 or #1938
Closed

Synthesize "national" data for nchs-mortality:deaths_covid_incidence_prop #1906

melange396 opened this issue Oct 31, 2023 · 2 comments · Fixed by #1912 or #1938
Assignees

Comments

@melange396
Copy link
Contributor

There are currently only "state" specific data points for NCHS Mortality series. That is, only geo_type=state is provided for any of the data with source=nchs-mortality. It would be advantageous for us to have this information aggregated to the national level. We can (presumably) synthesize this by summing numbers from all states to produce a new "nation" data series (for each reference and issue date available).

In cmu-delphi/www-covidcast#1233, on the Covidcast Dashboard, we intend to replace the old and no-longer-updated JHU deaths signal (jhu-csse:deaths_7dav_incidence_prop) with one from NCHS Mortality Data (nchs-mortality:deaths_covid_incidence_prop). The default view of the dashboard uses a geographic perspective of the United States as a whole. Without an available "nation"-level signal for deaths, it shows as "N/A" in plots and numeric text.

According to our signal documentation, the only currently active signals that do not provide "nation"-level data are from the sources nchs-mortality and quidel. Do we want to construct national data for all of these?

@melange396
Copy link
Contributor Author

It looks like we do something similar for doctor_visits data, by summing across counties to produce "nation" data (among other aggregationss). See:

sensor = update_sensor(
filepath=claims_file,
startdate=startdate,
enddate=enddate,
dropdate=dropdate,
geo=geo,

mapping_func = geo_map.geo_func[geo.lower()]


@melange396
Copy link
Contributor Author

(this text is copied from #1912 (comment) ; its more appropriate here)

nchs-mortality national-level data went in last thursday for the first time...

Did this work for you in your local testing? Since these values are coming out as "NULL", it makes me think the denominator here is 0 or null, which then makes me think that the national population is not getting set properly here.

Also, to make this message go away, we need to mark the nchs-mortality signals as having national-level data in the spreadsheet and then get it transferred to the csv. We can take care of that after the signals are properly acquired.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants