Population #2026

Open
dsweber2 opened this issue Aug 19, 2024 · 1 comment
Labels
API addition New signals

Comments

@dsweber2
Contributor

We use population at various geo levels to aggregate some signals. @RoniRos has suggested that we make these publicly available, both for reproducibility and as a useful signal in their own right.

We already have population for some geo_types in the geo_mapper utility, and this issue is about making those available. It's unclear whether we should use version or reference_date to record populations changing over time (at the moment, the geo_mapper has only 2 values). reference_date would more accurately reflect the meaning, but version would allow compression to only the values actually present.

@dsweber2 dsweber2 added the API addition New signals label Aug 19, 2024
@RoniRos
Member

RoniRos commented Aug 20, 2024

> Unclear whether we should use version or reference_date to record populations changing over time (at the moment, the geo_mapper has only 2 values). reference_date would more accurately reflect the meaning, but version would allow compression to only the actually present values.

I'm not sure what the right answer is, but let me share what I think would be the use cases: I expect population sizes to be revised once a year, by importing extrapolation estimates from the US Census Bureau. Then different signals will use the population numbers that are relevant to their respective reference_date. So it feels at least like 'reference_date' is very appropriate here.
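To make the lookup concrete, here is a minimal sketch of how a signal might pick the population relevant to its reference_date when only the annual revisions are stored. The table layout, the `population_as_of` helper, and the numbers are hypothetical, not anything that exists in geo_mapper today.

```python
from bisect import bisect_right
from datetime import date

# Hypothetical revision table for one geo unit: (effective date, population).
# The values are made up for illustration only.
REVISIONS = [
    (date(2019, 7, 1), 1_000_000),
    (date(2020, 7, 1), 1_010_000),
]


def population_as_of(revisions, reference_date):
    """Return the population from the latest revision on or before reference_date.

    `revisions` must be sorted by effective date in ascending order.
    """
    dates = [effective for effective, _ in revisions]
    idx = bisect_right(dates, reference_date)
    if idx == 0:
        raise ValueError("no population estimate on or before reference_date")
    return revisions[idx - 1][1]
```

Storing only the dates on which the value changes keeps the table as small as the version approach would, while the lookup is still keyed by reference_date.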

Another use case is deriving what I call a "micro density" signal: population density per land area, calculated at the very fine level of census block or census tract, then geo-aggregated by weighting by population size (so you get the "average density" that a person in the country experiences). It's basically #people/land area, but at a very fine geo level. This will require population sizes at these fine geo levels. The land areas at census level (excluding e.g. bodies of water) are available and unlikely to change for a given geo unit (unless the geo units themselves change). So "micro density" is also expected to be revised annually, with the population revision.
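As a rough sketch of the aggregation described above, the population-weighted average of per-unit density could be computed like this. The `micro_density` function and its input format are assumptions made for illustration; they are not an existing utility.

```python
def micro_density(units):
    """Population-weighted mean of per-unit density.

    `units` is an iterable of (population, land_area) pairs, one per fine
    geo unit (e.g. census block or tract). Hypothetical input format.
    """
    total_population = 0
    weighted_sum = 0.0
    for population, land_area in units:
        if population == 0:
            continue  # an empty unit carries no weight
        if land_area <= 0:
            raise ValueError("populated unit must have positive land area")
        density = population / land_area
        weighted_sum += population * density  # weight each density by population
        total_population += population
    if total_population == 0:
        raise ValueError("no population in any unit")
    return weighted_sum / total_population
```

For example, two units of equal area holding 300 and 100 people have a plain density of 200 per unit area, but a micro density of 250, because more people live in the denser unit.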
