project-parks-pulse

Description

National parks in the United States, often hailed as "America's best idea," have seen a notable surge in interest in recent years. This project aims to develop a comprehensive measure of individual park health, known as the Park Health Index (PHI), by analyzing the U.S. National Parks System (NPS) and other publicly available datasets. The PHI will consider various factors such as park usage, climate data, management practices, and potential hazards, assigning appropriate weights to each to provide a holistic view of a park's current condition.

Furthermore, this project seeks to analyze trends in park visitation and the impact of climate change over a 10-year period (2011-2022) to offer insights into park management and conservation efforts.

By achieving these goals, the project aims to (1) raise awareness about the health of national parks, both as a snapshot and over time, (2) uncover patterns useful for assessing and evaluating park management, and (3) provide a benchmark to highlight areas of risk, enabling policymakers to make informed decisions regarding parks conservation and resource allocation.

Built with

Python
matplotlib/Seaborn
Statsmodels
Pandas
Numpy
BeautifulSoup4
geopandas
lxml
cssselect
fastkml

Data Sources

DMR: https://www.pewtrusts.org/
NPS Spending Data: https://www.nps.gov/aboutus/budget.htm
Weather data: https://www.visualcrossing.com/
NP Basic Info(name/location/size) Data: https://en.wikipedia.org/wiki/List_of_national_parks_of_the_United_States
NP Visitation Data: https://irma.nps.gov/Stats/SSRSReports/National%20Reports/Annual%20Visitation%20and%20Record%20Year%20by%20Park%20(1904%20-%20Last%20Calendar%20Year)

Getting Started

To get started with the Dashboard, follow these steps:

Clone the repository: git clone https://github.com/SeongyeonYang/30122-project-parks-pulse.git
Run poetry install to install the necessary packages
Run poetry shell to activate the virtual environment
Run python -m ppp to open the visualization and analysis

Project Directory Structure

ppp
- analysis
  - visualizations
  - app.py
  - .py files about analysis (total 6)
- cleaning
  - cleaned_data
    - cleaned_time_series_all.csv (all merged files based on time series)
    - cleaned_nps_info.csv (all merged files based on information)
    - .py files about cleaned data before merge into 2 files above (total 14)
    - .py files about factor_ratio in 2015 and 2019 (total 4)
  - raw_data
    - climate
    - nps_spending
    - dmr-2015-2019.csv
    - dmr-2023.csv
    - npca-orphaned-wells.csv
    - nps-boundary.geojson
    - nps-nightsky-monitoring.kml
    - nps-parkcode.csv

Data Visualization Demo

Scree Plot of eigen value from factor analysis
2015_composite_index_distribution
Average Temperature Over Years by Region
Correlation Matrix of Visitation, Spending, and Climate Variables (2011-2022).png

Authors

Minh Nghiem
Seongyeon Yang
Yi-Huai Chang
Diyanet Nijiati

Name		Name	Last commit message	Last commit date
Latest commit History 137 Commits
ppp		ppp
.gitignore		.gitignore
README.md		README.md
poetry.lock		poetry.lock
proj_paper.pdf		proj_paper.pdf
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

project-parks-pulse

Description

Built with

Data Sources

Getting Started

Project Directory Structure

Data Visualization Demo

Authors

About

Releases

Packages

Contributors 4

Languages

uchicago-mscapp-projects/parks-pulse

Folders and files

Latest commit

History

Repository files navigation

project-parks-pulse

Description

Built with

Data Sources

Getting Started

Project Directory Structure

Data Visualization Demo

Authors

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages