Data Wrangling

National Parks

Data Wrangling is the process of transforming data from an unreadable format to another format to make it more appropriate for analysis. This project takes three different forms of data and merges it into one all with the National Parks as its theme.

Each of the three data sources has its own unique information but also some overlap. The National Parks API has all information about each of the national parks in its system. The National Parks Trails CSV is derived from the AllTrails website and has information on the trails located in the National Parks. The National Parks elevation table from Wikipedia is a table of the highest and lowest elevations in 63 National Parks. Initially. looking at the data for each source, the National Parks API looks to be the parent as it has the most accurate information coming directly from the National Parks Service. I will be using the park’s endpoint to get data about all the National Parks in its database. The CSV file contains the name of the National Park which I will use to connect it to the API data. The website also has the National Park name which will connect to the API data.

Built With

Python 3, pandas, matplotlib, seaborn, scipy, numpy, wordcloud BeautifulSoup, requests, re, json, configparser, sqlite3

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
AllTrails data - nationalpark.csv		AllTrails data - nationalpark.csv
National_Parks.ipynb		National_Parks.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Wrangling

National Parks

Built With

About

Releases

Packages

Languages

cableredi/Data-Wrangling

Folders and files

Latest commit

History

Repository files navigation

Data Wrangling

National Parks

Built With

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages