This is your second real data-programming task and it may be more time-consuming than the previous exercises. The good side of the story is that it is a groupwork so you can split the tasks between your groupmates, and save time in this way.
The central task here is to pull information over google API-s, and manipulate the resulting json data into nice readable tables. This involves reading about the API documentation, experimenting with the queries, and a lot of hassling with lists, data frames to squeeze uncooperative results into the desired form! Unfortunately, this is a frequent experience with APIs. Data comes in a million different ways, the API-s are fluid, and bad documentation is more of a rule than exception.
Finally, you put the result in a rmarkdown file and compile it to an html.
Have fun!
Your task is to display some information about elected officials, representing two US addresses. You are free to pick any address as long as these are from a different state. Even more, if you change the address, your report should update accordingly when knitted, potentially showing completely different officials if the new address is from another state.
This is a perfect task for API-s. You can get all you need from google civic information API that links addresses to elected officials, and can give you additional information about the U.S. Representatives.
The repo also includes a few files that help you to get started:
- index_stub.rmd: a stub for the report. We recommend you to take this file as template and start filling it with your code and text. It also includes suggestions and example pieces of code.
- We recommend you to rename index_stub into index as the final file should be labelled index.html.
- process-data.R: template stub file that should contain your main data analysis code. You may take this too as the template for your actual code.
- index.html: an example how the result should look like
- styles.css: a proposed stylesheet (how to use this is visible from the .rmd file).
- For those who want to compile it outside RStudio, the set also includes Makefile. (If you work in RStudio, it is not needed.)
See the included index.html file.
See google civic information API documentation.
Google API requires a valid key, follow the instructions on google cloud support.
Store the key on your computer:
- don't upload it to GitHub!
- don't put it in your code that will be uploaded to GitHub!
There are many ways to achieve this. However, in order to make grading easier, please do it like this:
- create an R file keys.R in your Documents folder (the normal Documents folder of your computer, not inside this project). The file should contain a single line of R code:
google.key <- "123xyz"
The file should not contain anything else. Please use exactly the same names for the keys and the keyfile.
- in your r-markdown file, in the first code cunk, you load this file as
source("~/Documents/keys.R")
I recommend to experiment first with the queries on the interactive console, provided at google cloud. In particular, you should manage to:
- get the elected officials' info from Google's Civic Information. First read the relevant documentation for Civic Information API reference and representatives by address. Note that Civic Platform expects authentication to be provided as key=your-google-civic-key in the query. In this task you only need to extract the full list of representatives by address.
Write a short introduction (a few sentences) where you explain where you get the following information. Provide API links! Use bold/italics according to need.
Write a function (in the process-data.R file) that takes address as an argument and outputs a markdown table of representatives. Note that the GET request provides two tables (embedded into the same json): offices, and officials. Offices are the corresponding position (say, President of the US), officials are the persons currently at that position (here Donald J. Trump). Offices data includes reference for the corresponding officials. Note: there are more than one person for certain offices, such as US Senate.
The final table should contain all elected officials from the Civic Information for the address. The table should include
- name
- position
- party
- email (link)
- phone
- photo (note: the style.css helps to ensure the photos are of suitable size)
Ensure that missing information (such as missing phone numbers) are
displayed in an pleasant and informative way (such as not available
or '-') and not as <NA>
or !Error
or other kind of ugly code.
Pick the other address (from another state), and use the function you created above to create another similar table. Chances are that something will look wrong and you have to fix it in the function.
- The tables should look human-readable and pleasant, and must not include leftovers of JSON, raw HTML, computer code etc.
Section (points) | Task | Points |
---|---|---|
intro (5) | address | 1 |
API links | 2 | |
appropriate explanations included | 2 | |
Table of Officials (55) | Names displayed, linked to website | 10 |
photos visible | 15 | |
missing photos replaced by '-' or similar | 10 | |
phone, party, position included | 5 | |
missing phones etc replaced by '-' or similar | 10 | |
emails are links | 5 | |
Data processing (40) | code clear, logical, efficient | 10 |
uses the same function for both addresses | 10 | |
works with different address | 10 | |
API keys hidden, not uploaded | 5 | |
code organized into appropriate | ||
functions/files | 5 |