Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add script to convert data.json to a TSV file #5 #40

Merged
merged 1 commit into from
Dec 9, 2019
Merged

Conversation

pdurbin
Copy link
Member

@pdurbin pdurbin commented Dec 6, 2019

Closes #5

Related to #7

The idea with the "json2tsv" script in this pull request is to start treating Dataverse installations as tabular data.

The "data.json" file in this repo is used as input.

A TSV file called dataverse-installations.tsv is created, which can be used in Dataverse itself to play around with Dataverse features. Perhaps someday we can also have a Jupyter Notebook to explore the data further.

Here are some screenshots of playing with the TSV file in Dataverse 4.18.1:

Screen Shot 2019-12-06 at 1 04 31 PM

Screen Shot 2019-12-06 at 12 59 59 PM

Screen Shot 2019-12-06 at 1 00 33 PM
Screen Shot 2019-12-06 at 1 01 11 PM
Screen Shot 2019-12-06 at 1 01 29 PM

Screen Shot 2019-12-06 at 1 10 53 PM
Screen Shot 2019-12-06 at 1 11 01 PM
Screen Shot 2019-12-06 at 1 12 17 PM

Screen Shot 2019-12-06 at 1 13 24 PM
Screen Shot 2019-12-06 at 1 14 08 PM

@pdurbin
Copy link
Member Author

pdurbin commented Dec 9, 2019

@shlake can you please take a look at this pull request? I meant to ping @janetm late last night (Monday morning for her) but I forgot. I doubt she'll object to merging it. And we can always take it out later if we don't like it.

I'm mentioning Janet because part of my vision for this to to add the TSV file to the "Dataverse Installation Personas" dataset I recently added at IQSS/dataverse-sample-data@ec21353

The TSV file would be the first file in the Dataverse Installation Personas dataset. Then we would add a Jupyter notebook to make some plots like the ones above. We could explore the Jupyter Notebook in Whole Tale (or Binder once jupyterhub/binderhub#969 has been merged). We could keep adding files to support reproducibility, eventually making a "compute capsule" ( IQSS/dataverse#6085 ).

I hope this is making sense! 😄

Copy link
Contributor

@shlake shlake left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@shlake will need to add info on this script to README file. Will add an issue.

@shlake shlake merged commit 21d4d83 into master Dec 9, 2019
@janetm
Copy link

janetm commented Dec 9, 2019 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Dataverse installations as a tabular file (CSV or TSV)
3 participants