add script to convert data.json to a TSV file #5 #40

pdurbin · 2019-12-06T18:15:29Z

Closes #5

Related to #7

The idea with the "json2tsv" script in this pull request is to start treating Dataverse installations as tabular data.

The "data.json" file in this repo is used as input.

A TSV file called dataverse-installations.tsv is created, which can be used in Dataverse itself to play around with Dataverse features. Perhaps someday we can also have a Jupyter Notebook to explore the data further.

Here are some screenshots of playing with the TSV file in Dataverse 4.18.1:

pdurbin · 2019-12-09T16:16:07Z

@shlake can you please take a look at this pull request? I meant to ping @janetm late last night (Monday morning for her) but I forgot. I doubt she'll object to merging it. And we can always take it out later if we don't like it.

I'm mentioning Janet because part of my vision for this to to add the TSV file to the "Dataverse Installation Personas" dataset I recently added at IQSS/dataverse-sample-data@ec21353

The TSV file would be the first file in the Dataverse Installation Personas dataset. Then we would add a Jupyter notebook to make some plots like the ones above. We could explore the Jupyter Notebook in Whole Tale (or Binder once jupyterhub/binderhub#969 has been merged). We could keep adding files to support reproducibility, eventually making a "compute capsule" ( IQSS/dataverse#6085 ).

I hope this is making sense! 😄

shlake

@shlake will need to add info on this script to README file. Will add an issue.

janetm · 2019-12-09T20:19:29Z

Hi Phil I’m happy with any work you do! Will try to have a look today. Sorry I’ve been out of action, last week at an ML workshop and yesterday we ran a workshop around the recovery and use of an historically valuable demographic and population dataset. First half of the day was tech focused to showcase multiple similar projects at ANU to bring experience together, build a community of practice together etc. it was really promising. Will share more about this as we go. Thx all J ---------------------------------------------------- Janet McDougall | Senior Data Archivist Australian Data Archive (ADA) ANU Centre for Social Research and Methods The Australian National University T: +61 2 6125 0571<tel:+61%202%206125%200571> E: [email protected]<mailto:[email protected]> W: http://ada.edu.au/<http://ada.edu.au/%3Chttp://assda.anu.edu.au/%3Chttp://ada.edu.au/%3chttp:/assda.anu.edu.au/%3Chttp://ada.edu.au/%3chttp:/assda.anu.edu.au/%3chttp:/ada.edu.au/%3chttp:/assda.anu.edu.au/%3E%3E%3E> ---------------------------------------------------- On 10 Dec 2019, at 03:16, Philip Durbin <[email protected]<mailto:[email protected]>> wrote: @shlake<https://github.com/shlake> can you please take a look at this pull request? I meant to ping @janetm<https://github.com/janetm> late last night (Monday morning for her) but I forgot. I doubt she'll object to merging it. And we can always take it out later if we don't like it. I'm mentioning Janet because part of my vision for this to to add the TSV file to the "Dataverse Installation Personas" dataset I recently added at IQSS/dataverse-sample-data@ec21353<IQSS/dataverse-sample-data@ec21353> The TSV file would be the first file in the Dataverse Installation Personas dataset. Then we would add a Jupyter notebook to make some plots like the ones above. We could explore the Jupyter Notebook in Whole Tale (or Binder once jupyterhub/binderhub#969<jupyterhub/binderhub#969> has been merged). We could keep adding files to support reproducibility, eventually making a "compute capsule" ( IQSS/dataverse#6085<IQSS/dataverse#6085> ). I hope this is making sense! 😄 — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<#40?email_source=notifications&email_token=AAJISVFBSEKDYN5CIDS3TDDQXZVMRA5CNFSM4JW6QTXKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEGJXVBY#issuecomment-563313287>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AAJISVDTORGNFKLHY6Q5D5LQXZVMRANCNFSM4JW6QTXA>.

add script to convert data.json to a TSV file #5

aa9890e

pdurbin requested a review from shlake December 6, 2019 18:15

This was referenced Dec 6, 2019

launch year for each Dataverse installation #7

Open

Dataverse installations as a tabular file (CSV or TSV) #5

Closed

pdurbin requested a review from janetm December 6, 2019 18:54

pdurbin assigned shlake Dec 9, 2019

shlake approved these changes Dec 9, 2019

View reviewed changes

shlake merged commit 21d4d83 into master Dec 9, 2019

shlake mentioned this pull request Dec 9, 2019

Document json2tsv.py Script #41

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add script to convert data.json to a TSV file #5 #40

add script to convert data.json to a TSV file #5 #40

pdurbin commented Dec 6, 2019

pdurbin commented Dec 9, 2019

shlake left a comment

janetm commented Dec 9, 2019 via email

add script to convert data.json to a TSV file #5 #40

add script to convert data.json to a TSV file #5 #40

Conversation

pdurbin commented Dec 6, 2019

pdurbin commented Dec 9, 2019

shlake left a comment

Choose a reason for hiding this comment

janetm commented Dec 9, 2019 via email