-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add script to convert data.json to a TSV file #5 #40
Conversation
@shlake can you please take a look at this pull request? I meant to ping @janetm late last night (Monday morning for her) but I forgot. I doubt she'll object to merging it. And we can always take it out later if we don't like it. I'm mentioning Janet because part of my vision for this to to add the TSV file to the "Dataverse Installation Personas" dataset I recently added at IQSS/dataverse-sample-data@ec21353 The TSV file would be the first file in the Dataverse Installation Personas dataset. Then we would add a Jupyter notebook to make some plots like the ones above. We could explore the Jupyter Notebook in Whole Tale (or Binder once jupyterhub/binderhub#969 has been merged). We could keep adding files to support reproducibility, eventually making a "compute capsule" ( IQSS/dataverse#6085 ). I hope this is making sense! 😄 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@shlake will need to add info on this script to README file. Will add an issue.
Hi Phil
I’m happy with any work you do! Will try to have a look today.
Sorry I’ve been out of action, last week at an ML workshop and yesterday we ran a workshop around the recovery and use of an historically valuable demographic and population dataset. First half of the day was tech focused to showcase multiple similar projects at ANU to bring experience together, build a community of practice together etc. it was really promising.
Will share more about this as we go.
Thx all
J
----------------------------------------------------
Janet McDougall | Senior Data Archivist
Australian Data Archive (ADA)
ANU Centre for Social Research and Methods
The Australian National University
T: +61 2 6125 0571<tel:+61%202%206125%200571>
E: [email protected]<mailto:[email protected]>
W: http://ada.edu.au/<http://ada.edu.au/%3Chttp://assda.anu.edu.au/%3Chttp://ada.edu.au/%3chttp:/assda.anu.edu.au/%3Chttp://ada.edu.au/%3chttp:/assda.anu.edu.au/%3chttp:/ada.edu.au/%3chttp:/assda.anu.edu.au/%3E%3E%3E>
----------------------------------------------------
On 10 Dec 2019, at 03:16, Philip Durbin <[email protected]<mailto:[email protected]>> wrote:
@shlake<https://github.com/shlake> can you please take a look at this pull request? I meant to ping @janetm<https://github.com/janetm> late last night (Monday morning for her) but I forgot. I doubt she'll object to merging it. And we can always take it out later if we don't like it.
I'm mentioning Janet because part of my vision for this to to add the TSV file to the "Dataverse Installation Personas" dataset I recently added at IQSS/dataverse-sample-data@ec21353<IQSS/dataverse-sample-data@ec21353>
The TSV file would be the first file in the Dataverse Installation Personas dataset. Then we would add a Jupyter notebook to make some plots like the ones above. We could explore the Jupyter Notebook in Whole Tale (or Binder once jupyterhub/binderhub#969<jupyterhub/binderhub#969> has been merged). We could keep adding files to support reproducibility, eventually making a "compute capsule" ( IQSS/dataverse#6085<IQSS/dataverse#6085> ).
I hope this is making sense! 😄
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<#40?email_source=notifications&email_token=AAJISVFBSEKDYN5CIDS3TDDQXZVMRA5CNFSM4JW6QTXKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEGJXVBY#issuecomment-563313287>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AAJISVDTORGNFKLHY6Q5D5LQXZVMRANCNFSM4JW6QTXA>.
|
Closes #5
Related to #7
The idea with the "json2tsv" script in this pull request is to start treating Dataverse installations as tabular data.
The "data.json" file in this repo is used as input.
A TSV file called dataverse-installations.tsv is created, which can be used in Dataverse itself to play around with Dataverse features. Perhaps someday we can also have a Jupyter Notebook to explore the data further.
Here are some screenshots of playing with the TSV file in Dataverse 4.18.1: