-
-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add live demos #30
Comments
Maybe cog generates the workflow YAML and the recipes markdown from the same source somehow? |
I'm going to put the demos in the README itself, so that they end up on https://datasette.io/ and PyPI. |
I have 19 git scraping repos of my own: https://github.com/search?q=topic%3Agit-scraping+user%3Asimonw&type=repositories Options for demos:
|
And there are repos by other people:
|
Here's the 511 demo recipe:
|
Here's the CA fires demo:
Can graph the acres burned for a specific fire using this:
|
It's a shame both of these demos need a |
PGE is a good demo for that:
Shows Does take about an hour to generate though! |
Maybe FARA is good - I can link to the blog entry that explains it, and it's CSV, and the DB is probably quite small. Can maybe find Manafort in it as a demo? |
FARA is no good because the |
https://github.com/simonw/neededge-history/blob/main/v1.xml is interesting. It showed me a bug in the
Actually it's not interesting, because the data in it never changes - the only column you get is |
I'm going to try for It's only 247 commits, but the first one takes a LONG time because it has ~185,000 records in it. |
Yeah I'm going to do the trees one for the CSV example. So the examples are:
|
... but the trees export breaks at simonw/sf-tree-history@3fb63a9 - which is why I added this feature:
|
I'm going to stash the intermediary |
Had to use |
Here's the tree recipe that finally worked:
|
Most exciting demo is still likely to be this one: https://github.com/adolph/getIncidentsGit
Could do with a couple of extra conversions, in particular the
Also would be neat if columns like |
My big concern at this point is size: three of my four preferred demos are multiple hundreds of MB. I think they're going to work on Cloud Run but it's towards the upper limit of what I'm happy to host there. |
Deploying the first version of this with:
So big but maybe not too big? |
That deployed fine and it seems snappy enough: https://git-history-demos-j7hipcg4aq-uc.a.run.app/ |
That change to the |
I'll configure |
https://git-history-demos.datasette.io/ is working now. |
I added |
Got the bucket and the credentials now. Adding those credentials to this GitHub repository as secrets |
https://git-history-demos.datasette.io/ was just successfully deployed using a manual trigger of the new GitHub Actions workflow! https://github.com/simonw/git-history/runs/4441234271?check_suite_focus=true |
I'm not convinced UPDATE: no it was working fine, see notes on simonw/datasette-remote-metadata#3 |
Add these to the README and I can close the issue. |
git-history-demos.datasette.io hosts three example databases created using this tool:
|
Refs #30. I want to use the fix for this: simonw/datasette#1544
Wrote up part of this as a TIL: https://til.simonwillison.net/github-actions/s3-bucket-github-actions |
Also this blog entry: https://simonwillison.net/2021/Dec/7/git-history/ |
I'm tempted to pull a bunch of different example repos on a schedule and bundle them into the same demo instance.
Could have a
recipes.md
documentation page that shares the same demos and shows how they were built, using cog somehow.The text was updated successfully, but these errors were encountered: