Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance Clarification #8

Open
benjaminy opened this issue Oct 30, 2023 · 2 comments
Open

Performance Clarification #8

benjaminy opened this issue Oct 30, 2023 · 2 comments

Comments

@benjaminy
Copy link

Is my understanding correct that push and pull operations using this remote require [up/down]loading the entire (compressed) repo? That seems like a bummer in situations where the repo is large and the deltas are small. Does anyone know how feasible it would be to make a remote that works more incrementally?

@mih
Copy link
Member

mih commented Oct 30, 2023

Please note that this project is no longer active. The presently most advanced implementation is here https://docs.datalad.org/projects/next/en/latest/generated/datalad_next.gitremotes.datalad_annex.html

A more incremental behavior is feasible, but the Q is what a use case for it would be?

In practice, many storage systems have significant request latencies, but little streaming throughput limits. For reasonably large repos we found that a plain reupload is faster and simpler. When more complex patterns are needed, it is often better to use a git-based remote directly, because it also addresses other issues that come with this simplification.

@benjaminy
Copy link
Author

benjaminy commented Oct 30, 2023

Thank you for the quick reply. My long-term goal is to build a framework for (asynchronous) collaboration applications (shared calendar, address book, etc) that has the smallest possible dependence on any 3rd party service/server. My working architecture sketch involves each group member storing a (encrypted) copy of the team's data on a commodity cloud service. If I can make this work with a Git repo, that seems like it will provide a lot of useful concurrent versioning functionality.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants