Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

s3-sync-to-host super slow with lots of files #374

Open
joehoyle opened this issue Oct 21, 2021 · 1 comment
Open

s3-sync-to-host super slow with lots of files #374

joehoyle opened this issue Oct 21, 2021 · 1 comment
Labels
developer advocacy Developer Advocacy related tasks

Comments

@joehoyle
Copy link
Member

If I have a lot of files in the local S3 (from an import), starting the local-server takes an age (15+ minutes) due to s3-sync-to-host.

It seems every time we start the local-server, we are trying to pull all files over to the host, is that right? If so, I'm wondering why we would want to do that.

cc @roborourke as I think you know about this.

@roborourke
Copy link
Contributor

Can you quantify what a lot of files is more precisely, and how you narrowed the slowness down to s3-sync-to-host?

s3-sync-to-host is a background service and isn't depended on by any other container so it shouldn't block the server coming up, it runs the Minio Client mirror command, which is like rsync. It shouldn't be copying anything that doesn't need to be copied so we're not trying to copy all files every time you start it up.

We have this in place to make the dev experience smoother when starting / destroying Local Server. Images uploaded would not appear on the host machine which caused some confusion, see #140

That said, between the import-uploads command and the fact that Minio has a built in S3 browser that could be a solution to original problem described, we would just need to document clearly why uploads don't appear on the host how to view files in S3 via the browser.

There's an annoying dependency on this sync to host functionality with the Elasticsearch packages feature as they have to be uploaded to S3 but then made available / mounted on the ES container - that requires syncing to the host.

@missjwo missjwo added the developer advocacy Developer Advocacy related tasks label Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
developer advocacy Developer Advocacy related tasks
Projects
None yet
Development

No branches or pull requests

3 participants