Refactor using parallel workers for get/put operations #5

Open · wants to merge 7 commits into master
Conversation

EXPEddrewery

- List operation adds keys to a queue
- Workers retrieve keys from the queue and copy each key from the source to the destination backend
- Queue size configuration added (defaults to 10000)
- Workers configuration added (defaults to 100)
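
A minimal sketch of what this worker-pool approach looks like, assuming a simple key/value backend interface; the type and function names below are illustrative and are not necessarily the ones used in this PR:

```go
package migrate

import (
	"log"
	"sync"
)

// Backend is an assumed minimal interface for a Vault storage backend;
// the real interfaces in this repository may differ.
type Backend interface {
	ListKeys() ([]string, error)
	Get(key string) ([]byte, error)
	Put(key string, value []byte) error
}

// Migrate lists all keys from src, pushes them onto a bounded queue, and
// runs `workers` goroutines that each pull keys and copy them to dst.
func Migrate(src, dst Backend, workers, queueSize int) error {
	keys, err := src.ListKeys()
	if err != nil {
		return err
	}

	queue := make(chan string, queueSize) // queue size, e.g. 10000 by default
	var wg sync.WaitGroup

	for i := 0; i < workers; i++ { // worker count, e.g. 100 by default
		wg.Add(1)
		go func() {
			defer wg.Done()
			for key := range queue {
				value, err := src.Get(key)
				if err != nil {
					log.Printf("get %q: %v", key, err)
					continue
				}
				if err := dst.Put(key, value); err != nil {
					log.Printf("put %q: %v", key, err)
				}
			}
		}()
	}

	// The list operation feeds the queue; the workers drain it concurrently.
	for _, key := range keys {
		queue <- key
	}
	close(queue)
	wg.Wait()
	return nil
}
```

Because the queue is bounded, the lister blocks once the buffer is full, so memory stays capped even with hundreds of thousands of keys.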
@criloz
Member

criloz commented Mar 28, 2018

hi @EXPEddrewery, thanks for taking the time to make this pull request. Unfortunately, I don't believe I can approve it, because there isn't much to gain from adding goroutines to the script; the task is bound by network I/O. But if you have a strong argument in favor of adopting goroutines, I can reconsider. Thanks!!

@EXPEddrewery
Author

Thanks for replying. I closed this because it wasn't quite ready and I haven't yet proven that it works 100%.

Our use case is a very large Vault backend (900K objects) that we are moving from S3 to DynamoDB.

It's true that network I/O is one restriction, as is the provisioned throughput on DynamoDB.

In my initial tests with the original code, it was going to take hours to transfer that many objects.

I did a test with 10 workers and the time came down to about an hour, then another test with 64 workers, which took about 15 minutes.

I had to ramp up the DynamoDB provisioned throughput to over 5,000 to do this, and even then there was throttling, but it certainly shortened the changeover time for our use case. I also used a C5 instance for maximum network bandwidth.

Once I finish testing and I'm happy with it, I'll push the final code and you can decide whether it's worth including.

I've reduced the workers default to 1, which should give the same level of performance as the current code.

@criloz
Member

criloz commented Mar 28, 2018

@EXPEddrewery those numbers look very nice; I think it's 100% worth it. I'd be ready to test it myself.

- Move some logging to debug
- Add debug logging switch to configuration
@EXPEddrewery
Author

EXPEddrewery commented Mar 29, 2018

Hi again. I've tested it a few times now with my S3 to DynamoDB migration and the numbers seem reasonably good.

I don't have any logs to provide, but essentially I've left your original logging unchanged unless you pass the new debug configuration parameter as true.
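
Roughly, the debug switch just gates the extra log lines; a minimal sketch, assuming a Config struct carries the new parameters (the field and function names here are illustrative, not necessarily what the PR uses):

```go
package migrate

import "log"

// Config carries the new knobs added in this PR; real field names may differ.
type Config struct {
	Workers   int  // parallel copy goroutines (default reduced to 1)
	QueueSize int  // bounded key queue size (default 10000)
	Debug     bool // when false, only the original log output is emitted
}

// debugf logs only when the debug configuration parameter is true.
func (c Config) debugf(format string, args ...interface{}) {
	if c.Debug {
		log.Printf("DEBUG: "+format, args...)
	}
}
```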

Below is a screenshot of the metrics from the migration test I did today with 10K write capacity on DynamoDB and 64 workers, against our Vault cluster, which has approximately 950K objects in S3.

[Screenshot: 10k_test, DynamoDB write metrics during the migration test]

It took just over 15 minutes to run on a c5.large, and as you can see from the graphs, it was throttled pretty heavily and never reached the 10K max. We'll be going with a 5K max and 64 workers for our migration once we plan some downtime.

Let me know if you would like me to add some documentation for the configuration changes.
