Implement PageAllocation as a handle into a PagedAttentionCache, allowing publishing and releasing an allocation via handle rather than cache #55
| Job | Run time |
|---|---|
| | 4m 41s |
| | 4m 24s |
| | 4m 41s |
| | 4m 0s |
| | 4m 2s |
| | 4m 6s |
| | 3m 23s |
| | 5m 21s |
| Total | 34m 38s |