Implement PageAllocation as a handle into a PagedAttentionCache, allowing publishing and releasing an allocation via handle rather than cache #17
Job | Run time |
---|---|
5m 51s | |
7m 5s | |
4m 35s | |
4m 12s | |
3m 48s | |
4m 25s | |
3m 37s | |
4m 58s | |
38m 31s |
Job | Run time |
---|---|
5m 51s | |
7m 5s | |
4m 35s | |
4m 12s | |
3m 48s | |
4m 25s | |
3m 37s | |
4m 58s | |
38m 31s |