You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello @masahi! No, my fix in #215 resolved very strong (more than one order) reduction after #214.
About task 1: 1. we observed reduction ~25-30% after #192 2. It was not resolved, I'm investigating the issue
About task 2: I remember about logprobs, but looks like resolving of task 1 requires sampler refactor and I want to do it first (or somebody will do it)
Lunderberg
pushed a commit
to Lunderberg/mlc-llm
that referenced
this issue
Feb 27, 2024
This PR makes API to be consistent across iOS and C++ implementation.
- Use ChatModule as the wrapper API.
- Use the same function name across implementations
modulo style to match language native preference
- Initial round of documentations about these API
Let's collect remaining issues we are aware of related to sampler performance
benchmark_throughput.py
) after Sampler Throughput Optimization #192 when only greedy sampling is used.The text was updated successfully, but these errors were encountered: