You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This issue will track the progress of this work. I am also hoping to collect some feedback regarding the implementation and the metrics that would be important to measure. Let me know if you have any thoughts.
Some rough requirements:
Ease of use
Should be simple for people to build and run the benchmark on their devices
Model size in the range of 1B - 7B
Larger models do not look feasible at the moment, but can reconsider
Recently, we did a performance benchmark of
llama.cpp
for Apple Silicon M-series chips: #4167I am planning to do a similar benchmark for Apple's mobile chips that are used in iPhones and iPads:
https://en.wikipedia.org/wiki/Apple_silicon#A_series
This issue will track the progress of this work. I am also hoping to collect some feedback regarding the implementation and the metrics that would be important to measure. Let me know if you have any thoughts.
Some rough requirements:
Should be simple for people to build and run the benchmark on their devices
Larger models do not look feasible at the moment, but can reconsider
Ref:
The text was updated successfully, but these errors were encountered: