llama : benchmark for Apple Silicon A-series mobile chips #4358

Closed
ggerganov opened this issue Dec 7, 2023 · 1 comment
Labels
performance Speed related topics

Comments

@ggerganov

Recently, we did a performance benchmark of llama.cpp for Apple Silicon M-series chips: #4167

I am planning to do a similar benchmark for Apple's mobile chips that are used in iPhones and iPads:

https://en.wikipedia.org/wiki/Apple_silicon#A_series

This issue will track the progress of this work. I am also hoping to collect some feedback regarding the implementation and the metrics that would be important to measure. Let me know if you have any thoughts.

Some rough requirements:

  • Ease of use
    Should be simple for people to build and run the benchmark on their devices
  • Model size in the range of 1B - 7B
    Larger models do not look feasible at the moment, but we can reconsider
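
To make the 1B - 7B range above more concrete, here is a rough back-of-the-envelope sketch of how much memory the weights alone would take at a few bit widths. The bit widths and the helper names (`weight_bytes`, `to_gib`) are assumptions chosen for illustration and do not come from this issue; the estimate ignores the KV cache, activations, and any per-block quantization overhead, so real numbers would be somewhat higher.

```python
def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in bytes."""
    return n_params * bits_per_weight / 8


def to_gib(n_bytes: float) -> float:
    """Convert bytes to GiB (2**30 bytes)."""
    return n_bytes / 2**30


# Illustrative bit widths (assumptions, not taken from the issue).
BITS = {"16-bit": 16.0, "8-bit": 8.0, "4-bit": 4.0}

for n_params in (1e9, 3e9, 7e9):
    row = ", ".join(
        f"{name}: {to_gib(weight_bytes(n_params, bits)):.1f} GiB"
        for name, bits in BITS.items()
    )
    print(f"{n_params / 1e9:.0f}B params -> {row}")
```

Comparing these figures against the RAM of a given device gives a first hint of which model sizes are worth trying on it.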

Ref:

@ggerganov ggerganov added the performance Speed related topics label Dec 7, 2023
@ggerganov ggerganov moved this to Todo in ggml : roadmap Dec 7, 2023
@ggerganov

Collecting data here: #4508

@ggerganov ggerganov moved this from Todo to Done in ggml : roadmap Dec 17, 2023