llama : benchmark for Apple Silicon A-series mobile chips #4358

ggerganov · 2023-12-07T09:33:40Z

Recently, we did a performance benchmark of llama.cpp for Apple Silicon M-series chips: #4167

I am planning to do a similar benchmark for Apple's mobile chips that are used in iPhones and iPads:

https://en.wikipedia.org/wiki/Apple_silicon#A_series

This issue will track the progress of this work. I am also hoping to collect some feedback regarding the implementation and the metrics that would be important to measure. Let me know if you have any thoughts.

Some rough requirements:

Ease of use
Should be simple for people to build and run the benchmark on their devices
Model size in the range of 1B - 7B
Larger models do not look feasible at the moment, but can reconsider

Ref:

Starting point would be the Swift examples:
- https://github.com/ggerganov/llama.cpp/tree/master/examples/llama.swiftui
- https://github.com/ggerganov/llama.cpp/tree/master/examples/batched.swift

The text was updated successfully, but these errors were encountered:

ggerganov · 2023-12-17T17:58:41Z

Collecting data here: #4508

ggerganov added the performance Speed related topics label Dec 7, 2023

ggerganov added this to ggml : roadmap Dec 7, 2023

ggerganov moved this to Todo in ggml : roadmap Dec 7, 2023

ggerganov mentioned this issue Dec 15, 2023

llama.swiftui : add bench functionality #4483

Merged

5 tasks

ggerganov closed this as completed Dec 17, 2023

ggerganov moved this from Todo to Done in ggml : roadmap Dec 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : benchmark for Apple Silicon A-series mobile chips #4358

llama : benchmark for Apple Silicon A-series mobile chips #4358

ggerganov commented Dec 7, 2023

ggerganov commented Dec 17, 2023

llama : benchmark for Apple Silicon A-series mobile chips #4358

llama : benchmark for Apple Silicon A-series mobile chips #4358

Comments

ggerganov commented Dec 7, 2023

ggerganov commented Dec 17, 2023