change uses of Embedding.weight to Embedding.asLinear #231

davidkoski · 2025-03-10T16:54:43Z

There are several Modules that use

open class Embedding: Module, UnaryLayer, Quantizable {
    public let weight: MLXArray

directly instead of using:

    /// Call the embedding layer as a linear layer.
    ///
    /// Use this for example when input embedding and output projection
    /// weights are tied.
    open func asLinear(_ x: MLXArray) -> MLXArray {
        matmul(x, weight.T)
    }

e.g. Cohere, Starcoder2, OpenELM (this last is fixed recently).

        out = matmul(out, model.embedTokens.weight.T)

should be:

        out = model.embedTokens.asLinear(out)

davidkoski added the good first issue Good for newcomers label Mar 10, 2025

davidkoski mentioned this issue Mar 10, 2025

Fix #218 - unable to load OpenELM #228

Merged

davidkoski added a commit that referenced this issue Mar 10, 2025

fix #231 -- use Embedding.asLinear

9fefdde

davidkoski closed this as completed in f35df96 Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

change uses of Embedding.weight to Embedding.asLinear #231

change uses of Embedding.weight to Embedding.asLinear #231

davidkoski commented Mar 10, 2025

change uses of Embedding.weight to Embedding.asLinear #231

change uses of Embedding.weight to Embedding.asLinear #231

Comments

davidkoski commented Mar 10, 2025