Closed
Description
When trying to load Qwen1.5, the model downloads fully but doesn't appear to load in to memory on MacOS or iOS. After typing a prompt, the error output is "Failed: unhandledKeys(base: "Embedding", keys: ["biases", "scales"])
Using MLX 0.11.0
Other linked models work as per the repo code but this is the smallest, which looks like best one for older devices with less RAM and would be great to get it working.