Request Support for Mistral-8x22B #6580


Closed
rankaiyx opened this issue Apr 10, 2024 · 16 comments
Labels
enhancement New feature or request stale

Comments

@rankaiyx
Contributor

Feature Description

Support for Mixtral-8x22B

Mistral AI has just released another large model via magnet link: Mistral 8x22B, with a model file size of 281.24 GB.

Judging by the name, Mistral 8x22B is a scaled-up version of "mixtral-8x7b", which was released last year, with more than triple the parameter count: it is made up of eight expert networks with 22 billion parameters each (8 x 22B).

magnet:?xt=urn:btih:9238b09245d0d8cd915be09927769d5f7584c1c9&dn=mixtral-8x22b&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce

Motivation

It should be a good model.

@rankaiyx rankaiyx added the enhancement New feature or request label Apr 10, 2024
@LiuChaoXD

+1

@anunknowperson

It is not Mistral Medium, it's a new model. Mistral Medium has a different context length, etc., and Mistral Medium was leaked earlier.
They said it's a brand-new model.

@phymbert
Collaborator

Did someone download the torrent? Is it an HF model with modeling code, or only weights without the architecture?

@rankaiyx
Contributor Author

It is not Mistral Medium, it's a new model. Mistral Medium has a different context length, etc., and Mistral Medium was leaked earlier. They said it's a brand-new model.

Okay, I'll change the title.

@simsi-andy

simsi-andy commented Apr 10, 2024

@phymbert

Don't know if it's useful, but it's already up on Hugging Face. https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1

(You'll find many uploads).

@rankaiyx rankaiyx changed the title Request Support for Mistral-Medium:8x22B Request Support for Mistral-8x22B Apr 10, 2024
@phymbert
Collaborator

Don't know if it's useful, but it's already up on Hugging Face. https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1

It is useful, thanks. I did not notice they changed the org. Let's go then.

@simsi-andy

It just works. =D

https://huggingface.co/MaziyarPanahi/Mixtral-8x22B-v0.1-GGUF/tree/main

@digiwombat
Contributor

Confirmed the IQ3_XS runs without changes.

@Dampfinchen

Dampfinchen commented Apr 10, 2024

Is it really the exact same architecture though? Perhaps there are some subtle optimizations.

@phymbert
Collaborator

@schmorp

schmorp commented Apr 18, 2024

Unfortunately, convert fails with Mixtral 8x22b instruct:

ValueError: Vocab size mismatch (model has 32768, but Mixtral-8x22B-Instruct-v0.1/tokenizer.json has 32769).

This small mismatch (sometimes off by 1, sometimes by a few more) is actually a very common problem with older models that I quantize, but because they are older, I haven't bothered reporting it until now.
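The mismatch can be checked before running convert: compare the `vocab_size` declared in `config.json` with the number of distinct tokens actually present in `tokenizer.json` (base vocab plus any `added_tokens` not already in the vocab). A minimal sketch, assuming the standard Hugging Face file layout; the function names here are illustrative, not part of convert.py:

```python
import json


def count_tokenizer_vocab(tokenizer_json: dict) -> int:
    """Count distinct tokens in a HF tokenizer.json structure:
    the base vocab plus any added_tokens not already in it."""
    vocab = tokenizer_json["model"]["vocab"]
    added = [t for t in tokenizer_json.get("added_tokens", [])
             if t["content"] not in vocab]
    return len(vocab) + len(added)


def vocab_mismatch(config_json: dict, tokenizer_json: dict) -> int:
    """Return actual - declared; non-zero means convert will complain."""
    return count_tokenizer_vocab(tokenizer_json) - config_json["vocab_size"]


def check_model_dir(model_dir: str) -> int:
    """Load the two files from a model directory and report the mismatch."""
    with open(f"{model_dir}/config.json") as f:
        cfg = json.load(f)
    with open(f"{model_dir}/tokenizer.json") as f:
        tok = json.load(f)
    return vocab_mismatch(cfg, tok)
```

For the error quoted above, `vocab_mismatch` would return 1 (32769 tokens in tokenizer.json against a declared 32768).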

@stefanvarunix

#6740

@tholin

tholin commented Apr 19, 2024

Unfortunately, convert fails with Mixtral 8x22b instruct:

ValueError: Vocab size mismatch (model has 32768, but Mixtral-8x22B-Instruct-v0.1/tokenizer.json has 32769).

This small mismatch (sometimes off by 1, sometimes by a few more) is actually a very common problem with older models that I quantize, but because they are older, I haven't bothered reporting it until now.

That is because of a bug in the original Mistral AI upload. Open the file tokenizer.json and change "TOOL_RESULT" to "TOOL_RESULTS" and the conversion should work.

https://huggingface.co/mistralai/Mixtral-8x22B-Instruct-v0.1/discussions/6
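The manual edit above can also be scripted. A minimal sketch of the same fix, assuming the file is edited as plain text; the negative lookahead keeps any already-correct TOOL_RESULTS occurrences from gaining a double S:

```python
import re
from pathlib import Path


def fix_tool_result(text: str) -> str:
    """Rename the misspelled TOOL_RESULT token to TOOL_RESULTS.
    The (?!S) lookahead leaves existing TOOL_RESULTS occurrences untouched."""
    return re.sub(r"TOOL_RESULT(?!S)", "TOOL_RESULTS", text)


def patch_tokenizer(path: str) -> None:
    """Apply the fix in place to a tokenizer.json file."""
    p = Path(path)
    p.write_text(fix_tool_result(p.read_text()))
```

After patching, re-running the conversion should no longer hit the vocab size mismatch.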

@schmorp

schmorp commented Apr 20, 2024

@tholin: indeed, thanks a lot!

@schmorp

schmorp commented Apr 20, 2024

@tholin: while convert.py succeeds, it results in an 11 GB output file, so something still doesn't work. (b2699)

Update: no longer happens with b2715

@github-actions github-actions bot added the stale label May 24, 2024
Contributor

github-actions bot commented Jun 7, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.

@github-actions github-actions bot closed this as completed Jun 7, 2024
Development

No branches or pull requests

10 participants