[RFC] Implement a mechanism to detect the type of model being read

With all the variant of ML model out now - gpt2/gptneox/llama/gptj, I wonder if theres a way to infer the model's type from reading it?...

Right now, if someone gives me a random model file with obscured name, I'd first need to checksum it, then look up the hash on HF for the model cards, then look through their docs/paper for the model type, and sometime I'd get confused between gptj/gptneox/llama hahah


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RFC] Implement a mechanism to detect the type of model being read #147

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[RFC] Implement a mechanism to detect the type of model being read #147

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions