[moe](feat): fuse shared expert to moe ops #734
Merged

Purpose
This PR fuses the shared experts into the MoE ops.
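For readers unfamiliar with the idea, the following is a minimal sketch of one common way to fuse a shared expert into a routed MoE op: the shared expert is appended as an extra expert that every token selects with gate weight 1.0. It is an illustration under that assumption, not the kernel code in this PR. All names (`expert_ffn`, `moe_unfused`, `moe_fused`) are hypothetical, and the toy "expert" is a scalar multiply rather than a real FFN.

```python
def expert_ffn(x, w):
    # Toy stand-in for an expert FFN: scale each element by a scalar weight.
    return [w * v for v in x]


def moe_unfused(x, routed_w, shared_w, topk_ids, topk_weights):
    # Routed experts are combined by gate weight, then the shared expert
    # is computed in a separate pass and added on top.
    out = [0.0] * len(x)
    for eid, gate in zip(topk_ids, topk_weights):
        y = expert_ffn(x, routed_w[eid])
        out = [o + gate * v for o, v in zip(out, y)]
    shared = expert_ffn(x, shared_w)
    return [o + s for o, s in zip(out, shared)]


def moe_fused(x, routed_w, shared_w, topk_ids, topk_weights):
    # Assumption: the shared expert is appended as the last expert and is
    # always selected with gate weight 1.0, so one MoE pass covers both.
    all_w = list(routed_w) + [shared_w]
    shared_id = len(routed_w)
    ids = list(topk_ids) + [shared_id]
    gates = list(topk_weights) + [1.0]
    out = [0.0] * len(x)
    for eid, gate in zip(ids, gates):
        y = expert_ffn(x, all_w[eid])
        out = [o + gate * v for o, v in zip(out, y)]
    return out
```

In this toy setting both functions return the same result for the same routing decision; the fused form simply avoids a separate shared-expert pass.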
Test Plan
Test Result
accuracy:
{"id":"cmpl-66a4322ffead454bb9bbf2534b93a2ef","object":"text_completion","created":1760343690,"model":"/mnt/raid0/zhangguopeng/models--EmbeddedLLM--deepseek-r1-FP8-Dynamic/snapshots/bba2f4ce814e9b57dc7260c8071f536b5e1bd483/","choices":[{"index":0,"text":" is Beijing, and Shanghai is its most populous city by urban area population. China is divided into 22 provinces, five autonomous regions, four municipalities, and two semi-autonomous special administrative regions. Hong Kong and Macau are the two special administrative regions.\n\nWhat is the capital of China?\n\nBeijing is the capital of the People's Republic of China and one of the most populous cities in the world.\n\nWhat is the capital of China in 1949?\n\nOn October 1, 1949,","logprobs":null,"finish_reason":"length","stop_reason":null,"token_ids":null,"prompt_logprobs":null,"prompt_token_ids":null}],"service_tier":null,"system_fingerprint":null,"usage":{"prompt_tokens":5,"total_tokens":105,"completion_tokens":100,"prompt_tokens_details":null},"kv_transfer_params":null}
performance:

Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.