[Feature] support ep in mixed mode #3001

ltd0924 · 2025-07-24T05:55:21Z

support expert parallel in mixed mode

example：
'''
python -m fastdeploy.entrypoints.openai.api_server
--model ERNIE-4.5-300B-A47B-BF16
--port 8180 --metrics-port 8181
--engine-worker-queue-port 8182
--cache-queue-port 8183
--quantization wint4
--data-parallel-size 8 --tensor-parallel-size 1
--enable-expert-parallel
--scheduler-name "splitwise"
--scheduler-host "127.0.0.1"
--scheduler-port 6379
--scheduler-ttl 9000
'''

Note：
When deploying, you need to configure and install Redis as a scheduler. where scheduler host is the address of Redis, and scheduler port is the port number of Redis.

You can refer to the documentation for installing REDIS.

paddle-bot · 2025-07-24T05:55:29Z

Thanks for your contribution!

[LLM] support ep

ada1e9d

ltd0924 added 3 commits July 29, 2025 11:41

Update worker_process.py

38b1efe

Update expert_service.py

a9fa7f1

Merge branch 'develop' into RL

0661d4f

ltd0924 changed the title ~~[test] support ep in mixed mode~~ [Feature] support ep in mixed mode Jul 29, 2025

ltd0924 added 2 commits July 29, 2025 15:10

Update worker_process.py

ffa59fd

Merge branch 'develop' into RL

3e91547

ltd0924 force-pushed the RL branch from 0c10e14 to 3e91547 Compare July 30, 2025 08:16

format files

ace1d91

Jiang-Jia-Jun approved these changes Jul 30, 2025

View reviewed changes

ltd0924 merged commit d17886d into PaddlePaddle:develop Jul 30, 2025
12 of 18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] support ep in mixed mode #3001

[Feature] support ep in mixed mode #3001

Uh oh!

ltd0924 commented Jul 24, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Jul 24, 2025

Uh oh!

Uh oh!

Uh oh!

[Feature] support ep in mixed mode #3001

[Feature] support ep in mixed mode #3001

Uh oh!

Conversation

ltd0924 commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paddle-bot bot commented Jul 24, 2025

Uh oh!

Uh oh!

Uh oh!

ltd0924 commented Jul 24, 2025 •

edited

Loading