Conversation

Collaborator

@lizexu123 lizexu123 commented Jul 16, 2025

Feature description

Referencing the flashinfer implementation (thanks!), this PR implements min_p_from_prob. It supports passing min_p as a tensor, and provides both a GPU kernel implementation and a composition of Paddle small ops.

Usage

Serving request:

from openai import OpenAI

# Assumes a FastDeploy server exposing an OpenAI-compatible endpoint;
# adjust base_url/port to your deployment.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="default",
    messages=[
        {"role": "user", "content": "Where is Tiananmen Square in Beijing?"},
    ],
    temperature=0.1,
    metadata={"min_p": 0.1},
    stream=False,
)

print(response.choices[0].message.content)
print("\n")

Offline usage:

from fastdeploy.engine.sampling_params import SamplingParams
from fastdeploy.entrypoints.llm import LLM

model_name_or_path = "Qwen/Qwen3-0.6B"

sampling_params = SamplingParams(temperature=1.0, min_p=0.1)
llm = LLM(model=model_name_or_path, tensor_parallel_size=1, reasoning_parser="qwen3")
prompt = "Where is Tiananmen Square in Beijing?"
messages = [{"role": "user", "content": prompt}]
output = llm.chat([messages], sampling_params)

print(output)
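For context, min_p sampling keeps only the tokens whose probability is at least min_p times the probability of the most likely token, then renormalizes over the survivors. A minimal NumPy sketch of the idea (illustrative only, not the GPU kernel from this PR):

```python
import numpy as np

def min_p_filter(probs, min_p):
    # min_p sampling: keep tokens with probability >= min_p * max(probs),
    # zero out the rest, then renormalize the survivors.
    threshold = min_p * probs.max(axis=-1, keepdims=True)
    kept = np.where(probs >= threshold, probs, 0.0)
    return kept / kept.sum(axis=-1, keepdims=True)

probs = np.array([[0.5, 0.3, 0.15, 0.05]])
# Threshold is 0.5 * 0.5 = 0.25, so only 0.5 and 0.3 survive,
# renormalized to [0.625, 0.375, 0, 0].
print(min_p_filter(probs, 0.5))
```

Because the threshold scales with the peak of the distribution, min_p prunes aggressively when the model is confident and permissively when it is not.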


paddle-bot bot commented Jul 16, 2025

Thanks for your contribution!

sampled_token_ids=next_tokens,
logprobs_tensors=logprobs_tensors,
)
self.step+=1
Collaborator

delete it

"""
min_p_sampling
"""
if paddle.count_nonzero(min_p_arr)==0:
Collaborator

pre-commit all files

Collaborator Author

Has our pre-commit stopped working?

Collaborator

@qingqing01 qingqing01 left a comment

The unit test needs to verify correctness by comparing against the composition of small ops.
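In that spirit, a self-contained sketch of such a comparison test, where `min_p_fused` is a hypothetical stand-in for the kernel under test and the reference is composed from elementary ops (all names here are illustrative, not FastDeploy APIs):

```python
import numpy as np

def min_p_reference(probs, min_p_arr):
    # Reference built from small ops: per-row threshold is
    # min_p * max prob; zero out tokens below it and renormalize.
    top = probs.max(axis=-1, keepdims=True)
    kept = np.where(probs >= min_p_arr[:, None] * top, probs, 0.0)
    return kept / kept.sum(axis=-1, keepdims=True)

def min_p_fused(probs, min_p_arr):
    # Placeholder for the fused kernel under test (hypothetical);
    # in a real test this would call the custom op.
    return min_p_reference(probs, min_p_arr)

rng = np.random.default_rng(0)
logits = rng.normal(size=(4, 32))
probs = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)
min_p_arr = np.array([0.0, 0.1, 0.3, 0.5])  # per-request min_p tensor

np.testing.assert_allclose(min_p_fused(probs, min_p_arr),
                           min_p_reference(probs, min_p_arr), rtol=1e-6)
```

Note that min_p passed as a tensor lets each request in the batch use its own threshold, with 0.0 disabling filtering for that row.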

# limitations under the License.


import matplotlib.pyplot as plt
Collaborator

The unit test should not need to depend on matplotlib.

@lizexu123 lizexu123 force-pushed the min_p_1 branch 2 times, most recently from 644ac9d to 13d4cdd Compare July 18, 2025 09:56
@yuanlehome yuanlehome merged commit 67990e0 into PaddlePaddle:develop Jul 21, 2025
4 of 5 checks passed
Deleter-D pushed a commit to Deleter-D/FastDeploy that referenced this pull request Jul 22, 2025
* Fastdeploy support min_p

* add test_min_p

* fix

* min_p_sampling

* update

* delete vl_gpu_model_runner.py

* fix

* Align usage of min_p with vLLM

* fix

* modified unit test

* fix test_min_sampling

* pre-commit all files

* fix

* fix

* fix

* fix xpu_model_runner.py