FastDeploy icon indicating copy to clipboard operation
FastDeploy copied to clipboard

[Cherry-Pick][Feature]Add a switch for logprobs/prompt_logprobs token decoding.(#5436)

Open qwes5s5 opened this issue 1 month ago • 2 comments

Motivation

:bulb: If this PR is a Cherry Pick, the PR title needs to follow the format by adding the [Cherry-Pick] label at the very beginning and appending the original PR ID at the end. For example, [Cherry-Pick][CI] Add check trigger and logic(#5191)

:bulb: 如若此PR是Cherry Pick,PR标题需遵循格式,在最开始加上[Cherry-Pick]标签,以及最后面加上原PR ID,例如[Cherry-Pick][CI] Add check trigger and logic(#5191)

Modifications

Usage or Command

Accuracy Tests

Checklist

  • [x] Add at least a tag in the PR title.
    • Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
    • You can add new tags based on the PR content, but the semantics must be clear.
  • [x] Format your code, run pre-commit before commit.
  • [x] Add unit tests. Please write the reason in this PR if no unit tests.
  • [x] Provide accuracy results.
  • [x] If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

qwes5s5 avatar Dec 15 '25 13:12 qwes5s5

Thanks for your contribution!

paddle-bot[bot] avatar Dec 15 '25 13:12 paddle-bot[bot]

Codecov Report

:x: Patch coverage is 47.05882% with 9 lines in your changes missing coverage. Please review. :warning: Please upload report for BASE (release/2.4@99b4024). Learn more about missing BASE report.

Files with missing lines Patch % Lines
fastdeploy/entrypoints/openai/serving_chat.py 36.36% 4 Missing and 3 partials :warning:
...astdeploy/entrypoints/openai/serving_completion.py 50.00% 1 Missing and 1 partial :warning:
Additional details and impacted files
@@              Coverage Diff               @@
##             release/2.4    #5572   +/-   ##
==============================================
  Coverage               ?   59.01%           
==============================================
  Files                  ?      327           
  Lines                  ?    40636           
  Branches               ?     6178           
==============================================
  Hits                   ?    23980           
  Misses                 ?    14792           
  Partials               ?     1864           
Flag Coverage Δ
GPU 59.01% <47.05%> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.

:rocket: New features to boost your workflow:
  • :snowflake: Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

codecov-commenter avatar Dec 15 '25 15:12 codecov-commenter