FastDeploy

[Bug fix] Sync status for caching output cache

Open rainyfly opened this issue 4 weeks ago • 2 comments

Motivation

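When caching output tokens, synchronize the request's state between the token processor and the resource manager, and confirm that the current request has not been preempted.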
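The check described above can be illustrated with a minimal sketch. This is not the code in this PR: `ToyResourceManager`, `ToyTokenProcessor`, and all of their methods and fields are hypothetical stand-ins, and the single shared lock is an assumption about how the two sides might agree on a request's state.

```python
import threading
from dataclasses import dataclass, field


@dataclass
class ToyResourceManager:
    """Hypothetical scheduler-side state; not FastDeploy's actual class."""

    lock: threading.Lock = field(default_factory=threading.Lock)
    running: set = field(default_factory=set)        # request ids currently scheduled
    preempted: set = field(default_factory=set)      # request ids that were preempted
    cached_tokens: dict = field(default_factory=dict)  # request id -> cached output tokens

    def preempt(self, req_id):
        # Mark the request as preempted under the same lock used for caching.
        with self.lock:
            self.running.discard(req_id)
            self.preempted.add(req_id)

    def cache_output_tokens(self, req_id, token_ids):
        """Cache tokens only if the request is still running. Returns True on success."""
        with self.lock:
            if req_id not in self.running or req_id in self.preempted:
                return False  # state changed under us; skip caching
            self.cached_tokens.setdefault(req_id, []).extend(token_ids)
            return True


class ToyTokenProcessor:
    """Hypothetical output-side component that asks the manager before caching."""

    def __init__(self, resource_manager):
        self.resource_manager = resource_manager

    def process(self, req_id, token_ids):
        # Delegate the preemption check and the cache write to one locked call,
        # so both components observe the same request state.
        return self.resource_manager.cache_output_tokens(req_id, token_ids)
```

In this sketch the preemption check and the cache write happen inside one critical section, so a request preempted between token generation and caching is skipped rather than cached with stale state.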

Modifications

Usage or Command

Accuracy Tests

Checklist

  • [ ] Add at least one tag to the PR title.
    • Tag list: [FDConfig], [APIServer], [Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]
    • You can add new tags based on the PR content, but the semantics must be clear.
  • [ ] Format your code, run pre-commit before commit.
  • [ ] Add unit tests. Please write the reason in this PR if no unit tests.
  • [ ] Provide accuracy results.
  • [ ] If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

rainyfly, Dec 16 '25 07:12

Thanks for your contribution!

paddle-bot[bot], Dec 16 '25 07:12

Codecov Report

Patch coverage is 28.57143% with 10 lines in your changes missing coverage. Please review.
Warning: please upload a report for BASE (develop@21fa2ba).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| fastdeploy/output/token_processor.py | 20.00% | 3 Missing and 1 partial |
| fastdeploy/engine/sched/resource_manager_v1.py | 0.00% | 2 Missing |
| fastdeploy/model_executor/pre_and_post_process.py | 33.33% | 1 Missing and 1 partial |
| fastdeploy/worker/gpu_model_runner.py | 33.33% | 1 Missing and 1 partial |

Additional details and impacted files
@@            Coverage Diff             @@
##             develop    #5584   +/-   ##
==========================================
  Coverage           ?   62.60%           
==========================================
  Files              ?      329           
  Lines              ?    41710           
  Branches           ?     6371           
==========================================
  Hits               ?    26114           
  Misses             ?    13621           
  Partials           ?     1975           

| Flag | Coverage Δ |
|---|---|
| GPU | 62.60% <28.57%> (?) |

codecov-commenter, Dec 16 '25 08:12