【TI-Consisent】Added Metric logits_stats to the ZMQ branch#6979
Open
liuruyan wants to merge 6 commits intoPaddlePaddle:developfrom
Open
【TI-Consisent】Added Metric logits_stats to the ZMQ branch#6979liuruyan wants to merge 6 commits intoPaddlePaddle:developfrom
liuruyan wants to merge 6 commits intoPaddlePaddle:developfrom
Conversation
|
Thanks for your contribution! |
…into logit_stat_dev
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #6979 +/- ##
==========================================
Coverage ? 73.83%
==========================================
Files ? 399
Lines ? 56093
Branches ? 8853
==========================================
Hits ? 41415
Misses ? 11707
Partials ? 2971
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
…into logit_stat_dev
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
背景:出于对训推一致性丰富检测指标及长期CI/CE监控考量,对sample之后的logits添加logits_stats(min/max/mean/std),用来保证确定性及稳定性。
Modifications
数据结构及接口:由于logprob与logits_stat同样是输出的重要检测指标且均从logits计算而来,暂时实现先将logits_stat存入LogprobsTensors数据结构保存,并升级logprob传播链路上相关接口,支持同时透传logits_stats
FLAG:添加与enable_logprob同级别model_config:
self.compute_logits_stats = False,且在server启动时支持配置--compute-logits-statsUsage or Command
--compute-logits-stats,--enable-logproblogprobs=True,top_logprobs=0response = client.chat.completions.create( model="null", messages=[ {"role": "system", "content": "I'm a helpful AI assistant."}, {"role": "user", "content": "把李白的静夜思改写为现代诗"}, ], stream=True, # False max_tokens=100, logprobs=True, top_logprobs=0 )Accuracy Tests
本PR不涉及精度修改
Checklist
[FDConfig],[APIServer],[Engine],[Scheduler],[PD Disaggregation],[Executor],[Graph Optimization],[Speculative Decoding],[RL],[Models],[Quantization],[Loader],[OP],[KVCache],[DataProcessor],[BugFix],[Docs],[CI],[Optimization],[Feature],[Benchmark],[Others],[XPU],[HPU],[GCU],[DCU],[Iluvatar],[Metax]]pre-commitbefore commit.releasebranch, make sure the PR has been submitted to thedevelopbranch, then cherry-pick it to thereleasebranch with the[Cherry-Pick]PR tag.