feat(tools): add triton_kernel_extractor pipeline #706
Merged
Xreki merged 1 commit on May 14, 2026
1cefba0 to 5db658d
Xreki previously approved these changes on May 14, 2026
5db658d to 8638de1
Migrate from PassNet.

This tool compiles subgraphs via graph_net_bench.torch.test_compiler, filters by speedup, and extracts autotuning-selected (subgraph, triton_kernel, ptx) triples. Now that it lives in GraphNet, PYTHONPATH for graph_net_bench is resolved automatically from the repo root.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
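For readers unfamiliar with the "filters by speedup" step in the commit message above, here is a minimal sketch of what such a filter might look like. It is not the code in this PR. Only the `[Speedup][kernel]:` tag comes from the PR description. The exact log line shape, the helper names `parse_kernel_speedup` and `select_improved`, and the 1.0 threshold are assumptions made for illustration.

```python
import re

# ASSUMPTION: the benchmark log contains lines like "[Speedup][kernel]: 1.37".
# Only the "[Speedup][kernel]:" tag is taken from the PR text; the numeric
# layout after it is a guess.
SPEEDUP_RE = re.compile(
    r"\[Speedup\]\[kernel\]:\s*([0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)"
)


def parse_kernel_speedup(log_text):
    """Return the last [Speedup][kernel] value found in log_text, or None."""
    matches = SPEEDUP_RE.findall(log_text)
    return float(matches[-1]) if matches else None


def select_improved(logs, threshold=1.0):
    """Map sample name -> speedup for samples strictly above threshold.

    `logs` maps a (hypothetical) sample name to its benchmark log text.
    Samples without a parsable metric are skipped rather than treated as 0.
    """
    selected = {}
    for name, text in logs.items():
        speedup = parse_kernel_speedup(text)
        if speedup is not None and speedup > threshold:
            selected[name] = speedup
    return selected
```

Calling `select_improved({"sample_a": log_a, "sample_b": log_b})` would return only the samples whose kernel-level speedup exceeds the threshold. The threshold is strict here, so a sample that merely breaks even is dropped.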
8638de1 to 6d28d69
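Migrates triton_kernel_extractor from the PassNet repository (PaddlePaddle/PassNet#80) to GraphNet. The tool depends on graph_net_bench.torch.test_compiler and graph_net_visual, so it now lives in the same repository.

Function

Runs a five-step pipeline over subgraph samples and produces paired (subgraph, triton_kernel, ptx) triples. The steps include:

- Compile the subgraphs (graph_net_bench.torch.test_compiler --kernel-time)
- Use the [Speedup][kernel]: metric to select the samples that were actually sped up
- Extract the autotuning-selected Triton kernel source and its corresponding PTX from output_code.py

Usage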