Add operator fusion optimization support #993
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
概述
实现了负载资源互感知的编译优化技术模块:算子融合优化框架,支持自动算子融合,通过内核级优化提升推理性能。
主要变更
核心融合模块
FusionScheduler: 运行时调度器,动态决策融合或回退执行路径
FusionConfig: 可配置的融合行为(开关、缓存、调试模式)
FusionHeuristics: 静态启发式规则进行融合决策(张量大小、节点数阈值)
KernelCompiler: 管理融合内核的编译和缓存
GraphConverter: 将计算图转换为可融合子图表示
SubGraph: 描述算子序列和依赖关系的数据结构
配置示例
文件变更