[OMNIML-3252][ONNX] Add real Q/DQ scales in Autotune #951
Open
gcunhase wants to merge 42 commits into NVIDIA:main from gcunhase:dev/gcunhasergio/autotune_real_qdq_scales
+788
−121
42 commits
3554ecf Initial autotune codebase (gcunhase)
56a67e3 Add more tests (gcunhase)
6d58b4a Refactor: PR #702 (gcunhase)
710319a Remove python path in tests (gcunhase)
98a60b5 Recover docstrings and simplify code (->, , ) (gcunhase)
91cef9c Added unittest for workflows.py (failing) (gcunhase)
7937cc2 Fix: 'Autotuning failed: 'PatternSchemes' object has no attribute 'no… (gcunhase)
7c4e14b Updated workflow test to test TRT and PythonTRT benchmarking (gcunhase)
64836ed Fix test: use_trtexec flag (gcunhase)
a9af36a Add real scales to Q/DQ nodes (gcunhase)
29e8dd2 fix precommit failures (gcunhase)
7f69882 Fix: Add->Q/DQ->Activation(Relu) (gcunhase)
bb030be Fix: correctly dequantize Add input with shared Q/DQ (gcunhase)
616227c [5916893] Fix weighted ops quantization logic: both input and weights… (gcunhase)
afee0a4 Changed keep_output_dir to True as default (gcunhase)
faf0bbb test_workflow was moved to 'tests/gpu/onnx' (gcunhase)
08bf713 Removed cli.py, moved into __main__.py (gcunhase)
81fce48 Removed PatternSchemes import from region_pattern.py: no longer needed. (gcunhase)
7a57b8d Added intermediate Autotune model to be removed at the end of the qua… (gcunhase)
a71fc91 Removed _MUTATION_SPECS from autotuner.py: moved to autotuner_base.py (gcunhase)
01e8be0 Removed test_config and test_pattern_cache. Should be added in the or… (gcunhase)
ad7a60d Fixed minor coderabbit suggestions (gcunhase)
7589668 Moved autotune imports to the top of the file (gcunhase)
db4c3ef Eliminate intermediate ONNX export in _find_nodes_to_quantize_autotun… (gcunhase)
42a0bdf Add support for Add->Q/DQ->Relu patterns by including those 'Add' nod… (gcunhase)
a70dbd3 Add integration test (gcunhase)
e1c8af7 Remove 'keep_output_dir' arg (no longer needed due to tmp path) (gcunhase)
159b9f2 Remove 'get_quantized_nodes' and other comments that are no longer ne… (gcunhase)
51df982 Added docstring for 'default_dq_dtype' in workflows.py (gcunhase)
1dc03cd Added mode presets and additional autotune configurations (gcunhase)
ddacbcb Fixed tmp_path in test (gcunhase)
689a907 Fixed copilot comments (gcunhase)
b64322f Fix: skip rewiring in graph_utils if no index is found. This prevents… (gcunhase)
0a32bea Match args for preset mode default (gcunhase)
7730b51 Exposed _StoreWithExplicitFlag (gcunhase)
eb0e064 Renamed new_ips to new_insertion_points (gcunhase)
7cc54a5 Address coderabbit and copilot issues + other minor issues (gcunhase)
8634b74 Address additional coderabbit and copilot issues (gcunhase)
0d82f64 Added real scales test in the integration workflow (gcunhase)
ee87330 Address additional copilot issues: includes fix for op_types_to_quant… (gcunhase)
1a531b9 nit: added docstring and comment (gcunhase)
ede8df0 Created autotune utils (gcunhase)