Integrate Automated QDQ placement tool - Part 1 #701

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

willg-nv wants to merge 1 commit into NVIDIA:main from willg-nv:dev-willg-integrate-auto-qdq-placement-part1

+2,198 −0

willg-nv commented Dec 17, 2025 •

edited

Loading

What does this PR do?

Type of change: new feature

Overview: This PR integrates an automatical QDQ placment tool into ModelOpt.

This PR is the 1/4 parts of the change, it contains the following changes:

Defines common types: Region, RegionType, Error types
Defines InsertionPoints (the logical localtion to place QDQ pairs), InsertionScheme (a set of insertion points)
Unit tests for new types

Part 1: #701
Part 2: #702
Part 3: #703
Part 4: #704

Usage

        # Region type usage:
        region = Region(region_id=1, level=0, region_type=RegionType.LEAF)
        assert region.get_id() == 1
        assert region.get_level() == 0
        region.add_node(1) # 1 is the index of ONNX graph node
        ...

        point = NodeInputInsertionPoint(node_index=0, input_index=2)
        assert point.node_index == 0 # relative node index in region
        assert point.input_index == 2 # relative input tensor index in specific node
        resolved = point.resolve(region, graph)
        ...

Testing

Implement unit tests, all tests could get passed.

Before your PR is "Ready for review"

Make sure you read and follow Contributor guidelines and your commits are signed.
Is this change backward compatible?: Yes
Did you write any new necessary tests?: Yes
Did you add or update any necessary documentation?: No, document change will be included in part 4.
Did you update Changelog?: No, this could be done when all parts of the change are merged.

Additional Information

willg-nv requested a review from a team as a code owner

December 17, 2025 06:18

willg-nv requested a review from gcunhase

December 17, 2025 06:18

copy-pr-bot bot commented Dec 17, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

This was referenced Dec 17, 2025

Integrate Automated QDQ placement tool - Part 2 #702

Open

Integrate Automated QDQ placement tool - Part 3 #703

Open

Integrate Automated QDQ placement tool - Part 4 #704

Open

longlee0622 reviewed

View reviewed changes

modelopt/onnx/quantization/autotune/common.py Outdated Show resolved Hide resolved


          Integrate Automated QDQ placement tool - part 1

f872e70

Signed-off-by: Will Guo <[email protected]>

willg-nv force-pushed the dev-willg-integrate-auto-qdq-placement-part1 branch from 9c53783 to f872e70 Compare

December 19, 2025 05:32

Author

willg-nv commented Dec 22, 2025

Hi @gcunhase, could you help me review this PR? thanks!

ajrasane reviewed

View reviewed changes

modelopt/onnx/quantization/autotune/common.py Show resolved Hide resolved

ajrasane reviewed

View reviewed changes

modelopt/onnx/quantization/autotune/common.py

+                      self.children.append(child)
+                      child.set_parent(self)
+                  def _is_descendant_of(self, potential_ancestor: "Region") -> bool:

Contributor

ajrasane Jan 5, 2026

This seems like a handy util. Do we want to keep this public?

modelopt/onnx/quantization/autotune/common.py

+                      """
+                      return self.nodes
+                  def get_all_nodes_recursive(self, _visited: set[int] | None = None) -> set[int]:

Contributor

ajrasane Jan 6, 2026

nit: can be renamed to get_region_nodes_and_descendants()

modelopt/onnx/quantization/autotune/common.py

Comment on lines +277 to +280

+                      # Detect cycles
+                      if self.id in _visited:
+                          logger.warning(f"Cycle detected in region {self.id} during node traversal")
+                          return set()

Contributor

ajrasane Jan 6, 2026

add_child() alaready prevents adding cycles in the graph. So I think this check is redundant.

Author

willg-nv Jan 7, 2026

I think this check could help avoid infinite loop when graph editing encountered cycle is unexpectedly. I think this lines could be replaced with an assert statement which could make debug easier.

modelopt/onnx/quantization/autotune/common.py

+                          logger.warning(f"Cycle detected in region {self.id} during node traversal")
+                          return set()
+                      _visited.add(self.id)

Contributor

ajrasane Jan 6, 2026

We can remove _visited in that case too.

modelopt/onnx/quantization/autotune/common.py

Comment on lines +297 to +299

+                      # Detect cycles
+                      if self.id in _visited:
+                          return False

Contributor

ajrasane Jan 6, 2026

visited and cycle detection logic can be removed from here too.

tests/unit/onnx/quantization/autotune/test_insertion_points.py

Comment on lines +338 to +376

+              def run_tests():
+                  """Run all insertion point and scheme tests."""
+                  print("=" * 70)
+                  print("Autotuner Insertion Points & Schemes Test Suite")
+                  print("=" * 70)
+                  # Create test suite
+                  loader = unittest.TestLoader()
+                  suite = unittest.TestSuite()
+                  # Add test classes
+                  suite.addTests(loader.loadTestsFromTestCase(TestNodeInputInsertionPoint))
+                  suite.addTests(loader.loadTestsFromTestCase(TestRegionOutputInsertionPoint))
+                  suite.addTests(loader.loadTestsFromTestCase(TestChildRegionInputInsertionPoint))
+                  suite.addTests(loader.loadTestsFromTestCase(TestInsertionScheme))
+                  # Run with verbose output
+                  runner = unittest.TextTestRunner(verbosity=2)
+                  result = runner.run(suite)
+                  # Summary
+                  print("\n" + "=" * 70)
+                  print("Test Summary")
+                  print("=" * 70)
+                  print(f"Tests run: {result.testsRun}")
+                  print(f"Successes: {result.testsRun - len(result.failures) - len(result.errors)}")
+                  print(f"Failures: {len(result.failures)}")
+                  print(f"Errors: {len(result.errors)}")
+                  if result.wasSuccessful():
+                      print("\n✓ All insertion point and scheme tests passed!")
+                      return 0
+                  else:
+                      print("\n✗ Some tests failed")
+                      return 1
+              if __name__ == "__main__":
+                  sys.exit(run_tests())

Contributor

ajrasane Jan 6, 2026

Please remove this boilerplate code. You can refer to the other tests to see how they are called with pytest.

tests/unit/onnx/quantization/autotune/test_region.py

+                      assert region.get_type() == RegionType.LEAF
+                      assert region.get_parent() is None
+                      assert len(region.get_children()) == 0
+                      print("✓ LEAF region creation")

Contributor

ajrasane Jan 6, 2026

Remove the print statements from this file too

tests/unit/onnx/quantization/autotune/test_region.py

+              class TestRegion(unittest.TestCase):
+                  """Test Region class functionality."""
+                  def test_leaf_region_creation(self):

Contributor

ajrasane Jan 6, 2026

The three region creation tests are almost identical. It would be great if you could parametrize them.

tests/unit/onnx/quantization/autotune/test_region.py

Comment on lines +129 to +145

+                  def test_is_leaf(self):
+                      """Test checking if region is LEAF type."""
+                      leaf = Region(region_id=1, level=0, region_type=RegionType.LEAF)
+                      composite = Region(region_id=2, level=1, region_type=RegionType.COMPOSITE)
+                      assert leaf.get_type() == RegionType.LEAF
+                      assert composite.get_type() != RegionType.LEAF
+                      print("✓ Region LEAF type check")
+                  def test_is_composite(self):
+                      """Test checking if region is COMPOSITE type."""
+                      leaf = Region(region_id=1, level=0, region_type=RegionType.LEAF)
+                      composite = Region(region_id=2, level=1, region_type=RegionType.COMPOSITE)
+                      assert leaf.get_type() != RegionType.COMPOSITE
+                      assert composite.get_type() == RegionType.COMPOSITE
+                      print("✓ Region COMPOSITE type check")

Contributor

ajrasane Jan 6, 2026

These two tests can also be combined.

tests/unit/onnx/quantization/autotune/test_region.py

Comment on lines +189 to +219

+              def run_tests():
+                  """Run all Region tests."""
+                  print("=" * 70)
+                  print("Region Class Test Suite")
+                  print("=" * 70)
+                  loader = unittest.TestLoader()
+                  suite = unittest.TestSuite()
+                  suite.addTests(loader.loadTestsFromTestCase(TestRegion))
+                  runner = unittest.TextTestRunner(verbosity=2)
+                  result = runner.run(suite)
+                  print("\n" + "=" * 70)
+                  print("Test Summary")
+                  print("=" * 70)
+                  print(f"Tests run: {result.testsRun}")
+                  print(f"Successes: {result.testsRun - len(result.failures) - len(result.errors)}")
+                  print(f"Failures: {len(result.failures)}")
+                  print(f"Errors: {len(result.errors)}")
+                  if result.wasSuccessful():
+                      print("\n✓ All Region tests passed!")
+                      return 0
+                  else:
+                      print("\n✗ Some tests failed")
+                      return 1
+              if __name__ == "__main__":
+                  sys.exit(run_tests())

Contributor

ajrasane Jan 6, 2026

This boilerplate code can be removed too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

ajrasane ajrasane left review comments

gcunhase Awaiting requested review from gcunhase gcunhase is a code owner automatically assigned from NVIDIA/modelopt-onnx-codeowners

+1 more reviewer

longlee0622 longlee0622 left review comments

At least 1 approving review is required to merge this pull request.

Labels

None yet