Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data type #251

vraspar · 2025-11-10T22:20:16Z

Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data types following ONNX spec PR #7446. These complement existing INT4/UINT4 support with similar packed representation.

Changes

Enums (_enums.py)

Added INT2=25, UINT2=26 enum values
Added ml_dtypes.int2/uint2 mappings and type properties (bitwidth=2, signed/unsigned)
Added short names "i2"/"u2"

Packing (_type_casting.py)

Implemented pack_2bitx4() / unpack_2bitx4() for 4-values-per-byte packing
Handles non-multiple-of-4 sizes with padding

Core (_core.py)

Updated PackedTensor to support 2-bit types (was 4-bit only)
Added INT2/UINT2 to non-numpy-native types list
Updated byte representation to pack 2-bit types

Serialization (serde.py)

Added 2-bit unpacking from raw_data
Added INT2/UINT2 to int32_data serialization paths

Torch adapters (tensor_adapters.py)

Added torch.int2/uint2 dtype mappings for future compatibility
PyTorch doesn't yet support creating tensors with these types

Example

import numpy as np
import ml_dtypes
import onnx_ir as ir

# Create and serialize INT2 tensor
array = np.array([-2, -1, 0, 1], dtype=ml_dtypes.int2)
tensor = ir.Tensor(array)
proto = ir.serde.to_proto(tensor)

# 4 elements packed into 1 byte
assert len(proto.raw_data) == 1

# Round-trip works
restored = ir.serde.from_proto(proto)
assert np.array_equal(restored.numpy(), array)

Original prompt

Helpe me create PR for this issue: #250

✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

Co-authored-by: vraspar <[email protected]>

…them yet Co-authored-by: vraspar <[email protected]>

Co-authored-by: vraspar <[email protected]>

codecov · 2025-11-11T16:09:45Z

❌ 2 Tests Failed:

Tests completed	Failed	Passed	Skipped
947	2	945	0

View the top 2 failed test(s) by shortest run time

::src.onnx_ir.serde_test

Stack Traces | 0s run time

src\onnx_ir\serde_test.py:146: in <module>
    class TensorProtoTensorTest(unittest.TestCase):
src\onnx_ir\serde_test.py:278: in TensorProtoTensorTest
    @parameterized.parameterized.expand(
.nox\test\Lib\site-packages\parameterized\parameterized.py:585: in parameterized_expand_wrapper
    raise ValueError(
E   ValueError: Parameters iterable is empty (hint: use `parameterized.expand([], skip_on_empty=True)` to skip this test when the input is empty)

src.onnx_ir.serde_test.TensorProtoTensorTest::test_tensor_proto_tensor_uint_5_INT2

Stack Traces | 0.003s run time

.nox\test_onnx_weekly\Lib\site-packages\parameterized\parameterized.py:620: in standalone_func
    return func(*(a + p.args), **p.kwargs, **kw)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
src\onnx_ir\serde_test.py:336: in test_tensor_proto_tensor_uint
    np.testing.assert_array_equal(np.from_dlpack(tensor), tensor.numpy())
                                  ^^^^^^^^^^^^^^^^^^^^^^
src\onnx_ir\serde.py:351: in __dlpack__
    return self.numpy().__dlpack__(stream=stream)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
E   BufferError: DLPack only supports signed/unsigned integers, float and complex dtypes.

To view more test analytics, go to the Test Analytics Dashboard
_{📋 Got 3 mins? Take this short survey to help us improve Test Analytics.}

justinchuby · 2025-11-11T16:10:47Z

src/onnx_ir/_core.py


    This function is used for serializing the tensor to bytes. It handles the
-    special cases for 4-bit data types and endianness.
+    special cases for 2-bit and 4-bit data types and endianness.


I realized we don’t need to handle endianess because these are subtbyte types. So endianess is irrelevant

justinchuby · 2025-12-10T18:43:58Z

@vraspar could you check the test errors? Thanks

Co-authored-by: vraspar <[email protected]>

Signed-off-by: Justin Chu <[email protected]>

Copilot AI and others added 4 commits November 10, 2025 21:32

Initial plan

1e7b2c4

Add comprehensive support for UINT2/INT2 data types

80a4f0f

Co-authored-by: vraspar <[email protected]>

Remove INT2/UINT2 from torch tensor tests as PyTorch doesn't support …

c2eebfa

…them yet Co-authored-by: vraspar <[email protected]>

Add support for UINT2/INT2 data types

74307b3

Co-authored-by: vraspar <[email protected]>

justinchuby reviewed Nov 11, 2025

View reviewed changes

vraspar changed the title ~~Copilot/fix issue 250 handling~~ Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data type Nov 27, 2025

Copilot AI and others added 3 commits December 15, 2025 20:07

Update tensors.md documentation to include INT2/UINT2 data types

aabcb44

Co-authored-by: vraspar <[email protected]>

Fix failing test cases and bug in _enums.py

773976c

Merge branch 'main' into copilot/fix-issue-250-handling

3412e41

vraspar marked this pull request as ready for review December 15, 2025 23:35

vraspar requested review from a team and titaiwangms as code owners December 15, 2025 23:35

vraspar requested a review from justinchuby December 15, 2025 23:38

Update tensor proto tests for INT2 and UINT2

9ad95f7

Signed-off-by: Justin Chu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data type #251

Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data type #251

Uh oh!

vraspar commented Nov 10, 2025

Uh oh!

codecov bot commented Nov 11, 2025 •

edited

Loading

Uh oh!

justinchuby Nov 11, 2025

Uh oh!

justinchuby commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data type #251

Are you sure you want to change the base?

Adds INT2 (2-bit signed) and UINT2 (2-bit unsigned) data type #251

Uh oh!

Conversation

vraspar commented Nov 10, 2025

Changes

Example

Uh oh!

codecov bot commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

❌ 2 Tests Failed:

Uh oh!

justinchuby Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

justinchuby commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Nov 11, 2025 •

edited

Loading