-
Notifications
You must be signed in to change notification settings - Fork 78
Remove an unnecessary corner case #5934
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
!test |
|
Review updated until commit 8dec3f9 Description
|
| Relevant files | |||
|---|---|---|---|
| Enhancement |
|
PR Reviewer Guide
Here are some key observations to aid the review process:
| 🧪 PR contains tests |
| ⚡ Recommended focus areas for review |
Corner Case Removal Validation
|
Test failures
-
(Medium, 3)
Shape mismatch in Thunder nvFuser higher-order inplace alias update testsTest Name A100 GB200 H100 Source thunder.tests.test_update_aliases.test_higher_order_inplace_alias_update_nvfuser_cuda_thunder.dtypes.float32 ❌ ❌ ❌ -
(Medium, 1)
Scalar numerical mismatch in thunder nanogpt NVFuser CUDA test_networksTest Name H100 Source thunder.tests.test_networks.test_nanogpt_complete_autograd_nvfuser_cuda_thunder.dtypes.float32 ❌
Greptile OverviewGreptile Summary
Confidence Score: 5/5
Important Files Changed
Sequence DiagramsequenceDiagram
participant Caller as Scheduler/Heuristic
participant Mapper as ContiguousInnerDimensionsMapper
participant TV as TensorView
participant TD as TensorDomain
participant Dep as DependencyCheck
participant IR as SimplifyingIrBuilder
Caller->>Mapper: getContigMergeOfInnerSize(tv)
Mapper->>TV: getMaybeAllocationDomain()
Mapper->>Mapper: mappedLogicalIds(tv)
Mapper->>TD: noReductions(alloc)
Mapper->>TV: domain()->contiguity()
alt contiguity includes reductions
Mapper->>Mapper: filter contiguity to alloc_no_reductions
end
Mapper->>Mapper: assert alloc_no_reductions.size == contiguity.size
loop iterate inner alloc dims (reverse)
Mapper->>Mapper: skip if extent==1 or broadcast
Mapper->>Dep: getAllExprsBetween(logicalDomain, alloc_iid)
Mapper->>Mapper: validate only device splits
Mapper->>IR: divExpr(getProjectedExtent, num_devices)
Mapper->>IR: mulExpr(product, sharded_extent)
end
Mapper-->>Caller: simplifyExpr(product_of_inner_extents)
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
1 file reviewed, 1 comment
Additional Comments (1)
Removing the |
|
!test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
1 file reviewed, no comments
No description provided.