Pull requests: NVIDIA/Fuser
Lower stream-parallelized LinearOp into Host IR AG+GEMM overlap algo
#3736 opened Jan 20, 2025 by samnordmann
[Do not merge] Overlap benchmark: AG+GEMM distributed matmul with HostIr and ParallelType::Stream
#3719 opened Jan 16, 2025 by samnordmann
In the permissive bfs traversal, don't allow reverse traversal
#3717 opened Jan 16, 2025 by naoyam
Split Hopper MMA by warp-tile before instruction tile
Label: on hold (This issue should be revisited in the future)
#3642 opened Dec 24, 2024 by jacobhinkle
Support outer reduction scheduler with SOL autotuning
Label: Autotune (Generate heuristics through machine learning models.)
#3618 opened Dec 19, 2024 by rdspring1
[wgmma] Insert commit_group and wait_group after mma_async
Label: Matmuls
#3573 opened Dec 11, 2024 by jacobhinkle • Draft